Senior ML Platform Engineer
Are you passionate about how technology can make a real impact in cancer? Join us at kaiko.ai in building the state-of-the-art Data & AI platform, enabling large-scale training of multi-modal foundation models, and transforming the clinical workflow to deliver better patient outcomes.
Our culture
At Kaiko, we have an open, creative and non-hierarchical work atmosphere which offers continuous learning and direct impact in return for accountability and team spirit.
We offer flexibility - for instance, through remote working – alongside an expectation for managing and delivering your own goals; our team’s ownership, passion and shared commitment to improving health outcomes through data is something that sets us apart.
At the intersection of healthcare and data we recognize the implications on wellbeing and trust and approach our work with the utmost sensitivity. Data privacy, compliance and security are core to everything we do. Our open, creative environment gives talented people room to explore new ideas and we reward this with an attractive package and opportunities for further personal development.
About the role
We are seeking a highly skilled Senior ML Platform Engineer with a passion for building scalable ML platform and ensuring a high-availability experience to empower our AI research team in their daily work. You'll play a vital role in making our ambitious AI healthcare solutions a practical reality. This exciting role will be based in either The Netherlands or Switzerland.
Your responsibilities
- Design and build Kaiko’s multi-tenant machine learning platform, including our large-scale distributed training systems;
- Create robust distributed training and inference solutions for maximum computational efficiency;
- Implement and maintain workflows and tools (CI/CD, containerization, orchestration, monitoring, logging and alerting systems) for our large training runs;
- Collaborate with AI/ML researchers to develop and implement solutions that enable safe and reproducible model-training experiments;
- Ensure compliance with security best practices and industry standards.
Qualifications/requirements:
- 3+ years of experience building production ML platform and systems;
- Experience building and optimizing latency and throughput of machine learning systems and GPU workloads;
- Hands-on experience with distributed training frameworks (e.g. Ray, Dask, PyTorch Lightning);
- Experience with at least one cloud platform (e.g. AWS, Azure or Google Cloud);
- Strong coding skills in at least one programming language (e.g. Python, Scala, Java, C++);
- Excellent problem-solving and communication skills;
- Self-motivated and able to work well in a fast-paced startup environment.
Nice to have:
- Track record of successfully scaling ML platform;
- Fundamentals of modern Deep Learning;
- Experience with CI/CD tools (e.g. GitLab CI/CD, Github Actions or CircleCI), containerization (e.g. Docker) and orchestration tools (e.g. Kubernetes, Helm, Kustomize);
- Knowledge of monitoring, logging, alerting and observability tools (e.g. Prometheus, Grafana, ELK Stack or Datadog);
- Familiarity with infrastructure-as-code tools (e.g. Terraform, CloudFormation or Pulumi);
- Understanding of networking, security, and system administration concepts;
- Experience of high-performance computing (HPC) systems and workload managers (Slurm).
This Senior ML Platform Engineer position is a full-time role. It is important for the applicant to be a resident in The Netherlands or Switzerland, have a valid work permit and preferably be within commutable distance from our offices in Amsterdam or Zürich. Given the nature of Kaiko’s business and the fact that it deals with sensitive data, a Certificate of Conduct will be required upon finalizing the employment contract.
Our offer
- An inspirational, extremely talented and internationally diverse team which ‘builds a submarine whilst also operating it’ and loves doing that
- A unique working experience in a fast-growing company which intends to revolutionize healthcare
- Autonomy, flexibility and the opportunity to do your work in the way it works best to you, as long as you deliver on your responsibilities
- An attractive and competitive salary, a good pension plan and 25 vacation days per year
- We value your personal & professional development; together we decide how we can support your growth.
Want to join Kaiko as Senior ML Platform Engineer? Send us your application online and we will contact you as soon as possible. At Kaiko we welcome everyone with equal enthusiasm. Should you have any questions, please do not hesitate to contact us at recruitment@kaiko.ai and/or visit Kaiko online at www.kaiko.ai
- Department
- Platform Engineering
- Locations
- Amsterdam (NKI-AvL), Zürich (Puls 5)
- Remote status
- Hybrid Remote
Senior ML Platform Engineer
Loading application form
Already working at Kaiko?
Let’s recruit together and find your next colleague.