Senior Platform Engineer - SRE
About kaiko
In cancer care, treatment decisions can take many days—but patients don’t have that time. One of the reasons for delays? Cancer patients' data is scattered across many places: doctor’s notes, medical imagery, genomics data. At kaiko, we are developing AI foundational models to bring this data together and integrate it into clinical workflows, enabling doctors to make faster, more effective treatments decisions.
We also collaborate closely with the leading Dutch cancer research institute (NKI) on multiple AI research projects and a joint clinical validation initiative. In 2025, we plan on expanding our partnerships to even more hospitals.
We raised significant long-term funding and have offices in Zurich and Amsterdam. Over the past year, our team has nearly doubled in size, now comprising 70+ people from 25 countries.
About the role
You will be part of the Infrastructure Team and will be responsible for improving the availability and performance of our applications. Systems under your responsibility have company-wide reach, therefore company-wide interaction is to be expected. You will be working with our back-end services which handle hybrid fleet management, components that drive our advancements in AI, core services used by every team at Kaiko, networking systems and everything in between.
On the day-to-day, you will cooperate with the Infrastructure Team and the rest of Platform Organization, mentor and help junior engineers around you grow, be a force multiplier of impact. Your ultimate goal is to reduce downtime, improve system reliability, and help us enhance our systems with the latest knowledge and practices.
You will be based either in The Netherlands or Switzerland, with the expectation of spending at least 50% of your time at the office.
Some areas of responsibility
- You will be part of a fast-growing team who own the back-end services that are foundational to the whole company.
- You will create, influence and review ongoing architecture, standards and processes for systems that will help us enhance our incident and risk management.
- You will write and review code, develop documentation and debug challenging problems on (increasingly) complex systems.
- You will manage availability, latency, scalability and efficiency of Kaiko services by engineering reliability into software, systems and processes.
- You will work both on the Cloud and with state-of-the-art GPU-based HPCs and make sure that they all are in top-notch shape for our model training.
If you are a Senior Platform Engineer with a knack for SRE or, vice versa, you are a Senior Reliability Engineer that knows their way around Platform Engineering and ready to provide the best infrastructure to the Machine Learning Engineers of a fast-growing medical startup, we think you will love this job.
Why kaiko
At kaiko, we believe the best ideas come from collaboration, ownership and ambition. We’ve built a team of international experts, and your work has a direct impact. Here’s what we value:
- Ownership: You’ll have the liberty to set your own goals (in alignment with the organizational needs), make critical decisions, and see the direct impact of your work.
- Collaboration: You’ll have to approach disagreement with curiosity, build on common ground and create solutions together.
- Ambition: You’ll be surrounded by people who set high standards for themselves and others, who see obstacles as opportunities, and who are relentless in their work to create better outcomes for patients.
In addition, we offer:
- An attractive and competitive salary, a good pension plan and 25 vacation days per year.
- Great offsites and team events to strengthen the team and celebrate successes together.
- A EUR 1000 learning and development budget to help you grow.
- Autonomy to do your work the way that works best for you, whether you have a kid or prefer early mornings. We would still like to see you around for a few in-person collaborative touchpoints.
- An annual commuting subsidy.
About you
Minimum requirements:
- Proficiency in Kubernetes and its back-end components (hands-on experience in container orchestration environments).
- Proficiency in SRE practices (previously part of SRE team or personally implemented said practices within an organization).
- Proficiency in computer networking (hands-on experience with hybrid network topologies).
- Proficiency with on-premises Linux-based systems (hands-on experience with deployment and management of bare-metal servers).
Nice to have:
- Exposure to Grafana, Victoria Metrics, Loki and IRM systems.
- Exposure to Hybrid Cloud architectures and multi-tenancy.
- Exposure to NVIDIA HPC (or equivalent) and ML Ops practices.
We are excited to gather a broad range of perspectives in our team, as we believe it will help us build better products to support a broader set of people. If you’re excited about us but don’t fit every single qualification, we still encourage you to apply. We’ve had incredible team members join us who didn’t check every box!
- Department
- Platform Engineering
- Locations
- Amsterdam, Zürich (Puls 5)
- Remote status
- Hybrid
Senior Platform Engineer - SRE
Loading application form
Already working at Kaiko?
Let’s recruit together and find your next colleague.