Senior Platform Engineer
About the role
You will contribute to the storage, compute and network infrastructure components. Participate in incident management and ensuring security (all layers of the stack, patching, vulnerability scanning, IDS/IPS/SIEM) of the platform.
You are part of the team that owns our Data & Compute Platform which is backbone of our Multimodal AI & Data Analytics platform, our R&D infrastructure and critical business applications.
You will work in close collaboration with computational biologists, machine learning scientists and software engineers. We apply modern engineering practices to build reliable, maintainable, scalable, and secure systems. Your contributions will enable data-centric AI and drive our mission to diagnose and treat cancer at its most actionable and early stages.
Further, you will focus on the following components:
Reliability and Scalability
You develop and extend the building blocks of our Infrastructure as Code (IaC) terraform codebase, improve reliability and scalability of our AI and ML platform, monitor and troubleshoot infrastructure issues, including performance and availability while ensuring plans for disaster recovery and business continuity paths
Collaboration
You will work with teams offering platform services to design and implement features and services to support efficient ML training and serving workloads on orchestration technologies like Kubernetes. You will also work with development teams to optimize infrastructure and application performance. You will also collaborate with Security office to ensure the platform meets high standards of security and compliance requirements
Growth
We will help you to keep up to date with industry best practices. You will support evaluation of new emerging tools/technologies in cloud and ML infrastructure space. In the process you will also have opportunity to learn the machine learning necessary to optimize training performance across multiple scales of hardware
Our Technology
We work with Terraform to define our Azure cloud resources and Atlantis for deploying cloud resources through GitHub pull requests. We also use kustomize and helm for defining our application's configuration as code deployed to Azure Kubernetes service. We make use of GitHub actions and Pants (build system) for our CI and ArgoCD to manage our applications deployment following GitOps approach and depend on multiple CNCF open-source tools to provide platform services to our developers and users.
You have
- 5+ years of experience managing production infrastructure, automation, and software engineering
- Bachelor’s degree or similar in computer science or comparable experience in related technical field
- Good understanding of Linux internals and working of containerization technologies
- Demonstrate strong understanding of cloud-based storage services and distributed storage systems.
- Experience building and managing secure multi-tenant Kubernetes cluster.
- Proficient in at least one high-level programming language like Python, Go, Rust etc.
- Hands-on using IaC tools like Terraform, Pulumi etc. for managing cloud infrastructure
- Working experience with configuration management tools like Helm, kustomize, jsonnet etc.
- Demonstrate strong analytical and troubleshooting skills with large-scale distributed systems.
- Effective communication and collaboration skills and a powerful desire to own and drive your work.
This position is a full-time role. The applicant needs to be a resident in either The Netherlands or Switzerland, have a valid work permit and be within commutable distance from our office in Amsterdam or Zürich. Given the nature of Kaiko’s business and the fact that it deals with sensitive data, a Certificate of Conduct will be required upon finalizing the employment contract.
What can you expect?
At Kaiko we have an open, creative and non-hierarchical work atmosphere which offers continuous learning and direct impact in return for accountability and team spirit.
We offer flexibility - for instance, through remote working – alongside an expectation for managing and delivering your own goals; our team’s ownership, passion and shared commitment to improving health outcomes through data is something that sets us apart.
At the intersection of healthcare and data we recognize the implications on wellbeing and trust and approach our work with the utmost sensitivity. Data privacy, compliance and security are core to everything we do. Our open, creative environment gives talented people room to explore new ideas and we reward this with an attractive package and opportunities for further personal development.
If you're ready to take on a challenging and rewarding role in a fast-paced start-up environment, we want to hear from you! Send us your application online (www.kaiko.ai) and we will contact you as soon as possible. At Kaiko we welcome everyone with equal enthusiasm. Should you have any questions, please do not hesitate to contact us at recruitment@kaiko.ai
- Department
- Platform Engineering
- Role
- Storage & Compute
- Locations
- Amsterdam (NKI-AvL), Zürich (Puls 5)
- Remote status
- Hybrid Remote
Senior Platform Engineer
Loading application form
Already working at Kaiko?
Let’s recruit together and find your next colleague.