Senior Site Reliability Engineer Kubernetes Platform (m/f/x)
Start: immediately
Permanent role
Full-time (40 h/week)
100% remote throughout Germany or based in Berlin
Your Mission
As a valued member of our MetaKube team, you will help shape the technological vision and scalability of our managed Kubernetes platform. You bring a deep passion for modern platform engineering, treating infrastructure as a software engineering endeavor. This spans everything from server provisioning and K8s operators all the way to GPU integration. Drawing on your extensive experience, you will gladly mentor our newer colleagues and take true ownership of the product. Armed with technological foresight and fresh perspectives, you will inspire the team as we work together to elevate our open source platform to the next level.
Details
Your responsibilities:
- Ensuring the smooth and reliable operation of our Kubernetes SaaS platform across hundreds of active clusters
- Translating complex system architectures into code, dedicating roughly 30 to 40 percent of your time to software development in Go, Terraform, Ansible, and Bash
- Developing and automating infrastructure components, covering everything from server provisioning and Kubernetes operators all the way to seamless GPU integration
- Optimizing and strategically expanding our observability platform built around the Prometheus stack
- Overseeing release management, continuously enhancing our CI/CD pipelines and test automation, and troubleshooting intricate customer setups as part of our second- and third-tier support
- Participating in our on-call rotation, which includes additional compensation once you have successfully completed your probationary period
What you bring:
- Several years of hands-on experience running and scaling highly available Kubernetes clusters in production environments
- Extensive knowledge of Linux system administration coupled with a solid grasp of Layer 3/4 and Layer 7 network protocols
- Substantial development experience using Go, alongside a strong command of automation tools like Ansible and Terraform
- Ideally, a profound understanding of observability stacks such as Prometheus, Loki, and Mimir, as well as Kubernetes operators
- A strong collaborative spirit with the ability to inspire others through fresh ideas while providing reliable technical guidance
- Good German language skills at a B1 – B2 level paired with excellent English proficiency
What to expect at SysEleven:
You can look forward to joining a genuine tech team with deeply rooted open source DNA, where the best technical solution always wins out, regardless of hierarchy. We place a premium on technological excellence and a strong hands-on mindset, fostering an open and straightforward exchange of knowledge among peers. Our streamlined decision-making processes give you the freedom to champion your own ideas, take real ownership, and actively leverage open source technologies all the way down to deep platform development.
Sounds interesting?
Then I look forward to hearing from you: