CrawlJobs Logo

Senior Platform Engineer - Cloud & Infrastructure

ZenML

Location Icon

Location:
Germany , Munich

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Architect the Infrastructure of MLOps. This is a unique hybrid role where you won't just be maintaining internal clusters; you will be building core product features AND helping our most advanced customers architect their MLOps stacks.

Job Responsibility:

  • Build 'Infra-Heavy' Product Features like native schedulers and workload manager
  • Own the ZenML Pro (SaaS) Infrastructure ensuring resilience, scalability, and security
  • Enterprise Architecture & PoCs for complex customer deployments
  • Developer Experience by abstracting Kubernetes complexity from Data Scientists

Requirements:

  • Deep knowledge of Kubernetes (CKA level)
  • Experience with Docker, Terraform, Helm
  • Proficiency in Python and likely Go
  • Experience with AWS (EKS), GCP (GKE), Azure (AKS)
  • Experience with PostgreSQL, SQLModel, FastAPI
  • Infrastructure as Code (IaC) mastery
  • Ability to write production-quality code
  • Customer empathy and communication skills
  • Problem-solving skills for complex deployments
What we offer:
  • Inspiring international team
  • Genuine connection & lots of fun with team events
  • Annual company offsite
  • Office in the heart of Munich
  • Flexible hours & trust-based work
  • Remote-friendly culture
  • Competitive compensation

Additional Information:

Job Posted:
January 16, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Platform Engineer - Cloud & Infrastructure

Senior Engineering Manager, Search Infrastructure

Atlassians have flexibility in where they work; The Search Platform team is resp...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience managing high performing, software engineering teams running core services at scale
  • Deep technical experience building and scaling search applications and distributed systems using large amounts of data on cloud platforms, preferably AWS
  • Expert level knowledge and understanding of low-latency distributed data management and query processing systems including Lucene based stacks will be strongly preferred
  • Proven track record of consistent execution delivering outsized results with strong operational rigour
  • Strong organisation and communication skills with the ability to drive clarity in an ambiguous environment
  • Ability to hire, onboard, and retain top talent for your team and foster a culture of innovation, collaboration, and excellence
  • Passion for mentoring and coaching your team members on best practices, code quality, design patterns, testing and operational skills
  • Focus on business outcomes and the 80/20 rule
  • Proactive approach and a desire to innovate in a large, fast-paced organisation
Job Responsibility
Job Responsibility
  • Own a part of the mission for the overall Search Platform team
  • Responsible for building the highest performing teams
  • Develop and work closely with senior engineers to drive technical solutions and architecture
  • Act as a role model for continuously upgrading deep technical skills, engineering judgment and operational rigour
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Internship - Cloud Platform Engineering

The Greenlake Platform Cloud Infrastructure Engineer (PIE) team is looking for a...
Location
Location
Ireland , Galway
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in or recent graduate of a degree program in Computer Science, Software Engineering, Information Technology, or related field
  • Familiarity with programming languages (Python, Java, or similar), cloud concepts, and fundamental networking is a plus
  • Demonstrates an interest in troubleshooting and problem-solving with guidance from mentors
  • Strong written and verbal communication skills, with an eagerness to ask questions and learn
Job Responsibility
Job Responsibility
  • Support the development team by helping with the design, prototyping, and implementation of cloud-based solutions, under the guidance of experienced developers
  • Apply foundational technical skills to analyze data, troubleshoot issues, and suggest improvements to existing cloud infrastructure
  • Work closely with project managers, senior developers, and other interns to help ensure smooth delivery, deployment, and operation of cloud projects
  • Learn and assist in monitoring cloud environments and tools, understanding key metrics and alerting systems to ensure system reliability and performance
  • Contribute to the documentation of code, processes, and procedures to support knowledge sharing across the team
What we offer
What we offer
  • Initial extensive onboarding to support you with adjusting to the role
  • Ongoing learning and development throughout the program
  • Be mentored by at least one senior member of the team and after two years in the program you can grow into a true professional with valuable relationships and international working experience
  • Competitive salary and great benefits
  • Great work-life balance including hybrid working and Wellness Fridays initiative
  • Fulltime
Read More
Arrow Right

Senior Director of Engineering, Infrastructure

Senior Director of Engineering role leading the Infrastructure group at PagerDut...
Location
Location
United States , San Francisco
Salary
Salary:
233000.00 - 392000.00 USD / Year
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in senior engineering leadership roles, managing multiple layers of managers
  • Significant experience as a hands-on technical contributor earlier in your career
  • Deep knowledge of modern infrastructure and software delivery: high availability, distributed systems, public cloud (AWS), microservices, containers, CI/CD pipelines, observability, and automation
  • Track record of building and scaling high-performing, inclusive engineering organizations
Job Responsibility
Job Responsibility
  • Define and drive the multi-year strategy for PagerDuty's infrastructure and platform foundations
  • Strong ownership of PagerDuty's reliability patterns and practices
  • Bar raiser for all engineering functions
  • Lead, mentor, and scale a diverse team of Engineering Managers, Senior Managers, and technical leaders across multiple geographies
  • Ensure the reliability, scalability, and security of PagerDuty's global SaaS platform
  • Partner with peers in Engineering, Product, and Security to deliver large cross-functional initiatives
  • Champion engineering excellence: CI/CD maturity, observability best practices, operational rigor, and incident readiness
  • Manage budgets, headcount, and vendor relationships to optimize infrastructure investments
  • Represent Infrastructure externally with customers and partners, and internally with executives, as a trusted voice on technical and business tradeoffs
  • Foster a culture of inclusion, accountability, collaboration, and growth
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits package
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime
Read More
Arrow Right

Senior Director of Engineering, Infrastructure

Senior Director of Engineering to lead the Infrastructure group at PagerDuty, se...
Location
Location
United States , Atlanta
Salary
Salary:
233000.00 - 392000.00 USD / Year
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in senior engineering leadership roles, managing multiple layers of managers
  • Significant experience as a hands-on technical contributor earlier in your career
  • Deep knowledge of modern infrastructure and software delivery: high availability, distributed systems, public cloud (AWS), microservices, containers, CI/CD pipelines, observability, and automation
  • Track record of building and scaling high-performing, inclusive engineering organizations
Job Responsibility
Job Responsibility
  • Define and drive the multi-year strategy for PagerDuty's infrastructure and platform foundations
  • Strong ownership of PagerDuty's reliability patterns and practices
  • Lead, mentor, and scale a diverse team of Engineering Managers, Senior Managers, and technical leaders across multiple geographies
  • Ensure the reliability, scalability, and security of PagerDuty's global SaaS platform
  • Partner with peers in Engineering, Product, and Security to deliver large cross-functional initiatives
  • Champion engineering excellence: CI/CD maturity, observability best practices, operational rigor, and incident readiness
  • Manage budgets, headcount, and vendor relationships to optimize infrastructure investments
  • Represent Infrastructure externally with customers and partners, and internally with executives
  • Foster a culture of inclusion, accountability, collaboration, and growth
What we offer
What we offer
  • Comprehensive benefits package
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Paid volunteer time off: 20 hours per year
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Cloud Platform

As a Senior Software Engineer, Cloud Platform at Chef Robotics, you'll be respon...
Location
Location
United States , San Francisco
Salary
Salary:
150000.00 - 240000.00 USD / Year
chefrobotics.ai Logo
Chef Robotics
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
  • 5+ years of professional experience in cloud infrastructure and DevOps roles
  • Expert knowledge of cloud infrastructure and deployment (AWS, GCP, or Azure)
  • Strong proficiency with containerization (Docker) and orchestration (Kubernetes) technologies
  • Extensive experience with CI/CD practices and infrastructure-as-code principles
  • Experience with system monitoring, logging, and performance optimization
  • Understanding of secure data pipeline design and implementation
  • Understanding of infrastructure requirements for robotics or automation systems
  • Experience with real-time or near-real-time systems and cloud architecture
  • Background in developing reliable systems with high availability requirements
Job Responsibility
Job Responsibility
  • Design and implement cloud infrastructure to support robotics platform deployment and operations
  • Provision robots for seamless deployment across diverse customer environments
  • Enable remote software updates to enhance performance and reliability of deployed systems
  • Implement containerization (Docker) and orchestration (Kubernetes) for scalable deployments
  • Manage cloud infrastructure across AWS, GCP, or Azure platforms
  • Improve the performance and reliability of cloud services supporting the Chef system
  • Implement fault-tolerant design patterns to ensure reliability in production environments
  • Establish performance benchmarks and optimize systems to meet latency requirements for robotics operations
  • Implement comprehensive logging, monitoring, and alerting for cloud infrastructure
  • Create diagnostic tools and dashboards for operational visibility
What we offer
What we offer
  • medical insurance
  • dental insurance
  • vision insurance
  • commuter benefits
  • flexible paid time off (PTO)
  • catered lunch
  • 401(k) matching
  • early-stage equity
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - Cloud Infrastructure

The Cloud Infrastructure Engineering team builds and manages the foundational bl...
Location
Location
Australia
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud native edge or service mesh services, preferably with envoy and Istio
  • Experience leading and shipping large scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
Job Responsibility
Job Responsibility
  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development team, and security team and partner with them to produce the SAS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior Software Engineer - Cloud Infrastructure

About ClickHouse: Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is on...
Location
Location
Singapore
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud native edge or service mesh services, preferably with envoy and Istio
  • Experience leading and shipping large scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
Job Responsibility
Job Responsibility
  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development team, and security team and partner with them to produce the SAS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior Software Engineer - Cloud Infrastructure

The Cloud Infrastructure Engineering team builds and manages the foundational bl...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud native edge or service mesh services, preferably with envoy and Istio
  • Experience leading and shipping large scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
Job Responsibility
Job Responsibility
  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development team, and security team and partner with them to produce the SAS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right