This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We’re looking for a Senior Engineer to join our Core Platform Service team, someone who thrives on building reliable, scalable, and automated infrastructure. You’ll play a key role in rolling out, operating, and standardizing core platform capabilities, including EKS clusters, HashiCorp Vault for secret management, and Argo for continuous delivery and workflows. This is a hands-on engineering role where you’ll design and implement platform services that empower application teams to deploy, run, and secure workloads reliably at scale.
Job Responsibility:
Design, deploy, and manage production workloads on AWS (Mainly Compute - EKS, EC2 , Lambdas)
Lead and operate EKS clusters across multiple environments, ensuring scalability, performance, and reliability
Implement and maintain automation, monitoring, and alerting using tools like Terraform, Grafana, Prometheus, and Datadog
Manage Linux-based infrastructure, including performance tuning, debugging, and kernel-level analysis
Roll out and standardize ArgoCD and Argo Workflows as part of our GitOps and automation strategy
Collaborate with development teams to design and operate microservices and event-driven architectures at scale
Troubleshoot incidents, drive root-cause analysis, and contribute to postmortems
Design, deploy, and manage HashiCorp Vault, implementing secret management, access policies, and integrations with workloads on EKS
Requirements:
Solid understanding of Linux systems administration, networking (DNS, TLS/SSL, HTTP), and container fundamentals
Experience in designing multi-cluster EKS architectures or hybrid Kubernetes setups
Familiarity with RBAC design, OIDC authentication, and Vault secret injection into workloads
Experience designing distributed, event-driven systems and microservice architectures
Familiarity with SRE practices, monitoring, automation, release engineering, and incident response
Awareness of cloud security best practices and common threat mitigations
Proficiency with Terraform and Helm for infrastructure and application automation
Scripting or programming experience (Go preferred, Python or Bash also acceptable)