Join our team dedicated to driving operational excellence and reliability through advanced monitoring and observability practices. As a key member of the DevOps and engineering community, your mission is to ensure system transparency, performance, and resilience by embedding observability across the technology stack.
Job Responsibilities:
Design, implement, and maintain comprehensive monitoring solutions using Splunk, Grafana Stack (Grafana, Loki, Tempo), Prometheus, and Apica to ensure system reliability and performance transparency
Configure and optimize alerting workflows with PagerDuty for fast detection, efficient triage, and proactive incident management
Build and maintain actionable dashboards and KPIs that deliver insights into service health, user experience, and infrastructure performance
Develop scripts and integrations to automate data ingestion, correlation, and analysis across various monitoring sources
Continuously improve monitoring coverage and observability maturity; collaborate with teams to define SLIs/SLOs and identify opportunities for tuning
Explore new tools and techniques to enhance observability and automate monitoring workflows, leveraging AI-assisted operations where applicable
Partner closely with engineering, DevOps, and security teams to ensure observability is embedded across the full stack and aligned with organizational goals
Requirements:
Experience in monitoring and observability using: Splunk, Grafana, Loki, Tempo, Prometheus, Apica, and PagerDuty
Experience with infrastructure technologies such as: AWS, Kubernetes, and Docker
Proficiency in automation and configuration tools: Terraform, Ansible, Python
Experience with collaboration and communication tools: Slack, Confluence, Jira
Proficiency in programming languages: Python, Java, JavaScript, TypeScript
Strong analytical and problem-solving skills with attention to detail
Demonstrated DevOps mindset and ability to work collaboratively across teams
Fluent English
Strong communication skills and a proactive approach to continuous improvement
Nice to have:
Additional languages are a plus
What we offer:
Experience working in a young and international atmosphere, with colleagues on all 5 continents
Access a variety of training courses and continuously improve your skills
Take part in the events organized by our team: work socials, team building events... moments you won't want to miss!