We are seeking a talented Platform Software Engineer to join the team building the Cerebras Inference Platform. You will be instrumental in designing, developing, and operating the core backend services and APIs that power the Inference platform. You'll build the software that allows customers to seamlessly deploy, manage, and serve inference workloads on dedicated Cerebras hardware.
Job Responsibilities:
Set technical direction for the observability platform
Build telemetry pipelines that handle high-cardinality, high-frequency data
Drive reliability across the organization by defining SLOs and building alerting strategies
Bridge hardware and software observability
Shape developer experience by designing instrumentation libraries and standards
Mentor and grow engineers
Requirements:
8+ years of software engineering experience
4+ years building or operating observability/monitoring platforms at significant scale (millions of active time series, petabytes of log data)
Deep expertise in the open-source observability ecosystem (Prometheus, Thanos/Cortex/Mimir, Elasticsearch/ClickHouse, or Loki)
Experience with OpenTelemetry for instrumentation across a polyglot services environment
Proficiency in Go preferred, with strong experience in Python
Strong distributed systems and Kubernetes expertise
Experience with observability cost management and capacity planning at scale
Track record of setting technical direction and driving adoption across multiple teams
What we offer:
Build a breakthrough AI platform beyond the constraints of the GPU
Publish and open-source cutting-edge AI research
Work on one of the fastest AI supercomputers in the world
Enjoy job stability with startup vitality
Simple, non-corporate work culture that respects individual beliefs