Senior Machine Learning Site Reliability Engineer Job at Prima (Milan)

Senior Machine Learning Site Reliability Engineer

Prima

Location:
Italy, Milan

Category:
IT - Software Development

Contract Type:
Not provided

Salary:
Not provided

Job Description:

Are you looking for a new challenge? Fancy helping us shape the future of motor insurance? Prima could be the place for you.

Since 2015, we’ve been using our love of data and tech to rethink motor insurance and bring drivers a great experience at a great price. Our story began in Italy, where we’ve quickly become the number one online motor insurance provider. In fact, we’re trusted by over 4 million drivers. And now we’re expanding to help millions more drivers in the UK and Spain.

To help fuel that growth, we need a Senior Machine Learning Site Reliability Engineer to join our Infrastructure team. This team is the beating heart of Prima. You’ll be joining over 300 engineers across software development, infrastructure, operations and security. Fueled by curiosity, experimentation and collaboration, you’ll help deliver scalable, impactful solutions that shape the future of insurance.

Job Responsibilities:

Hands-on Reliability & System Engineering: Design, build, and operate reliable and scalable systems by defining and monitoring SLOs/SLIs, working directly on production infrastructure, and collaborating closely with software engineers on system design and reliability improvements
Automation, Operations & Incident Response: Actively develop automation for infrastructure and operational workflows to eliminate toil and reduce MTTR, participate in and lead incident response, and drive blameless post-incident reviews with concrete follow-ups implemented in code and tooling
Performance, Capacity & Security: Continuously analyze and optimize system performance and cost, provide data, insights, and recommendations to inform capacity planning, and support security best practices through hands-on vulnerability remediation and threat mitigation

Requirements:

SRE & Cloud Engineering: Hands-on experience with SRE practices in production, strong AWS expertise, Kubernetes, networking, DNS, and Infrastructure as Code (Pulumi preferred, Terraform a plus)
Automation, Software Engineering and MLOps: Strong software engineering fundamentals with an emphasis on clean, well-structured, and maintainable code, including solid Python proficiency, deep knowledge of the Python ecosystem (testing, debugging, packaging), and hands-on experience with PySpark. Familiarity with MLOps practices such as model registries, model versioning, retraining workflows, and end-to-end deployment lifecycles is also expected
Reliability, Data & Operations: Experience leading incident response and root cause analyses (RCAs), improving system reliability, and engaging stakeholders to propose solutions, share learnings, and mentor others

Nice to have:

Regulated Environments & Security: Experience operating in highly regulated industries (e.g. Insurance, Banking, Healthcare), managing sensitive data, and supporting secure networking setups, including exposure to security technologies such as Cloudflare
Distributed Systems & Microservices: Strong understanding of microservices architectures, their principles and trade-offs, with the ability to troubleshoot and maintain distributed systems and supporting technologies (RabbitMQ, Kafka, PostgreSQL, Redis)
Observability & Platform Operations: Hands-on experience with Datadog for platform and application monitoring, performance optimisation, and solid fundamentals in database structures and operational troubleshooting, with exposure to systems built in languages such as Rust and Elixir

What we offer:

Grow with us: access to learning resources, mentorship and a growth plan tailored to you
Thrive and perform: private healthcare, gym discounts, wellbeing programs and mental health support

Additional Information:

Job Posted:
January 20, 2026

Employment Type:
Full-time

Work Type:
Remote work
