CrawlJobs Logo

Lead MLOps Engineer

https://www.randstad.com Logo

Randstad

Location Icon

Location:
United Kingdom , London

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

80000.00 - 150000.00 GBP / Year

Job Description:

This is a high-impact role within a fast-growing AI and robotics organisation focused on building advanced, scalable intelligent systems for real-world industrial applications. The position owns the machine learning infrastructure and MLOps foundations as products, platforms, and teams scale. You will play a key role in transforming machine learning prototypes into reliable production systems, defining pragmatic engineering standards, and enabling fast, safe delivery of ML-powered capabilities. The role combines hands-on engineering, architectural ownership, and close collaboration with engineering and product teams.

Job Responsibility:

  • Own and scale the organisation's ML infrastructure and MLOps foundations
  • Design pragmatic, production-ready system architectures that balance speed, reliability, and cost
  • Build and maintain CI/CD pipelines for ML workflows and application delivery
  • Productionise ML models including training, evaluation, deployment, monitoring, and rollback strategies
  • Ensure reliability, observability, security, and performance across ML systems
  • Automate infrastructure provisioning, deployments, and environment management using cloud-native tooling
  • Partner closely with ML engineers, software engineers, and product teams to deliver ML features end-to-end
  • Act as a technical leader through design reviews, mentorship, and by establishing engineering best practices

Requirements:

  • Staff or lead-level experience in MLOps, DevOps, or Infrastructure Engineering, ideally within high-growth or startup environments
  • Strong Python skills with hands-on experience using modern ML frameworks (e.g., PyTorch, TensorFlow, or similar)
  • Experience working with major cloud platforms (AWS, GCP, or Azure)
  • Proven production experience with Docker and Kubernetes
  • Strong understanding of CI/CD systems (e.g., GitHub Actions, GitLab CI, ArgoCD)
  • Experience with Infrastructure as Code tools such as Terraform and Helm
  • Solid understanding of data engineering fundamentals and ML lifecycle management
  • Ability to design scalable systems without unnecessary complexity
  • Strong debugging and problem-solving skills in distributed systems
  • Ownership mindset with excellent communication and cross-functional collaboration skills
What we offer:
  • Competitive salary and equity participation
  • Paid vacation in line with local labour regulations
  • Opportunities for international collaboration and travel
  • Office benefits including meals, snacks, and team events

Additional Information:

Job Posted:
February 22, 2026

Expiration:
February 28, 2026

Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Lead MLOps Engineer

Senior Data & AI/ML Engineer - GCP Specialization Lead

We are on a bold mission to create the best software services offering in the wo...
Location
Location
United States , Menlo Park
Salary
Salary:
Not provided
techjays.com Logo
techjays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • GCP Services: BigQuery, Dataflow, Pub/Sub, Vertex AI
  • ML Engineering: End-to-end ML pipelines using Vertex AI / Kubeflow
  • Programming: Python & SQL
  • MLOps: CI/CD for ML, Model deployment & monitoring
  • Infrastructure-as-Code: Terraform
  • Data Engineering: ETL/ELT, real-time & batch pipelines
  • AI/ML Tools: TensorFlow, scikit-learn, XGBoost
  • Min Experience: 10+ Years
Job Responsibility
Job Responsibility
  • Design and implement data architectures for real-time and batch pipelines, leveraging GCP services such as BigQuery, Dataflow, Dataproc, Pub/Sub, Vertex AI, and Cloud Storage
  • Lead the development of ML pipelines, from feature engineering to model training and deployment using Vertex AI, AI Platform, and Kubeflow Pipelines
  • Collaborate with data scientists to operationalize ML models and support MLOps practices using Cloud Functions, CI/CD, and Model Registry
  • Define and implement data governance, lineage, monitoring, and quality frameworks
  • Build and document GCP-native solutions and architectures that can be used for case studies and specialization submissions
  • Lead client-facing PoCs or MVPs to showcase AI/ML capabilities using GCP
  • Contribute to building repeatable solution accelerators in Data & AI/ML
  • Work with the leadership team to align with Google Cloud Partner Program metrics
  • Mentor engineers and data scientists toward achieving GCP certifications, especially in Data Engineering and Machine Learning
  • Organize and lead internal GCP AI/ML enablement sessions
What we offer
What we offer
  • Best in class packages
  • Paid holidays and flexible paid time away
  • Casual dress code & flexible working environment
  • Medical Insurance covering self & family up to 4 lakhs per person
Read More
Arrow Right

Lead Golang Software Engineer

We are Citi’s Application, Platform and Engineering team, a start-up with the ex...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep hands on knowledge of Kubernetes
  • Fluency in Golang
  • Experience designing control and sandboxing systems for AI experimentation
  • Experience maintaining and/or contributing to bug bounty and responsible disclosure programs
  • Understanding of language models and transformers
  • Rich understanding of vector stores and search algorithms
  • Large-scale ETL development
  • Direct engineering experience of high performance, large-scale ML systems
  • Hands on MLOps experience
  • Have experience supporting fast-paced startup engineering teams
Job Responsibility
Job Responsibility
  • Lead the 0-1 build of multiple AI products
  • Design and build high-quality, highly reliable products with user experience at the centre
  • Be responsible for engineering innovative, best in class AI platforms for the bank
  • Creating firsts in the Generative AI space for Citi
  • Continually iterate and scale Generative AI products
  • Mentor and nurture other engineers
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • Discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Fulltime
Read More
Arrow Right

AIML Lead Engineer

We build breakthrough software products that power digital businesses. We are an...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
3pillarglobal.com Logo
3Pillar Global
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of total IT experience
  • at least 4+ years in AI/ML development
  • Strong proficiency in Python and ML frameworks (TensorFlow, PyTorch, Scikit-Learn)
  • Experience with NLP libraries such as spaCy, Hugging Face Transformers, etc.
  • Solid understanding of AI/ML algorithms, data preprocessing, and model evaluation techniques
  • Hands-on experience with Generative AI, LLMs, and Agentic AI
  • Working knowledge of MLOps tools and CI/CD pipelines for AI model deployment
  • Familiarity with computer vision frameworks (OpenCV, etc.)
  • Excellent problem-solving and communication skills
  • Ability to lead and mentor junior engineers
Job Responsibility
Job Responsibility
  • Lead the design and implementation of AI/ML models and solutions for complex business problems
  • Work on NLP, LLMs, and Generative AI to build intelligent systems and conversational agents
  • Develop and optimize models using Python, TensorFlow, PyTorch, and Scikit-Learn
  • Apply deep learning and transformer-based architectures (e.g., BERT, GPT, etc.) for NLP and vision tasks
  • Implement computer vision solutions using OpenCV and related tools
  • Collaborate with cross-functional teams to integrate AI models into production systems
  • Apply MLOps best practices and manage CI/CD pipelines for model deployment
  • Stay updated with the latest AI research, LLM, and Agentic AI trends, and drive innovation across teams
  • Fulltime
Read More
Arrow Right

Lead Software Engineer

Prism Data is building the future of credit scoring with modern technology and d...
Location
Location
United States , NYC or San Diego (La Jolla/UTC)
Salary
Salary:
160000.00 - 195000.00 USD / Year
prismdata.com Logo
Prism Data
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Software engineering experience, ideally in a high-growth or early stage startup
  • Strong expertise in modern software practices and technologies, with ability to quickly adapt to Prism’s services stack
  • Deep hands-on experience with Python, Kubernetes, and AWS, with specific experience supporting machine-learning models from prototyping through production operations
  • Proactive bias-to-solve technical problems and navigate complex design decisions
  • Excellent communication skills with the ability to bridge gaps between technical teams and non-technical stakeholders
Job Responsibility
Job Responsibility
  • Contribute to Prism’s engineering culture by regularly mentoring other engineers and facilitating knowledge sharing, learning, and continuous improvement
  • Advance an architectural runway to support product extensibility and scalability while balancing sustainability
  • Architect and lead the enhancement of production ML-serving infrastructure, including continuous advancement of MLOps capabilities
  • Design, build, and operate enterprise-grade APIs and other platform services, and lead strategic co-development opportunities with key partners and data providers
  • Drive technical direction across platform and model-serving layers, ensuring observability, security, and performance at scale
  • Partner cross-functionally with product, legal, and go-to-market teams to cohesively develop new capabilities that expand cash flow analytics use cases
What we offer
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • equity-based compensation
  • Fulltime
Read More
Arrow Right

Lead AI ML Engineer

We build breakthrough software products that power digital businesses. We are an...
Location
Location
India , Noida
Salary
Salary:
Not provided
3pillarglobal.com Logo
3Pillar Global
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of total IT experience
  • At least 4+ years in AI/ML development
  • Strong proficiency in Python and ML frameworks (TensorFlow, PyTorch, Scikit-Learn)
  • Experience with NLP libraries such as spaCy, Hugging Face Transformers
  • Solid understanding of AI/ML algorithms, data preprocessing, and model evaluation techniques
  • Hands-on experience with Generative AI, LLMs, and Agentic AI
  • Working knowledge of MLOps tools and CI/CD pipelines for AI model deployment
  • Familiarity with computer vision frameworks (OpenCV)
  • Excellent problem-solving and communication skills
  • Ability to lead and mentor junior engineers
Job Responsibility
Job Responsibility
  • Lead the design and implementation of AI/ML models and solutions for complex business problems
  • Work on NLP, LLMs, and Generative AI to build intelligent systems and conversational agents
  • Develop and optimize models using Python, TensorFlow, PyTorch, and Scikit-Learn
  • Apply deep learning and transformer-based architectures (e.g., BERT, GPT) for NLP and vision tasks
  • Implement computer vision solutions using OpenCV and related tools
  • Collaborate with cross-functional teams to integrate AI models into production systems
  • Apply MLOps best practices and manage CI/CD pipelines for model deployment
  • Stay updated with the latest AI research, LLM, and Agentic AI trends, and drive innovation across teams
  • Fulltime
Read More
Arrow Right

Senior/Architect Data Engineer

We are seeking a highly skilled and experienced Senior/Architect Data Engineer t...
Location
Location
Poland , Warsaw; Poznań; Lublin; Katowice; Rzeszów
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience architecting solutions on the Databricks Lakehouse using Unity Catalog, Delta Lake, MLflow, Model Serving, Feature Store, AutoML, and Databricks Workflows
  • Expertise in real-time/low latency model serving architectures with auto-scaling, confidence-based routing, and A/B testing
  • Strong knowledge of cloud security and governance on Azure or AWS, including Azure AD/AWS IAM, encryption, audit trails, and compliance frameworks
  • Hands-on MLOps skills across experiment tracking, model registry/versioning, drift monitoring, automated retraining, and production rollout strategies
  • Proficiency in Python and Databricks native tooling, with practical integration of REST APIs/SDKs and Databricks SQL in analytics products
  • Familiarity with React dashboards and human-in-the-loop operational workflows for ML and data quality validation
  • Demonstrated ability to optimize performance, reliability, and cost for large-scale analytics/ML platforms with strong observability
  • Experience leading multi-phase implementations with clear success metrics, risk management, documentation, and training/change management
  • Domain knowledge in telemetry, time series, or industrial data (aerospace a plus) and prior work with agentic patterns on Mosaic AI
  • Databricks certifications and experience in enterprise deployments of the platform are preferred
Job Responsibility
Job Responsibility
  • Lead the design and implementation of a Databricks-centric multi-agent processing engine
  • Design governed data ingestion, storage, and real-time processing workflows using Delta Lake, Structured Streaming, and Databricks Workflows
  • Own the model lifecycle with MLflow, including experiment tracking, registry/versioning, A/B testing, drift monitoring, and automated retraining pipelines
  • Architect low latency model serving endpoints with auto-scaling and confidence-based routing for sub-second agent decisioning
  • Establish robust data governance practices with Unity Catalog, including access control, audit trails, data quality, and compliance
  • Drive performance and cost optimization strategies, including auto-scaling, spot usage, and observability dashboards
  • Define production release strategies (blue-green), monitoring and alerting mechanisms, operational runbooks, and Service Level Objectives (SLOs)
  • Partner with engineering, MLOps, and product teams to deliver human-in-the-loop workflows and dashboards
  • Lead change management, training, and knowledge transfer while managing a parallel shadow processing path
  • Plan and coordinate phased delivery, success metrics, and risk mitigation
What we offer
What we offer
  • Flexible working hours
  • Hybrid work model
  • Cafeteria system
  • Generous referral bonuses (up to PLN6,000)
  • Additional revenue sharing opportunities
  • Ongoing guidance from dedicated Team Manager
  • Tailored technical mentoring from assigned technical leader
  • Dedicated team-building budget for online and on-site team events
  • Opportunities to participate in charitable initiatives and local sports programs
  • Supportive and inclusive work culture
  • Fulltime
Read More
Arrow Right

Principal Machine Learning System Engineer

As a Principal Machine Learning Systems Engineer, you will lead the design, deve...
Location
Location
United States , Seattle; San Francisco
Salary
Salary:
190300.00 - 305600.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Lead the design, development, and deployment of scalable machine learning (ML) systems and infrastructure
  • Collaborate closely with data scientists, software engineers, and product teams
  • Optimize model performance
  • Ensure system reliability
  • Implement efficient data pipelines
  • Drive architectural decisions for high-performance computing and cloud-based ML platforms
  • Mentor junior engineers
  • Promote best practices in ML operations (MLOps)
  • Stay updated on emerging technologies
Job Responsibility
Job Responsibility
  • Translate complex ML models into production-ready solutions
  • Ensure scalability and security
  • Deliver robust, scalable, and efficient machine learning solutions that support business growth and innovation
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Generative AI Tech Lead

Provectus is an AI-first consultancy that helps global enterprises adopt Machine...
Location
Location
Salary
Salary:
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on experience in Machine Learning, Deep Learning, or NLP
  • 2+ years in a technical leadership or team lead role
  • Strong expertise with LLMs (Hugging Face, OpenAI, Anthropic) and modern NLP stacks
  • Strong hands-on experience with AWS ML ecosystem (SageMaker, Bedrock, Lambda, S3, ECS/ECR)
  • Excellent Python engineering skills and proficiency with PyTorch or TensorFlow
  • Experience building ML systems in production, not just research
  • Solid knowledge of MLOps/LLMOps tools, pipelines, and deployment best practices
  • Strong architectural thinking and ability to design scalable ML systems
  • Excellent communication skills and ability to lead cross-functional teams
  • Passion for mentoring engineers and raising the technical bar
Job Responsibility
Job Responsibility
  • Lead, mentor, and grow a team of 5–10 ML, Data, and Software Engineers
  • Define and drive the technical roadmap for ML/AI initiatives
  • Foster a high-performance culture focused on ownership, learning, and engineering excellence
  • Work closely with Product, Data, and Platform teams to deliver end-to-end AI systems
  • Design, fine-tune, and deploy LLMs and ML models for real production use cases
  • Build systems for RAG, summarization, text generation, entity extraction, and other NLP/LLM workflows
  • Explore and implement emerging GenAI/LLM techniques and infrastructure
  • Contribute across the ML stack: NLP, deep learning, CV, RL, and classical ML
  • Architect and operate scalable ML/AI systems using AWS (SageMaker, Bedrock, Lambda, S3, ECS/ECR…)
  • Optimize model training, inference pipelines, and data workflows for scale, cost, and latency
What we offer
What we offer
  • Sing-up bonus
  • 10% Annual bonus
  • Comprehensive private medical insurance or budget for your medical needs
  • Paid sick leave, vacation, and public holidays
  • Continuous learning support, including unlimited AWS certification sponsorship
Read More
Arrow Right