CrawlJobs Logo

Senior ML Operations (MLOps) Engineer

eightsleep.com Logo

Eight Sleep

Location Icon

Location:

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Join our team as a Sr MLOps Engineer to help us bring current and next generations of Pod ML models to life. You'll be a part of a small team designing and implementing solutions with high levels of autonomy to bring our members better sleep. Your work will go directly to our fleet of existing Pods with low friction and direct impact to the business. We are a fast moving and fast growing company, and we embrace individuals with a growth mindset and strong desire to help us achieve our mission: Improving people's lives through optimal sleep.

Job Responsibility:

  • Pioneer Cutting-Edge Technology: Introduce and implement cutting-edge ML technologies, integrating them into our products and processes to enable the future of health monitoring
  • End-to-End Ownership: Own design and operation of robust ML infrastructure – building scalable data, model, and deployment pipelines that ensure reliable delivery of models to production
  • Cross-functional Collaboration Partner with R&D, firmware, data, and backend teams to ensure ML inference operates reliably and scales to Pods everywhere
  • Optimize for Performance: Drive cost-effective, scalable, and high-performance ML systems by optimizing compute, storage, and deployment resources across training and inference
  • Enhance Tooling and Platforms: Develop tooling, micro services, and frameworks to streamline data processing, experimentation, and deployment
  • Effective Remote Communication: Thrive in a remote work environment, ensuring clear and direct communication

Requirements:

  • 5+ years of software engineering experience with a focus on ML infrastructure, distributed systems, or large-scale data processing in Python (e.g., PyTorch, TensorFlow, or similar)
  • Hands-on experience with ML workflow orchestration and CI/CD pipelines for model deployment
  • Demonstrated success shipping ML models to production at scale, handling telemetry, monitoring, and feedback loops across large device fleets or user populations
  • Strong experience with AWS (Lambda, ECS, DynamoDB, CloudWatch) or equivalent cloud platforms for serving and monitoring ML systems
  • A fast-paced, collaborative, and iterative approach to tackling complex problems

Nice to have:

  • Expertise in real-time ML workflows and streaming systems (e.g., Kinesis, Kafka, Flink)
  • Demonstrated expertise in optimizing ML infrastructure for efficiency, latency, and cloud cost at scale
  • Understanding of secure ML operations, privacy practices, and compliance considerations, particularly for health-related or IoT data
  • Familiarity with health, wellness, or IoT domains, especially wearables or medical-grade devices
What we offer:
  • Equity participation
  • Periodic equity refreshments based on performance
  • Your own Pod
  • Full access to health, vision, and dental insurance for you and your dependents
  • Supplemental life insurance
  • Flexible PTO
  • Commuter benefits to ease your daily commute
  • Paid parental leave

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior ML Operations (MLOps) Engineer

Senior Operations Engineer

We are currently seeking an Senior Operations Engineer to join our Data Manageme...
Location
Location
Greece , Athens
Salary
Salary:
Not provided
https://www.metlengroup.com Logo
Metlen Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BSc or MSc in Computer Science or related technical field
  • +4 years of experience in Operations or IT roles (Data Engineering, ML Engineering, Software Engineering, or similar)
  • Experience in system monitoring, technical support, and incident handling
  • Hands-on experience with Cloud platforms
  • Practical exposure to MLOps and DevOps frameworks (Azure DevOps/MLOps, Docker, Kubernetes, AWS DevOps/MLOps)
  • Solid hands-on experience in SQL and Python
  • Experience managing enterprise-scale Data/ML workflows
  • Strong analytical and problem-solving abilities
  • Fluent in English, written and oral
Job Responsibility
Job Responsibility
  • Oversee and optimize DevOps and MLOps operations for model deployment, monitoring, and automation
  • Execute, maintain, and improve CI/CD pipelines for Data Engineering and ML deployments
  • Collaborate closely with Data Engineers to strengthen deployment processes and operational efficiency
  • Monitor, troubleshoot, and ensure smooth execution of daily Corporate Data Warehouse workflows
  • Handle technical support requests efficiently, ensuring SLA compliance
  • Maintain high system availability and reliability through proactive monitoring
  • Implement minor enhancements, bug fixes, and performance optimizations
  • Apply version control best practices and ensure proper deployment governance
  • Collaborate cross-functionally to streamline deployment processes across environments
  • Identify opportunities for automation, observability, and improved monitoring
What we offer
What we offer
  • Competitive remuneration package
  • Ticket Restaurant Card
  • Group Health Insurance Plan
  • Preferential Protergia household energy plan
  • Pension Plan
Read More
Arrow Right

Senior Support and Operations Engineer

Senior Support and Operations Engineer to join Data Management team, taking owne...
Location
Location
Greece , Athens
Salary
Salary:
Not provided
https://www.metlengroup.com Logo
Metlen Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BSc or MSc in Computer Science or related technical field
  • +4 years of experience in Operations or IT roles (Data Engineering, ML Engineering, Software Engineering, or similar)
  • Experience in system monitoring, technical support, and incident handling
  • Hands-on experience with Cloud platforms
  • Practical exposure to MLOps and DevOps frameworks (Azure DevOps/MLOps, Docker, Kubernetes, AWS DevOps/MLOps)
  • Experience managing CI/CD pipelines, especially in Azure is considered a plus
  • Experience with GitHub is an advantage
  • Solid hands-on experience in SQL and Python
  • Experience managing enterprise-scale Data/ML workflows
  • Strong analytical and problem-solving abilities
Job Responsibility
Job Responsibility
  • Oversee and optimize DevOps and MLOps operations for model deployment, monitoring, and automation
  • Execute, maintain, and improve CI/CD pipelines for Data Engineering and ML deployments
  • Collaborate closely with Data Engineers to strengthen deployment processes and operational efficiency
  • Monitor, troubleshoot, and ensure smooth execution of daily Corporate Data Warehouse workflows
  • Handle technical support requests efficiently, ensuring SLA compliance
  • Maintain high system availability and reliability through proactive monitoring
  • Implement minor enhancements, bug fixes, and performance optimizations
  • Apply version control best practices and ensure proper deployment governance
  • Collaborate cross-functionally to streamline deployment processes across environments
  • Identify opportunities for automation, observability, and improved monitoring
What we offer
What we offer
  • Competitive remuneration package
  • Ticket Restaurant Card
  • Group Health Insurance Plan
  • Preferential Protergia household energy plan
  • Pension Plan
  • Fulltime
Read More
Arrow Right

Senior ML Platform Engineer

At WHOOP, we're on a mission to unlock human performance and healthspan. WHOOP e...
Location
Location
United States , Boston
Salary
Salary:
150000.00 - 210000.00 USD / Year
whoop.com Logo
Whoop
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s Degree in Computer Science, Engineering, or a related field
  • or equivalent practical experience
  • 5+ years of experience in software engineering with a focus on ML infrastructure, cloud platforms, or MLOps
  • Strong programming skills in Python, with experience in building distributed systems and REST/gRPC APIs
  • Deep knowledge of cloud-native services and infrastructure-as-code (e.g., AWS CDK, Terraform, CloudFormation)
  • Hands-on experience with model deployment platforms such as AWS SageMaker, Vertex AI, or Kubernetes-based serving stacks
  • Proficiency in ML lifecycle tools (MLflow, Weights & Biases, BentoML) and containerization strategies (Docker, Kubernetes)
  • Understanding of data engineering and ingestion pipelines, with ability to interface with data lakes, feature stores, and streaming systems
  • Proven ability to work cross-functionally with Data Science, Data Platform, and Software Engineering teams, influencing decisions and driving alignment
  • Passion for AI and automation to solve real-world problems and improve operational workflows
Job Responsibility
Job Responsibility
  • Architect, build, own, and operate scalable ML infrastructure in cloud environments (e.g., AWS), optimizing for speed, observability, cost, and reproducibility
  • Create, support, and maintain core MLOps infrastructure (e.g., MLflow, feature store, experiment tracking, model registry), ensuring reliability, scalability, and long-term sustainability
  • Develop, evolve, and operate MLOps platforms and frameworks that standardize model deployment, versioning, drift detection, and lifecycle management at scale
  • Implement and continuously maintain end-to-end CI/CD pipelines for ML models using orchestration tools (e.g., Prefect, Airflow, Argo Workflows), ensuring robust testing, reproducibility, and traceability
  • Partner closely with Data Science, Sensor Intelligence, and Data Platform teams to operationalize and support model development, deployment, and monitoring workflows
  • Build, manage, and maintain both real-time and batch inference infrastructure, supporting diverse use cases from physiological analytics to personalized feedback loops for WHOOP members
  • Design, implement, and own automated observability tooling (e.g., for model latency, data drift, accuracy degradation), integrating metrics, logging, and alerting with existing platforms
  • Leverage AI-powered tools and automation to reduce operational overhead, enhance developer productivity, and accelerate model release cycles
  • Contribute to and maintain internal platform documentation, SDKs, and training materials, enabling self-service capabilities for model deployment and experimentation
  • Continuously evaluate and integrate emerging technologies and deployment strategies, influencing WHOOP’s roadmap for AI-driven platform efficiency, reliability, and scale
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Senior AI/ML Engineer

Barbaricum is seeking a highly experienced Senior AI/ML Engineer to support Soft...
Location
Location
United States , Crane
Salary
Salary:
Not provided
barbaricum.com Logo
Barbaricum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active DoD Secret Clearance (Top Secret preferred)
  • Bachelor’s degree in Computer Science, Engineering, or related technical discipline (Master’s preferred)
  • 10+ years of progressive experience in AI/ML engineering, software development, or applied data science
  • Expertise in developing, deploying, and securing AI/ML applications within mission-critical or defense environments
  • Demonstrated experience with LLMs, MLOps pipelines, and modern ML frameworks (e.g., PyTorch, TensorFlow)
  • Strong background in software and cyber engineering principles, including system hardening, secure coding, and vulnerability mitigation
  • Proven ability to lead complex technical efforts, mentor junior engineers, and interface with government stakeholders
  • DoD 8570 Advanced certification (e.g., SecurityX, GCSA, CCSP, or equivalent) must be obtained and maintained
Job Responsibility
Job Responsibility
  • Partner with project managers and engineering teams to define objectives for AI/ML systems in support of maneuver, surveillance, and engagement missions
  • Develop and prototype AI/ML systems to address mission-specific requirements, including computer vision, sensor fusion, and decision-support applications
  • Conduct rigorous testing and evaluation of AI/ML performance against operational datasets
  • Analyze test data to identify model strengths, weaknesses, and mission relevance
  • Refine and optimize systems to ensure robustness, scalability, and cyber resilience
  • Troubleshoot complex system challenges and provide technical guidance for deployed solutions
  • Deliver comprehensive documentation and technical reports to stakeholders
  • Maintain awareness of emerging AI/ML technologies, software engineering practices, and cyber defense techniques relevant to mission-critical systems
Read More
Arrow Right

Senior Engineering Manager, Computer Vision

Hover helps people design, improve, and protect the properties they love. With p...
Location
Location
United States , San Francisco/New York
Salary
Salary:
247000.00 - 305000.00 USD / Year
hover.to Logo
HOVER
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of managing high impact CV/ML teams (or tech lead / staff+ leadership) with a track record of building high-performing teams
  • 5+ years of hands-on experience in computer vision or ML (ideally 3D reconstruction, multi-view geometry, or ML-based reconstruction)
  • Proven track record partnering with product teams to scope features, run experiments, and iterate based on customer feedback and data
  • Familiarity with modern MLOps stacks (cloud GPUs, CI/CD, monitoring) and a passion for measurable reliability and cost control
  • Ability to articulate complex trade-offs to executives, engineers, and customers alike
  • Bachelor’s, Master’s, or PhD in CS, ML, or related field
Job Responsibility
Job Responsibility
  • Leading the Team: Build and nurture a high-performing, diverse team of senior ICs and emerging leaders. From hiring and onboarding to coaching and career-pathing, you’ll make talent development your first priority
  • Owning a Scaling Product Line: Take end-to-end ownership of a critical computer vision product area, ensuring our research breakthroughs translate into production systems that delight customers at scale
  • Shaping the Roadmap: Partner with Product and Design to translate market opportunities and research advances into a sequenced plan. You’ll balance innovation with operational excellence, driving projects from data strategy and experimentation through to reliable production deployment
  • Driving Technical Excellence: Set engineering standards for accuracy, latency, cost control, and reliability. Model strong cross-functional collaboration and ensure your team’s work integrates smoothly into Hover’s larger platform
  • Communicating Impact: Clearly articulate progress, trade-offs, and technical choices to executives, stakeholders, and the broader team
  • earning trust at every level
What we offer
What we offer
  • Compensation - Competitive salary and meaningful equity in a fast-growing company
  • Healthcare - Comprehensive medical, dental, and vision coverage for you and dependents
  • Paid Time Off - Unlimited and flexible vacation policy
  • Paid Family Leave - We support work/life balance and offer generous paid parental and new child bonding leave
  • Mandatory Self-Care Days - A day set aside each month to allow employees to recharge
  • Remote Wellbeing Resources - We provide recurring fitness classes, meditation/ mindfulness tools, virtual therapy, and family planning assistance
  • Learning - We encourage continued education and will help cover the cost of management training, conferences, workshops, or certifications
  • Fulltime
Read More
Arrow Right

Senior ML Ops Engineer

Join Elsevier as a Senior ML Ops Engineer to lead the development of impactful A...
Location
Location
United States , Philadelphia
Salary
Salary:
95300.00 - 158800.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Current experience in ML Engineering, MLOps platforms, shipping ML or search/GenAI systems to production
  • Strong Python, Java, and/or Scala experience
  • Hands-on experience with major cloud vendor solutions (AWS, Azure and/or Google)
  • Experience with Search/vector/graph technologies (e.g., Elasticsearch / OpenSearch / Solr / Neo4j)
  • Experience in evaluating LLM models
  • A strong understanding of the Data Science Life Cycle including feature engineering, model training, and evaluation metrics
  • Familiarity with ML frameworks, e.g., PyTorch, TensorFlow, PySpark
  • Experience with large-scale data processing systems, e.g., Spark
  • Experience with statistical analysis, machine learning theory and natural language processing
Job Responsibility
Job Responsibility
  • Automate and orchestrate machine learning workflows across major cloud and AI platforms (AWS, Azure, Databricks, and foundation model APIs such as OpenAI)
  • Maintain and version model registries and artifact stores to ensure reproducibility and governance
  • Develop and manage CI/CD for ML, including automated data validation, model testing, and deployment
  • Implement ML Engineering solutions using popular MLOps platforms such as AWS SageMaker, MLflow, Azure ML
  • Scale end-end custom Sagemaker pipelines
  • Design and implement the engineering components of GAR+RAG systems (e.g., query interpretation and reflection, chunking, embeddings, hybrid retrieval, semantic search), manage prompt libraries, guardrails and structured output for LLMs hosted on Bedrock/SageMaker or self-hosted
  • Design and implement ML pipelines that utilize Elasticsearch/OpenSearch/Solr, vector DBs, and graph DBs
  • Build evaluation pipelines: offline IR metrics (NDCG, MAP, MRR), LLM quality metrics (faithfulness, grounding), and A/B testing
  • Optimize infrastructure costs through monitoring, scaling strategies, and efficient resource utilization
  • Stay current with the latest GAI research, NLP and RAG and apply the state-of-the-art in our experiments and systems
What we offer
What we offer
  • Annual incentive bonus
  • Country specific benefits
  • Fair and accessible hiring process with accommodation support
  • Fulltime
Read More
Arrow Right

AI Product Manager

We’re scaling AI and machine learning across our products, devices, and operatio...
Location
Location
United States , Boston
Salary
Salary:
121300.00 - 177900.00 USD / Year
simplisafe.com Logo
SimpliSafe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of product management experience, including significant ownership of AI/ML or data-intensive products
  • Clear track record of shipping production ML systems (not just integrating third-party AI APIs), in close partnership with data science, ML engineering, and MLOps
  • Principal-level impact: leading cross-team initiatives, shaping strategy, and influencing senior stakeholders
  • Strong understanding of core ML concepts and lifecycle: data, labeling, training/validation, evaluation metrics, deployment, monitoring, and retraining
  • ML experience with at least one of following: computer vision or sensor data, LLM-powered applications (prompting, RAG, fine-tuning, evaluation) and/or hardware or edge products (e.g., on-device models, connectivity/latency trade-offs)
  • Familiarity with modern ML infrastructure (cloud platforms, model serving, CI/CD for ML, monitoring/alerting)
  • Comfortable going deep into data, metrics, and model behavior—not just the UX layer
  • Excellent communicator who can make complex AI topics clear to diverse audiences
  • Strong alignment with our values: customer-obsessed, low ego, highly collaborative, comfortable with ambiguity, and biased toward learning and iteration.
Job Responsibility
Job Responsibility
  • Define and communicate the multi-year roadmap for key AI/ML capabilities across SimpliSafe
  • Identify and prioritize AI opportunities where models and data can materially improve safety, customer experience, or efficiency—on both devices and cloud services
  • Make build-vs-buy decisions for AI capabilities in partnership with data science and engineering
  • Partner with data scientists, ML engineers, and MLOps to design and deliver end-to-end ML solutions—from problem framing through data, training, evaluation, deployment, and monitoring
  • Work with hardware and embedded teams to shape edge AI/ML experiences (e.g., on-device detection, low-latency decisions, bandwidth-aware designs)
  • Define model-level requirements (metrics, latency, cost, guardrails) and connect them to business outcomes (e.g., false alarm reduction, detection accuracy, handle time, CSAT)
  • Translate product needs into requirements for ML platform capabilities (model serving, observability, experiment tracking, human-in-the-loop tools)
  • Lead product direction for LLM and multimodal use cases (e.g., text, vision, sensor data)
  • Decide when to use prompt engineering, RAG, fine-tuning, or traditional ML—and how to evaluate quality, safety, and hallucinations
  • Design workflows that incorporate human review and escalation where needed
What we offer
What we offer
  • A mission- and values-driven culture and a safe, inclusive environment where you can build, grow, and thrive
  • A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families
  • Free SimpliSafe system and professional monitoring for your home
  • Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change
  • Participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits.
  • Fulltime
Read More
Arrow Right
New

Senior Manager, AI Platform Engineering

Socure is building the identity trust infrastructure for the digital economy — v...
Location
Location
United States
Salary
Salary:
190000.00 - 210000.00 USD / Year
socure.com Logo
Socure
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional software engineering experience, including time spent building or operating large-scale ML, data, or distributed systems platforms
  • 3+ years of engineering leadership experience managing multiple teams or engineering managers
  • Strong technical background in ML infrastructure, data engineering, and/or cloud-native distributed systems
  • Demonstrated experience delivering complex, cross-functional platform initiatives
  • Excellent communication and stakeholder management skills, with the ability to translate between technical detail and business priorities
  • Experience working in fast-paced, iterative environments using modern development practices
Job Responsibility
Job Responsibility
  • Develop and own the roadmap for Socure’s AI/ML platform, including data and feature engineering workflows, training infrastructure, experimentation tooling, model deployment/serving, monitoring, and governance
  • Define architecture and standards that create clear, scalable, and secure paths for building and operating AI systems
  • Assess technology options and drive consolidation across the company to reduce fragmentation and improve consistency across the ML toolchain
  • Partner with Data Science, Engineering, Product, and Sales-Enablement teams to develop AI infrastructure that delights Customers
  • Lead the design and operation of the end-to-end ML lifecycle: data ingestion, feature engineering, experimentation, training, model registry, deployment, and continuous monitoring
  • Partner closely with Data Science to enable fast, reproducible experimentation and reduce operational friction
  • Ensure the platform delivers reliability, traceability, observability, and performance for both batch and real-time model workloads
  • Guide the team to deliver high-quality platform capabilities with predictable timelines and strong technical rigor
  • Remove cross-team bottlenecks, align dependencies, and ensure seamless execution across Data, Infrastructure, and Product
  • Establish SLAs, operational standards, and production-readiness guidelines for ML pipelines and serving systems
What we offer
What we offer
  • Offers Equity
  • Offers Bonus
  • benefits
  • Fulltime
Read More
Arrow Right