CrawlJobs Logo

Staff Product Manager, Managed Inference

crusoe.ai Logo

Crusoe

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

204000.00 - 247000.00 USD / Year

Job Description:

As a core member of the Crusoe Managed AI Services team, you will own the complete product lifecycle for our Managed Inference offerings, from initial concept and strategic roadmap to successful execution and market adoption. You will be the champion for our inference service offerings, translating market needs and technical complexities into clear product specifications, compelling narratives, and product decisions that drive business growth for Crusoe Cloud.

Job Responsibility:

  • Own the end-to-end product lifecycle for Crusoe’s Managed Inference services, including roadmap definition, execution, and iteration
  • Translate customer needs, market signals, and technical constraints into clear product requirements and prioritization
  • Partner closely with Engineering, Infrastructure, and Platform teams to deliver scalable, reliable inference services
  • Drive product decisions across performance, reliability, cost efficiency, and developer experience
  • Define and track success metrics for inference services in production environments
  • Collaborate with go-to-market teams to support product launches, positioning, and customer adoption
  • Communicate product strategy and tradeoffs clearly to cross-functional partners and leadership

Requirements:

  • 6+ years of experience in technical product management or engineering roles with product responsibilities
  • Experience building and launching cloud infrastructure, platform, or AI/ML services used in production
  • Strong understanding of cloud infrastructure (e.g., AWS, GCP, Azure) and modern compute architectures
  • Familiarity with the machine learning lifecycle, particularly model deployment, inference, and monitoring
  • Strong communication and collaboration skills, with experience working across engineering, product, and business teams
  • Demonstrated ability to operate independently with strong product judgment and a bias for action
  • Bachelor’s degree in Computer Science or a related technical field (or equivalent experience)

Nice to have:

  • Experience building developer-facing platforms or services
  • Exposure to inference-as-a-service, model serving frameworks, or ML infrastructure tooling
  • Participation in developer communities or open-source projects
  • Strong interest in trends across AI infrastructure and inference at scale
What we offer:
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid commuter benefit
  • $300/month

Additional Information:

Job Posted:
January 19, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff Product Manager, Managed Inference

Staff Product Security Engineer

We’re looking for a Staff Product Security Engineer to lead the design and imple...
Location
Location
United States
Salary
Salary:
184000.00 - 252000.00 USD / Year
alpha-sense.com Logo
AlphaSense
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in product, application, or cloud security engineering
  • Deep understanding of secure SDLC, threat modeling, and secure architecture design
  • Proven expertise with AWS cloud security concepts and best practices
  • Strong experience with container security, orchestration, and runtime protection
  • Proficiency in Python, Java, and/or JavaScript for security automation, code review, and tooling
  • Experience securing AI/ML pipelines, data workflows, or model-serving infrastructure
  • Familiarity with DevSecOps and continuous integration/deployment environments
Job Responsibility
Job Responsibility
  • Embed robust security practices throughout the software and AI development lifecycle (SDLC)
  • Lead secure design reviews, threat modeling, and risk assessments for AI-driven products, APIs, and backend services
  • Partner with engineering and product teams to ensure security, privacy, and compliance by design
  • Build and maintain security automation and governance frameworks that integrate seamlessly into development workflows
  • Architect and enforce security controls for AI/ML systems, including model training, data pipelines, and inference environments
  • Identify and mitigate AI-specific attack vectors such as data poisoning, model inversion, prompt injection, and model theft
  • Collaborate with governance and compliance teams to align with ethical AI principles and frameworks like NIST AI RMF and the EU AI Act
  • Implement model provenance, integrity, and auditability controls to ensure responsible and secure AI operations
  • Partner with DevOps and SRE teams to secure service meshes, container networking, and secrets management
  • Drive software supply chain security, including artifact integrity, dependency management, and vulnerability reduction
What we offer
What we offer
  • Competitive compensation, benefits, and career growth opportunities
  • Opportunity to shape and drive product security strategy
  • Collaborative and security-minded engineering culture
  • Work on cutting-edge security challenges in a fast-growing company
  • Performance-based bonus, equity, and a generous benefits program
  • Fulltime
Read More
Arrow Right

Staff Data Scientist, Inference - Customer Support

As a Staff Data Scientist working on Inference in CS, you will have the opportun...
Location
Location
United States
Salary
Salary:
194000.00 - 240000.00 USD / Year
airbnb.com Logo
Airbnb
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 9+ years of relevant industry experience (e.g. data scientist, tech lead, junior faculty)
  • Master’s degree or PhD in relevant fields
  • Strong fluency in Python, SQL
  • Familiarity with causal inference and experimentation and a desire to continue to learn
  • Proven ability to communicate clearly and effectively to audiences of varying technical levels
  • Proven mix of strong intellectual curiosity with high level of pragmatism and engagement with the technical community
Job Responsibility
Job Responsibility
  • Lead DS efforts building segmentation features for CS including data exploration, optimization and model prototyping
  • Work collaboratively with cross functional partners including software engineers, product managers, operations and research, to refine requirements for segmentation models, drive scientific decisions, and quantify impact
  • Develop simulations to validate the impact of potential product and operational changes to the business
  • Design and utilize new experiment frameworks to measure the effectiveness feature launches, even when A/B tests are infeasible
  • Regularly present work internally at monthly meetings to technical, engineering and product stakeholders to iterate and generate excitement on roadmap progress
  • Bring new ideas to the team which can improve the operational efficiency of customer support at airbnb
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Employee Travel Credits
  • Fulltime
Read More
Arrow Right

Staff Data Scientist, Platform (Inference/Payments)

We are looking for a passionate Staff-level Data Scientist to lead quantitative ...
Location
Location
United States
Salary
Salary:
194000.00 - 240000.00 USD / Year
airbnb.com Logo
Airbnb
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 9+ years of industry experience in a quantitative analysis role with a Master’s degree in a quantitative field (math / economics / statistics, and etc.), or 6+ years of experience with a PhD degree
  • Strong knowledge of causal inference, experimentation, and statistical modeling
  • Skilled in statistical programming (Python or R) and database usage (SQL)
  • Proven ability to communicate clearly and effectively to audiences of varying technical levels
  • Ability to translate complex findings and results into compelling narratives that drive impact
  • Excellent project management, communication, and teamwork skills
Job Responsibility
Job Responsibility
  • Inference: Develop and apply causal inference methods, including experimental, econometric regressions, and quasi-experimental methods to measure a wide-range of platform/product impacts
  • AI/ML: Build methods for robust evaluation of ML/AI model efficiency and performance. Ability to identify use-cases for and develop predictive models to classify, segment, and interpret our users’ behavior
  • Optimization: Develop methodologies to explore/simulate the impact of new interventions and develop data products to optimize product/operational strategies
  • Communication: Deliver robust research reports and effective data visualizations. Collaborate with and present to stakeholders to identify opportunities and communicate findings, and drive impact
  • Empowerment: Think strategically about opportunities to improve and scale our brand measurement and customer insights
What we offer
What we offer
  • bonus, equity, benefits, and Employee Travel Credits
  • Fulltime
Read More
Arrow Right

Staff Backend Engineer, Speech AI

Our intelligent runtime must seamlessly connect to foundational models to power ...
Location
Location
United States , Mountain View
Salary
Salary:
200000.00 - 300000.00 USD / Year
inworld.ai Logo
Inworld AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A BA/BS degree in Computer Science or a related technical field, or equivalent practical experience
  • 5+ years of professional experience in software development, with a proven track record of shipping high-quality, user-facing products
  • Strong product sense and an ability to think critically about user experience and business impact
  • Demonstrated experience in building and scaling production-grade backend APIs and distributed systems
  • Strong proficiency in Python and professional experience with one or more of the following: Java/Kotlin, or Go
  • Hands-on experience with containerization (Docker) and deploying services on orchestration platforms like Kubernetes
  • A solid foundation in data structures, algorithms, and system design
Job Responsibility
Job Responsibility
  • Own features end-to-end, from collaborating on the initial concept with product managers to shipping and monitoring the final product
  • Translate product requirements and user needs into robust, scalable, and maintainable backend services and APIs
  • Design, build, and launch user-facing APIs and backend systems in Python, Java/Kotlin, and Go that deliver seamless voice experiences
  • Partner closely with Product Managers and ML engineers to define scope, identify technical trade-offs, and drive the product roadmap forward
  • Write high-quality, production-grade code that powers real-time audio processing, model inference, and complex data pipelines
  • Champion engineering and product excellence, with a focus on delivering tangible value to our users quickly and iteratively
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • relocation assistance
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Inference

We're looking for an ML infrastructure engineer to bridge the gap between resear...
Location
Location
United States
Salary
Salary:
240000.00 - 290000.00 USD / Year
runwayml.com Logo
Runway
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience running ML model inference at scale in production environments
  • Strong experience with PyTorch and multi-GPU inference for large models
  • Experience with Kubernetes for ML workloads—deploying, scaling, and debugging GPU-based services
  • Comfortable working across multiple cloud providers and managing GPU driver compatibility
  • Experience with monitoring and observability for ML systems (errors, throughput, GPU utilization)
  • Self-starter who can work embedded with research teams and move fast
  • Strong systems thinking and pragmatic approach to production reliability
  • Humility and open mindedness
Job Responsibility
Job Responsibility
  • Productionize model checkpoints end-to-end: from research completion to internal testing to production deployment to post-release support
  • Build and optimize inference systems for large-scale generative models running on multi-GPU environments
  • Design and implement model serving infrastructure specialized for diffusion models and real-time diffusion workflows
  • Add monitoring and observability for new model releases—track errors, throughput, GPU utilization, and latency
  • Embed with research teams to gather training data, run preprocessing scripts, and support the model development process
  • Explore and integrate with GPU inference providers (Modal, E2E, Baseten, etc.)
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer

Help design our AI platform and develop our next generation of machine learning ...
Location
Location
United States , San Francisco
Salary
Salary:
216500.00 - 324500.00 USD / Year
gofundme.com Logo
GoFundMe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 9+ years of hands-on experience in machine learning engineering, AI development, software engineering, or related fields
  • Experience emphasizing secure, large-scale, distributed system design, AI/ML pipeline development, and implementation
  • Extensive experience designing, developing, and operating scalable backend systems
  • Experience applying software engineering best practices such as domain-driven design, event-driven architectures, and microservices
  • Deep expertise in agentic workflows, AI evaluation solutions, prompt management, and secure AI development and testing practices
  • Strong knowledge of relational and document-based databases, data storage paradigms, and efficient RESTful API design
  • Experience establishing robust CI/CD pipelines, automated testing (unit and integration), and deployment practices
  • Strong leadership skills, including effective planning and management of complex projects, mentoring of team members, and fostering a collaborative, high-performing engineering culture
  • Excellent communicator, able to articulate complex technical concepts clearly to both technical and non-technical stakeholders
  • Bachelor's degree in Computer Science, Software Engineering, or a related technical field (preferred)
Job Responsibility
Job Responsibility
  • Design and implement AI platforms to enable scalable and secure access to LLMs from multiple model providers for diverse use cases
  • Design and implement agentic workflows, agentic tool ecosystems, and LLM prompt management solutions
  • Design, build, and optimize scalable model training, fine tuning, and inference pipelines, ensuring robust integration with production systems
  • Influence technical strategy and approach to developing embedding stores, vector databases, and other reusable assets
  • Lead initiatives to streamline ML and AI workflows, improve operational efficiency, and establish standardized procedures to achieve consistent, high-quality results across our AI systems
  • Design and develop backend services and RESTful APIs using Python and FastAPI, integrating seamlessly with ML pipelines and services
  • Take operational responsibility for team-owned services, including performance monitoring, optimization, troubleshooting, and participation in an on-call rotation
  • Collaborate with both technical and non-technical colleagues, including data and applied scientists, software engineers, product managers, and business stakeholders, to deliver reliable and scalable ML-driven products
  • Coach and mentor fellow ML engineers, promoting a culture of collaboration, continuous improvement, and engineering excellence within the team
  • Employ a diverse set of tools and platforms including Python, AWS, Databricks, Docker, Kubernetes, FastAPI, Terraform, Snowflake, Coralogix, and GitHub to build, deploy, and maintain scalable, highly available machine learning infrastructure
What we offer
What we offer
  • Competitive pay
  • Comprehensive healthcare benefits
  • Financial assistance for things like hybrid work, family planning
  • Generous parental leave
  • Flexible time-off policies
  • Mental health and wellness resources
  • Learning, development, and recognition programs
  • Fulltime
Read More
Arrow Right

Staff Machine Learning Engineer

As a Staff Machine Learning Engineer at Aignostics, you will play a crucial role...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
aignostics.com Logo
Aignostics
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in a relevant field or extensive work experience
  • 8+ years of industry experience, with at least 2 years as Staff Engineer or an equivalent role
  • Proven track record of driving technical excellence and innovation
  • Solid background in data-intensive systems and software architecture, design patterns and clean coding
  • Expert Python programming and fluency in C/C++ or other low-level language(s)
  • Experience with designing and implementing large-scale, distributed ML systems and platforms
  • Proven track record of deploying ML models into production environments
  • Strong knowledge of machine learning fundamentals
  • Experience with deep learning frameworks (e.g. Pytorch and Tensorflow) and state-of-the-art techniques (e.g. generative models)
  • Deep understanding of cloud technologies (e.g. GCP, AWS), containerization and orchestration (Kubernetes)
Job Responsibility
Job Responsibility
  • Define and drive the technical architecture and system design principles for our AI platform and infrastructure
  • Work in close collaboration with engineering leads to build flexible frameworks and systems for model training, evaluation and inference across different pathology applications
  • Guide the CTO office, product management and fellow engineering leads through complex decisions by providing expert consultation on feasibility, architecture, trade-offs and risk mitigation strategies, while ensuring alignment with our technical vision
  • Foster technical alignment across teams by establishing shared architectural principles and best practices, facilitating cross-team design reviews to enable consistent decision-making across domains
  • Champion technical excellence by leading strategic initiatives that modernize our architecture and reduce technical debt while measuring and improving our technical health metrics
  • Elevate the technical capabilities of our engineering staff through structured mentoring, workshops and establishing comprehensive technical guidelines that enable teams to make better design decisions
  • Drive innovation by evaluating emerging technologies, leading proof-of-concept initiatives and building support for strategic technical investments that advance our engineering capabilities while ensuring measurable business value
What we offer
What we offer
  • Cutting-edge AI research and development, with involvement of Charité, TU Berlin and our other partners
  • Work with a welcoming, diverse and highly international team of colleagues
  • Opportunity to take responsibility and grow your role within the startup
  • Expand your skills by benefitting from our Learning & Development yearly budget of 1,000 € (plus 2 L&D days), language classes and internal development programs
  • Mentoring program, you’ll learn from great experts
  • Flexible working hours and teleworking policy
  • 30 paid vacations days per year
  • We are family & pet friendly and support flexible parental leave options
  • Pick a subsidized membership of your choice among public transport, sports and well-being
  • Enjoy our social gatherings, lunches and off-site events for a fun and inclusive work environment
Read More
Arrow Right
New

Director Agentic AI

We are seeking a Director of Data Science - Amgen’s most senior individual-contr...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years in advanced analytics with 4+ years managing high-performing data-science or ML teams
  • Deep command of classical ML, time-series, deep-learning (CNNs, transformers) and causal-inference techniques
  • Expert Python and strong SQL
  • Hands-on experience deploying models via modern MLOps stacks (MLflow, Kubeflow, SageMaker, Vertex AI or Azure ML)
  • Proven ability to translate complex analytics into concise, outcome-oriented narratives
  • Working knowledge of AWS, Azure or GCP
  • Familiarity with privacy, security and AI-governance frameworks (GDPR, HIPAA, GxP, EU AI Act)
  • Master’s degree with 15 - 17 + years of experience in Computer Science, IT or related field OR Bachelor’s degree with 18 - 20 + years of experience in Computer Science, IT or related field
  • Excellent analytical and troubleshooting skills
  • Strong verbal and written communication skills
Job Responsibility
Job Responsibility
  • Develop and execute a multi-year data-science strategy and roadmap
  • Lead, mentor and grow a high-performing staff of data scientists and ML engineers
  • Own the end-to-end delivery of advanced analytics and machine-learning solutions
  • Prioritise and manage a balanced portfolio of initiatives
  • Provide hands-on guidance on algorithm selection and experimentation
  • Establish and enforce best practices for code quality, version control, MLOps pipelines, model governance and responsible-AI safeguards
  • Partner with Data Engineering, Product, IT Security and Business stakeholders to integrate models into production systems
  • Manage cloud and on-prem analytics environments
  • Champion a data-driven culture by communicating insights and model performance to VP/SVP-level leaders
  • Track emerging techniques, regulatory trends and tooling in AI/ML
Read More
Arrow Right