Research Scientist: Post-Training

Generalist AI

Location:
United States, San Mateo

Contract Type:
Not provided

Salary:

200000.00 - 350000.00 USD / Year

Job Description:

Pretraining gives us a general model. Post-training makes it useful, controllable, safe, and performant in the real world. You will turn large pretrained robot models into production-ready systems through fine-tuning, reinforcement learning, steering, human feedback, task specialization, evaluation, and on-robot validation, all at scale. Regardless of your initial background, you will grow into a full-stack ML roboticist capable of quickly pinpointing issues on the ML side, the controls side, and everywhere in between. This is where research meets reality.

Job Responsibility:

  • Designing fine-tuning and adaptation strategies for downstream robotic tasks and embodiments
  • Developing methods for improving reliability, robustness, and controllability
  • Building evaluation frameworks that measure real-world robot performance, not just offline metrics
  • Improving inference-time performance (latency, stability, memory footprint) in collaboration with ML infrastructure
  • Leveraging techniques such as imitation learning, RL, distillation, synthetic data, and curriculum learning
  • Closing the loop between model outputs and physical-world outcomes

Requirements:

  • Experience with fine-tuning large models for downstream tasks (RLHF, IL, RL, distillation, domain adaptation, etc.)
  • Worked on embodied AI, robotics, or real-world ML systems
  • Care deeply about evaluation, benchmarking, and failure analysis
  • Comfortable debugging across the ML stack — from loss curves to robot behavior
  • Enjoy rapid iteration with real-world feedback loops
  • Want to bridge the gap between foundation models and physical deployment

What we offer:

Offers Equity

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work

Similar Jobs for Research Scientist: Post-Training

Research Scientist - Generative AI

As a Research Scientist in the Emergent Machine Intelligence Team at Hewlett Pac...
Location:
United States, Santa Barbara
Salary:
101900.00 - 234500.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences.
Job Responsibility:
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars.
What we offer:
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development.
Employment Type:
Fulltime

Research Scientist - Generative AI

This role involves conducting high-quality research in generative AI, designing ...
Location:
United States
Salary:
101900.00 - 234500.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences
Job Responsibility:
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars
What we offer:
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment Type:
Fulltime

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale works with the industry’s leading AI labs to provide high quality data and...
Location:
United States, San Francisco; Seattle; New York
Salary:
252000.00 - 315000.00 USD / Year
Scale
Expiration Date:
Until further notice
Requirements:
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning
  • Excellent written and verbal communication skills
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals
  • Previous experience in a customer-facing role
Job Responsibility:
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities
  • Design and experiment with new approaches to preference optimization
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness
  • Publish research findings in top-tier AI conferences
What we offer:
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
  • Equity-based compensation
  • Commuter stipend
Employment Type:
Fulltime

Research Engineer / Scientist - Post-training

At Luma, the Post-training team is responsible for unlocking creative control in...
Location:
United States, Palo Alto
Salary:
187500.00 - 395000.00 USD / Year
Luma AI
Expiration Date:
Until further notice
Requirements:
  • Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Demonstrated ability to do independent research in Academic or Industry settings
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
  • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content
  • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation
  • Established track record of effective cross-functional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists
Job Responsibility:
  • Optimize Luma's image and video generative models through targeted fine-tuning to improve visual quality, instruction adherence, and overall performance metrics
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Luma's platforms, and execute targeted fine-tuning initiatives to address performance gaps and enhance user-facing capabilities
  • Conduct comprehensive side-by-side evaluations comparing model performance against leading market competitors, systematically analyzing the impact of post-training techniques on downstream performance metrics and identifying areas for improvement
  • Develop advanced post-training capabilities for Luma’s video models, including camera control, object and character reference, image and video editing, and human performance and motion transfer approaches
  • Architect data processing pipelines for large-scale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Research and deploy cutting-edge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks
  • Research emerging post-training methodologies in generative AI, evaluate their applicability to Luma's product ecosystem, and integrate promising techniques into our Post-training recipe
Employment Type:
Fulltime

AI Research Lead

Perplexity is seeking an exceptional AI Research Tech Lead to drive our research...
Location:
United States, San Francisco
Salary:
300000.00 - 470000.00 USD / Year
Perplexity
Expiration Date:
Until further notice
Requirements:
  • Minimum of 5 years of experience working on relevant AI/ML projects with 3+ years in a technical leadership role
  • Proven track record of leading and mentoring technical and research teams
  • A graduate degree in Computer Science from a premier academic institution
  • Deep expertise with large-scale LLMs and Deep Learning systems
  • Strong programming skills with versatility across multiple languages and frameworks
  • Demonstrated ability to set technical vision and drive execution
  • Experience with pre-training and post-training techniques (self-supervised learning along with SFT/DPO/GRPO/PPO)
  • Self-starter with exceptional ownership mentality and ability to work in ambiguous environments
  • Passion for solving challenging problems and pushing the boundaries of AI research
Job Responsibility:
  • Define and execute the macro research direction across multiple modalities, including post-training LLMs for agent trajectories and future mid-training initiatives
  • Lead strategic research planning and roadmap development to advance Sonar model capabilities
  • Drive innovation in supervised and reinforcement learning techniques for query answering
  • Collaborate with leadership to align research priorities with product and business objectives
  • Coach and mentor a team of AI research scientists and engineers, fostering their technical and professional growth
  • Establish the long-term macro research direction across the team, including our direction across different modalities
  • Lead hiring and onboarding of new research talent
  • Create a collaborative environment that encourages knowledge sharing and innovation
  • Post-train SOTA LLMs on query answering using cutting-edge supervised and reinforcement learning techniques
  • Own and optimize the full stack data, training, and evaluation pipelines required for LLM post-training
What we offer:
  • Equity
  • Health
  • Dental
  • Vision
  • Retirement
  • Fitness
  • Commuter and dependent care accounts
Employment Type:
Fulltime

AI Research Scientist, Post-Training - Meta Superintelligence Labs

Meta is seeking Research Scientists to join the Post-Training team within Meta S...
Location:
United States, Menlo Park
Salary:
154000.00 - 217000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Ph.D. in Computer Science, Machine Learning, or a related technical field
  • 3+ years of experience in machine learning research, with a focus on deep learning, data alignment, NLP, or related areas
  • Demonstrated ability to lead technical research projects from conception to production
  • Effective communication skills and experience collaborating with technical leadership
Job Responsibility:
  • Design novel methodologies for post-training data collection, curation, and synthetic data generation
  • Define data quality frameworks and alignment strategies that guide capability development across MSL, particularly for complex reasoning and agentic behaviors
  • Drive the scientific vision for eliciting high-quality data in expert domains (finance, legal, health, STEM) and complex agentic trajectories (deep research, computer use, UI generation)
  • Conduct research to develop and optimize post-training recipes that directly improve model quality
  • Partner with cross-functional research teams across product and model training to identify and prioritize gaps in model capabilities
  • Contribute to research workstreams that shape the long-term direction of data-centric AI at MSL, working independently while also contributing to team goals and organizational priorities
What we offer:
  • Bonus
  • Equity
  • Benefits
Employment Type:
Fulltime

Research Engineering Manager, Post-Training

Meta is seeking a Research Engineering Manager to lead the Post-Training team wi...
Location:
United States, Menlo Park
Salary:
219000.00 - 301000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects (specifically data pipelines or ML infrastructure) from conception to deployment
  • Software engineering practices including version control, testing, code review, and system design
  • Demonstrated ability to balance hands-on technical work with people management and strategic planning
  • Great communication skills with the ability to influence cross-functional stakeholders
Job Responsibility:
  • Build, mentor, and grow a team of research engineers focused on full-stack post-training data infrastructure
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a culture of engineering excellence, data rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Oversee the development and scaling of data collection pipelines for high-value domains (STEM, GDP-valuable tasks, finance, legal, health) and complex agentic workflows (deep research, computer use, shopping agents)
  • Establish and manage partnerships with external data vendors to source and securely prepare expert-level post-training datasets
  • Influence the technical roadmap for data infrastructure in collaboration with the MSL Infra team
  • Translate the strategic vision of research scientists into actionable engineering plans for synthetic data generation, SFT, and RLHF pipelines
  • Partner with research scientists, product teams, and model training teams to align data collection priorities with organizational capability goals
  • Build robust, reusable data pipelines that can rapidly deliver high-quality datasets to multiple model lines
What we offer:
  • Bonus
  • Equity
  • Benefits
Employment Type:
Fulltime

Research Scientist Intern, AI Research Multi-modal Post-Training

Meta is seeking Research Scientist Interns in the Meta Superintelligence org. We...
Location:
United States, Menlo Park
Salary:
7650.00 - 12134.00 USD / Month
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, NLP, Reinforcement Learning (RL), Computer Vision, Artificial Intelligence, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in Python or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility:
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning the semantics of data at scale (images, video, text, audio, and other modalities)
  • Improve and propose new methods for post-training foundation models across the spectrum of techniques including reinforcement learning and supervised fine tuning
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Devise better data-driven models of image multi-modal understanding
  • Publish research results and contribute to research that can be applied to Meta product development