CrawlJobs Logo

Research Engineer – Audio & Speech Models

zyphra.com Logo

Zyphra

Location Icon

Location:
United States , Palo Alto

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

As a Research Engineer - Audio & Speech Models, you will be a core contributor on Zyphra’s Audio Team, building the next generation of open-source text-to-speech and audio models. You will be deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies.

Job Responsibility:

  • Building the next generation of open-source text-to-speech and audio models
  • Deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies
  • Work across: Large-scale audio training runs
  • Performance optimization of our training stack
  • Audio dataset collection, processing, and evaluation
  • Architecture and training methodology ablations and improvements

Requirements:

  • Strong research taste and intuition. The ability to work through a research project from conception to execution to write-up
  • Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly)
  • The ability to work well with others in a high-paced research setting
  • Can rapidly learn new fields and are excited to implement new ideas
  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale.

Nice to have:

  • Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models
  • Experience in training audio autoencoders
  • Understanding of signal processing, especially of audio signals
  • Experience with diffusion models, consistency models, or GANs
  • Experience with training on large-scale (multi-node) GPU clusters
  • Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
  • Understanding of and interest in large-scale, highly parallel data processing pipelines
  • Proficiency with PyTorch and Python
  • Experience contributing to large pre-existing codebases and rapidly getting up to speed
  • Previously published machine learning research in well-respected venues
  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics, Machine Learning)
What we offer:
  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team
  • Thursday Happy Hours

Additional Information:

Job Posted:
January 13, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer – Audio & Speech Models

Research Engineer

We are looking for a Research Engineer to join the research team at ElevenLabs. ...
Location
Location
Poland
Salary
Salary:
Not provided
elevenlabs.io Logo
ElevenLabs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years industry experience as a Machine Learning Engineer, with a key emphasis on constructing data pipelines, as well as developing and implementing machine learning models
  • Demonstrating the capacity to autonomously evaluate novel concepts or enhance current machine learning projects, with the potential outcome of contributing to published works
  • Extensive background in conducting exploratory research to enhance the excellence of gathered data, particularly within the realm of audio and text-to-speech domains
Job Responsibility
Job Responsibility
  • Creating and upholding a reliable and expandable data management system specialized for text-to-speech projects. This includes establishing guidelines for versioning and ensuring data quality
  • Establishing a streamlined process for autonomously training, assessing, and launching text-to-speech models. This encompasses implementing procedures for dynamic learning, as well as routines for fine-tuning and refreshing validation data
  • Investigating cutting-edge approaches and strategies in machine learning, deep learning, and algorithms pertaining to text-to-speech technology
What we offer
What we offer
  • Innovative culture
  • Growth paths
  • Learning & development: ElevenLabs proactively supports professional development through an annual discretionary stipend
  • Social travel: We also provide an annual discretionary stipend to meet up with colleagues each year, however you choose
  • Annual company offsite
  • Co-working: If you’re not located near one of our main hubs, we offer a monthly co-working stipend
  • Fulltime
Read More
Arrow Right

Machine Learning Researcher Engineer

Machine Learning Engineer / Researcher at BoldVoice, you’ll play a critical role...
Location
Location
United States , New York
Salary
Salary:
150000.00 - 220000.00 USD / Year
helpcare.ai Logo
Helpcare AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of experience working on machine learning models in production environments, specifically training, fine-tuning, evaluating and directly implementing machine learning models, in the fields of Speech, NLP, and/or Vision
  • Experience in Automatic Speech Recognition (ASR) will be particularly useful, as will knowledge of phonetics and the ability to discern sounds and accents
  • Proficiency in Python and frameworks like TensorFlow, PyTorch, or similar
  • Up to date with latest developments in using LLM tools like Claude Code, Cursor, Codex or similar to rapidly prototype, and ship code quickly
Job Responsibility
Job Responsibility
  • Designing, training, and fine-tuning machine learning models for AI coaching, pronunciation feedback, and accent detection. This will include working on LLMs, speech models like Wav2Wec2.0, and multi-modal models like speech to speech models
  • Deploying these models into production environments for real-time and batch inference
  • Building reusable and organized data preprocessing pipelines for various data, including audio data, text data and more
  • Setting up automated evaluation systems to monitor model performance
  • Optimizing training workflows to reduce time-to-deployment
What we offer
What we offer
  • Excellent fully paid health/vision/dental insurance
  • 401K program
  • Help with relocation to NYC
  • Access to exclusive startup events, conferences and networks
  • Generous stock options
  • Fulltime
Read More
Arrow Right
New

AI Engineer - Speech & NLP

Interhuman AI is building the next generation of social intelligence infrastruct...
Location
Location
Denmark , København
Salary
Salary:
55000.00 - 65000.00 DKK / Year
life-science-talent-solutions.dk Logo
Life Science Talent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Machine Learning, Computer Science, or a related field with a focus on speech processing and/or NLP
  • Track record of building and shipping models
  • Strong proficiency in Python and deep experience with PyTorch (or JAX/TensorFlow)
  • Familiarity with the current landscape of speech and multimodal models (e.g., Whisper, audio-LLMs, speech encoders, vision-language models)
  • You thrive with ambiguity. You can scope your own work, prioritize ruthlessly, and know when to ask for input
  • Clear communicator—you can explain a complex architecture to both engineers and non-technical stakeholders
Job Responsibility
Job Responsibility
  • Design, train, and iterate on speech and language models that extract social and emotional signals from live conversation
  • Own the full model development lifecycle—from data curation and architecture design through training, evaluation, and production deployment
  • Build evaluation frameworks and benchmarks that capture the subtleties of human interaction that standard metrics miss
  • Stay at the frontier of multimodal research and translate relevant advances into our production stack
  • Collaborate closely with engineering to ensure models meet real-time latency and scalability requirements
What we offer
What we offer
  • Competitive salary and meaningful equity in an early-stage, venture-backed company
  • Direct influence on technical direction—your work shapes the product, not just a feature
  • A small, focused team where your contributions are visible and impactful from day one
  • Flexibility on location and working arrangements
  • Fulltime
Read More
Arrow Right

Applied Scientist, Audio Algorithms

Category-defining products such as the Oculus headsets and the Ray Ban Meta smar...
Location
Location
United States , Redmond
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Currently has, or is in the process of obtaining a PhD in Computer Science, Computer Engineering, Electrical Engineering, relevant technical field, or equivalent practical experience
  • Experience with C or C++ and with scientific programming languages such as MATLAB, python or similar
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Research, design, and develop advanced speech and audio signal processing algorithms
  • Build specialized machine learning (ML) models for resource-constrained platforms (e.g., mobile/wearables)
  • Implement and optimize audio processing algorithms for embedded platforms
  • Analyze audio recordings and simulator outputs to uncover insights into audio processing system performance and function
  • Develop new perceptual audio quality metrics applicable to new product experiences
  • Contribute to roadmap planning for the next generation of platforms
  • Collaborate with DSP engineers, audio systems engineers, product managers, user experience researchers, hearing scientists, psychoacoustics researchers, and other disciplines to create new technologies and integrate them into products
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Scientist Intern, Audio

We are seeking a highly motivated and talented Audio Research Scientist Intern t...
Location
Location
United States , Redmond
Salary
Salary:
7313.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or in the process of obtaining a PhD in Computer Science, Electrical Engineering, Auditory Neuroscience, Audio Signal processing or a related field
  • Experience in building deep learning models
  • Experience with LLM models
  • 2+ years experience with Python and PyTorch
  • Understanding of audio processing concepts
  • Proven communication and collaboration skills
  • Demonstrated skill in learning and applying new concepts, techniques, and tools to solve complex problems
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Train or fine-tune audio models in Pytorch
  • Process and analyze speech and audio data: including binaural data simulation, data cleaning, feature extraction and visualization
  • Collaborate with other researchers to collect data through listening experiments
  • Design and conduct experiments to evaluate the performance of these models and interpret results
  • Communicate findings through written reports and presentations
Read More
Arrow Right

Research Scientist Manager - ML Modeling & Applied Research

Reality Labs at Meta is seeking Research Scientist Manager with experience in ma...
Location
Location
United States , Burlingame
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience working autonomously to design, execute, interpret, and present ML research outcomes
  • 4+ years of experience as technical lead for a project of 4 or more individuals
  • 2+ years of direct management experience, managing Researchers or Engineers
  • Experience in the analysis and modeling of high dimensional time series, such as neural signals, multi-channel audio recordings, multi-modal/multi-sensor signals, robotic sensory signals, financial time series, video, or other sensor modalities
  • Experience bringing machine learning-based products from research to production
  • Experience with interdisciplinary and/or cross-functional collaboration
  • Experience with large scale cluster computing for machine learning modeling
  • Experience developing end-to-end ML pipelines, including: dataset preprocessing, model development and evaluation, and software integration
  • Experience in software engineering in industry
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Build cutting-edge machine learning and signal processing models (event detection, sequence-to-sequence, signal separation, time series regression, etc.) to advance neuromotor interface capabilities
  • Collaborate with engineering and Human-Computer Interactions (HCI) teams to deploy models that leverage fundamental scientific knowledge into new technology and user experiences
  • Use quantitative research methods to define, iterate upon and advance key areas of our research agenda
  • Develop research-grade code for deployment in research prototypes. Work across organizational boundaries to solve our biggest problems
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Staff Machine Learning Engineer

We are on a mission to ensure everyone has access to medical expertise, no matte...
Location
Location
Denmark , København
Salary
Salary:
Not provided
life-science-talent-solutions.dk Logo
Life Science Talent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Master’s degree in computer science, engineering, mathematics, statistics, physics, or a related field (or equivalent experience)
  • Mastery in Python, data structures, and algorithms, with the ability to contribute to production-level code
  • Experience with at least one major ML framework (PyTorch or TensorFlow)
  • Practical experience working on applied machine learning projects (academic or industry)
  • Strong communication skills—you can explain technical concepts to both technical and non-technical colleagues
  • Experience with natural language processing, speech recognition, or generative AI
  • Familiarity with designing ML pipelines for real-world applications
  • Contributions to research projects, publications, or open-source work
Job Responsibility
Job Responsibility
  • Contribute to building and improving machine learning models for audio, text, and structured data
  • Translate research outcomes into prototypes that can inform new product features
  • Help design and implement APIs and pipelines that make ML models accessible across the organization
  • Collaborate with researchers, engineers, and product teams to move ML ideas into production
  • Improve healthcare globally as part of your daily work
What we offer
What we offer
  • Equipment provided by Corti
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer

We are on a mission to ensure everyone has access to medical expertise, no matte...
Location
Location
Denmark , København
Salary
Salary:
Not provided
life-science-talent-solutions.dk Logo
Life Science Talent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Master’s degree in computer science, engineering, mathematics, statistics, physics, or a related field (or equivalent experience)
  • Mastery in Python, data structures, and algorithms, with the ability to contribute to production-level code
  • Experience with at least one major ML framework (PyTorch or TensorFlow)
  • Practical experience working on applied machine learning projects (academic or industry)
  • Strong communication skills—you can explain technical concepts to both technical and non-technical colleagues
  • Experience with natural language processing, speech recognition, or generative AI
  • Familiarity with designing ML pipelines for real-world applications
  • Contributions to research projects, publications, or open-source work
Job Responsibility
Job Responsibility
  • Contribute to building and improving machine learning models for audio, text, and structured data
  • Translate research outcomes into prototypes that can inform new product features
  • Help design and implement APIs and pipelines that make ML models accessible across the organization
  • Collaborate with researchers, engineers, and product teams to move ML ideas into production
  • Improve healthcare globally as part of your daily work
What we offer
What we offer
  • Hybrid working environment in our Copenhagen Office
  • Equipment provided by Corti
  • Fulltime
Read More
Arrow Right