CrawlJobs Logo

Machine Learning Engineer - Pre-Training

wayve.ai Logo

Wayve

Location Icon

Location:
United Kingdom , London

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are seeking skilled engineers to join our Training Tech team working on optimising large scale training jobs as we aim to scale our models through the next order of magnitude. A successful candidate will increase efficiency of training jobs in order to allow Wayve to train larger models faster.

Job Responsibility:

  • Profile training jobs to identify their bottlenecks, e.g. using NVIDIA Nsight Systems
  • Design and implement efficiency improvements to maximise MFU, e.g. tensor parallelism, model compilation, mixed precision
  • Design and implement observability tools, e.g. to track MFU
  • Collaborate closely with Research teams to integrate training efficiency improvements and create a culture of performance optimization

Requirements:

  • Experience optimize large scale training jobs on GPU compute clusters
  • Experience in working in platform teams and working with research teams
  • Experience in reporting and tracking over time benchmarked performance in an open and accessible way
  • Ability to write high quality, well-structured and tested Python code
  • BS or MS in Machine Learning, Computer Science, Engineering, or a related technical discipline or equivalent experience

Nice to have:

  • Solid experience working with concurrent, parallel and distributed computing
  • Experience using Nvidia NSight Systems
  • Experience implementing GPU kernels
  • Knowledge of computing fundamentals - what makes code fast, secure and reliable

Additional Information:

Job Posted:
January 01, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Machine Learning Engineer - Pre-Training

AI Engineer

As an AI Engineer at Eitan Medical, you will be part of a team committed to brin...
Location
Location
Israel , Netanya
Salary
Salary:
Not provided
eitanmedical.com Logo
Eitan Medical
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, Data Science, or a related STEM field (Master’s degree preferred)
  • Strong background in machine learning and data engineering
  • Proven experience deploying LLM-based or GenAI-powered applications (via APIs, frameworks, or pre-trained models)
  • Proficiency in Python and experience with AI/ML libraries (e.g., LangChain, Hugging Face, PyTorch, TensorFlow)
  • Experience with containerization and orchestration (Docker, Kubernetes, EKS/AKS)
  • Team player with excellent communication and collaboration skills, working effectively with multidisciplinary teams
  • Independent, proactive, and self-motivated, with a strong sense of ownership and the ability to drive initiatives from concept to delivery
  • Passion for continuous learning, staying at the forefront of AI and data innovation, and translating it into tangible impact
Job Responsibility
Job Responsibility
  • Integrate Generative AI (GenAI) capabilities into Eitan’s SaaS platforms to enhance clinical decision support, treatment optimization, and actionable medical insights
  • Identify and lead AI-driven initiatives across departments to streamline processes, boost productivity, and accelerate innovation
  • Design and implement AI-powered systems, including RAG architectures and agentic workflows, using frameworks such as LangChain, LlamaIndex, or similar
  • Develop effective prompt strategies and reasoning pipelines for adaptive, context-aware, and explainable AI behavior
  • Monitor and optimize AI system performance, maintaining accuracy, reliability, and safety in healthcare contexts
  • Stay ahead of emerging AI research and tools, evaluating new technologies for their potential to deliver measurable clinical and business impact
Read More
Arrow Right

Senior Machine Learning Engineer

Our client is looking for a Senior Machine Learning Engineer for a 6 month contr...
Location
Location
Canada , Toronto
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
March 10, 2026
Flip Icon
Requirements
Requirements
  • Deep Understanding of Machine Learning Concepts: Proficiency in fundamental machine learning concepts, algorithms, and techniques
  • Expertise in Natural Language Processing (NLP): Knowledge of NLP techniques and models, especially BERT and other transformer-based models, for tasks like text classification, sentiment analysis, and language understanding
  • Experience with Deep Learning Frameworks: Proficiency in deep learning libraries such as TensorFlow or PyTorch. Experience with implementing, training, and fine-tuning BERT models using these frameworks is crucial
  • Data Preprocessing Skills: Ability to perform text preprocessing, tokenization, and understanding of word embeddings
  • Programming Skills: Strong programming skills in Python, including experience with libraries like NumPy, Pandas, and Scikit-learn
  • Model Optimization and Tuning: Skills in optimizing model performance through hyperparameter tuning and understanding of trade-offs between model complexity and performance
  • Understanding of Transfer Learning: Knowledge of how to leverage pre-trained models like BERT for specific tasks and adapt them to custom datasets
  • Experience managing available resources such as hardware, data, and personnel so that deadlines are met
  • Experience analyzing the machine learning algorithms that could be used to solve a given problem and ranking them by their success probability
  • Experience exploring and visualizing data to gain an understanding of it, then identifying differences in data distribution that could affect performance when deploying the model in the real world
Job Responsibility
Job Responsibility
  • Creates machine learning models and utilizes data to train models
  • Focuses on analyzing data to find relations between the input and the desired output
  • Understands business objectives and develops models that help achieve them, along with metrics to track their progress
  • Designs and develops machine learning and deep learning systems
  • Runs machine learning tests and experiments
  • Implements appropriate machine learning algorithms
What we offer
What we offer
  • Earn a competitive rate within the industry
  • Potential for extension
Read More
Arrow Right

AI Engineer Associate

We’re looking for an AI Engineer Associate who wants to help shape the future of...
Location
Location
Sweden , Stockholm
Salary
Salary:
Not provided
predli.com Logo
Predli
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hold a degree in Machine Learning, Computer Science, Data Science, or a related field, and have a solid understanding of core ML principles
  • Skilled in programming across modern languages such as Python and TypeScript, and familiar with frameworks like PyTorch, TensorFlow, or Scikit-learn
  • Experienced working with databases, APIs, and data pipelines
  • Familiar with MLOps concepts, automation, and cloud environments
  • Based in Stockholm and fluent in English at a professional level
Job Responsibility
Job Responsibility
  • Design and build AI solutions that create real business value by applying pre-trained models in new and creative ways, especially within the rapidly evolving field of language models
  • Build and extend features in Predli Studio that push the boundaries of what’s possible with AI, creating tools that empower others and deliver real wow moments
  • Develop scalable APIs, backend services, and data pipelines for production-ready AI systems
  • Work with cloud infrastructure like AWS, GCP, and Azure, as well as container technologies such as Docker and Kubernetes
  • Collaborate with clients and non-technical stakeholders to translate ideas into working AI solutions
  • Explore and experiment with state-of-the-art AI tools and frameworks through internal innovation projects, keeping your skills sharp and your work at the forefront of the field
  • Participate in regular knowledge-sharing sessions and code reviews to exchange insights and improve our collective expertise
What we offer
What we offer
  • Be part of a tight-knit team where your ideas matter and your work creates real impact
  • Work across consulting, product development, and applied research with exposure to diverse technologies and industries
  • Grow in a collaborative environment that values creativity, shared ownership, and learning by doing
  • Contribute to Predli Studio, a platform that redefines how organizations build and deploy AI
  • Take part in internal R&D projects that explore the future of intelligent systems
  • Join a culture that values curiosity, openness, and continuous learning
  • Enjoy a flexible hybrid setup, global collaboration, and opportunities for professional development and travel
  • Competitive compensation with room for growth as you develop in the role
Read More
Arrow Right

Applied Researcher I (AI Foundations)

At Capital One, we are creating trustworthy and reliable AI systems, changing ba...
Location
Location
United States , New York; San Francisco; San Jose; Cambridge; McLean
Salary
Salary:
218700.00 - 272300.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research
  • PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
  • PhD focus on NLP or Masters with 5 years of industrial NLP research experience
  • Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
  • Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
  • Publications in deep learning theory
  • Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
  • PhD focused on topics related to optimizing training of very large deep learning models
  • Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
  • Experience optimizing training for a 10B+ model
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money
  • Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  • Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation
  • Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences
  • Flex your interpersonal skills to translate the complexity of your work into tangible business goals
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • Fulltime
Read More
Arrow Right

Applied Researcher I (AI Foundations)

At Capital One, we are creating trustworthy and reliable AI systems, changing ba...
Location
Location
United States , New York; San Francisco; San Jose; Cambridge; McLean
Salary
Salary:
218700.00 - 272300.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research
  • PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
  • LLM
  • PhD focus on NLP or Masters with 5 years of industrial NLP research experience
  • Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
  • Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
  • Publications in deep learning theory
  • Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
  • PhD focused on topics related to optimizing training of very large deep learning models
  • Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money
  • Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  • Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation
  • Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences
  • Flex your interpersonal skills to translate the complexity of your work into tangible business goals
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • Fulltime
Read More
Arrow Right

Applied Researcher II (AI Foundations)

At Capital One, we are creating trustworthy and reliable AI systems, changing ba...
Location
Location
United States , New York; San Francisco; San Jose; Cambridge; McLean
Salary
Salary:
262500.00 - 326800.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date plus 2 years of experience in Applied Research or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research
  • PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
  • PhD focus on NLP or Masters with 5 years of industrial NLP research experience
  • Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
  • Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
  • Publications in deep learning theory
  • Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
  • PhD focused on topics related to optimizing training of very large deep learning models
  • Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
  • Experience optimizing training for a 10B+ model
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products
  • Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal insights from data
  • Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation
  • Engage in high impact applied research to take the latest AI developments into the next generation of customer experiences
  • Translate the complexity of your work into tangible business goals
What we offer
What we offer
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Comprehensive, competitive, and inclusive set of health, financial and other benefits that support total well-being
  • Fulltime
Read More
Arrow Right

Applied Researcher II (AI Foundations)

At Capital One, we are creating trustworthy and reliable AI systems, changing ba...
Location
Location
United States , New York; San Francisco; San Jose; Cambridge; McLean
Salary
Salary:
262500.00 - 326800.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date plus 2 years of experience in Applied Research or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 4 years of experience in Applied Research
  • PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
  • PhD focus on NLP or Masters with 5 years of industrial NLP research experience
  • Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
  • Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
  • Publications in deep learning theory
  • Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
  • PhD focused on topics related to optimizing training of very large deep learning models
  • Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
  • Experience optimizing training for a 10B+ model
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money
  • Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  • Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation
  • Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences
  • Flex your interpersonal skills to translate the complexity of your work into tangible business goals
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • Fulltime
Read More
Arrow Right

Applied Researcher I (AI Foundations)

At Capital One, we are creating trustworthy and reliable AI systems, changing ba...
Location
Location
United States , New York; San Francisco; San Jose; Cambridge; McLean
Salary
Salary:
218700.00 - 272300.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research
  • PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields
  • PhD focus on NLP or Masters with 5 years of industrial NLP research experience
  • Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
  • Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
  • Publications in deep learning theory
  • Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
  • PhD focused on topics related to optimizing training of very large deep learning models
  • Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
  • Experience optimizing training for a 10B+ model
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money
  • Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  • Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation
  • Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences
  • Flex your interpersonal skills to translate the complexity of your work into tangible business goals
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • Fulltime
Read More
Arrow Right