CrawlJobs Logo

Machine Learning Engineering Team Lead

aignostics.com Logo

Aignostics

Location Icon

Location:
Germany , Berlin

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Lead a high-performing team focused on building large-scale distributed training infrastructure and workflows using cutting-edge technologies for digital pathology, powering our state-of-the-art Foundational Model development. This is a hands-on leadership role where you'll spend approximately 50% of your time on technical contributions while guiding your team to push the boundaries of machine learning for cancer research and diagnostics.

Job Responsibility:

  • Build and scale a high-performing team capable of tackling complex distributed ML challenges
  • Own the full employee lifecycle: recruiting, onboarding, performance management, career development, and retention
  • Empower your team members and help them grow in autonomy and technical expertise
  • Mentor engineers at all levels, fostering a culture of continuous learning and psychological safety
  • Create an inclusive environment where diverse perspectives drive innovation
  • Define and execute technical roadmaps aligned with company objectives and product needs
  • Lead resource allocation and capacity planning to balance team workload and business priorities
  • Own FinOps responsibilities: optimize cloud costs, track spending, and ensure efficient resource utilization
  • Ensure operational readiness through monitoring, incident response protocols, and system reliability practices
  • Establish and track KPIs for team performance, system efficiency and health
  • Design, develop, and maintain robust large-scale distributed training pipelines and ML infrastructure using cutting-edge technologies
  • Lead architecture decisions for distributed systems that enable efficient model development at scale
  • Hands-on contribution to critical technical challenges, including optimization of training pipelines and infrastructure
  • Drive technical excellence through code reviews and architectural guidance
  • Stay at the forefront of distributed training technologies and bring innovation to the team
  • Partner closely with Product teams to translate business requirements into technical solutions
  • Collaborate with (senior) Research Scientists to enable scalable model development and experimentation
  • Work with Platform Engineering to ensure robust infrastructure and tooling
  • Build strong relationships across engineering teams to drive alignment and knowledge sharing
  • Communicate technical concepts effectively to both technical and non-technical stakeholders

Requirements:

  • Bachelor's or Master's degree in Computer Science, Engineering, Mathematics, or a related field
  • 6+ years of software engineering or ML engineering experience, with at least 2 years in a technical leadership or team lead role
  • Proven track record of building and leading high-performing engineering teams
  • Experience guiding projects across the whole Software Development Life Cycle
  • Deep understanding of fundamental Machine Learning concepts and principles, familiarity with advanced model optimization techniques
  • Significant experience with large-scale distributed training systems and frameworks (especially PyTorch and NCCL)
  • Familiarity with GPUs, distributed systems, parallel computing and scaling laws
  • Advanced programming skills in Python, experience in performance-critical languages (C/C++ or CUDA) being a plus
  • Familiarity of MLOps/DevOps best practices including CI/CD, Docker, Kubernetes, and observability, cloud platforms (GCP, AWS or Azure) and infrastructure-as-code
  • Experience with Linux, version control, and container technologies
  • Demonstrated ability in resource allocation, capacity planning, and FinOps principles
  • Excellent problem-solving and data-driven decision-making skills in ambiguous situations
  • Effective communication and stakeholder management skills
  • Ability to give constructive feedback and navigate difficult conversations
  • Proven people leadership skills with experience managing the full employee lifecycle
  • Strategic thinking with ability to balance short-term execution and long-term vision
  • Experience with agile methodologies and iterative development processes
  • Proven ability to influence without authority and build consensus across teams
  • Track record of empowering team members and fostering autonomy

Nice to have:

  • Experience with production systems in a regulated or healthcare environments, familiarity with medical device standards (ISO 13485)
  • Experience working with biomedical or image data
  • Hands-on experience with Google Kubernetes Engine, SLURM and Ray distributed computing framework
  • Experience with advanced ML stack (TorchDyno, JAX, TensorRT)
  • Familiarity with Information Security standards (ISO 27001) in software development
  • Experience with FinOps tools and cloud cost optimization strategies
  • Demonstrated experience with leveraging LLM/Agentic systems to accelerate development
What we offer:
  • Learning & Development yearly budget of 1,000€ (plus 2 L&D days)
  • Language classes, and internal development programs
  • Access to leadership development programs and executive coaching
  • Flexible working hours and teleworking policy
  • 30 paid vacation days per year
  • Family & pet friendly and support flexible parental leave options
  • Subsidized membership of your choice among public transport, sports, and well-being
  • Social gatherings, lunches, and off-site events for a fun and inclusive work environment
  • Optional company pension scheme

Additional Information:

Job Posted:
January 03, 2026

Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Machine Learning Engineering Team Lead

Principal Data Scientist - Machine Learning Engineering

Atlassian is looking for a Principal Data Scientist to uncover valuable insights...
Location
Location
United States , San Francisco
Salary
Salary:
175100.00 - 233400.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience applying your Data Science skills to identify and lead projects which have had impact on business strategy and performance
  • 8+ years of experience in Data Science or related fields. (Preferred - 10+ years experience with a post-graduate degree in a quantitative discipline like Statistics, Mathematics, Econometrics, Computer science)
  • Expertise in applying a broad variety of ML methods including NLP and LLM to solve business problems and a strong sense of when to apply them to the problem at hand
  • Experience in managing ML projects end-to-end including deployment and monitoring
  • Expertise in SQL and a high level of proficiency in another data science programming language (e.g Python, R) with expertise in libraries like Pandas, Numpy, Scikit-learn etc.
  • A very high bar for output quality, while balancing 'having something now' vs. 'perfection in the future'
  • Comfort explaining complex concepts to diverse audiences and creating compelling stories for non-data experts
  • Proficiency in visualization tools (e.g. Streamlit, Tableau)
Job Responsibility
Job Responsibility
  • Influence strategy & important decisions around customer friction by surfacing data driven insights
  • Define, set and report on department level metrics or KRs to the CSS Executive team
  • Build and implement measurement frameworks, machine learning models and NLP/LLM tooling to accelerate Atlassian’s growth and improve product quality
  • Foster a world-class Data Science culture by leading training on technical concepts, driving continuous learning and mentoring Data Scientists on the team
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Staff Machine Learning Engineer

Join PagerDuty as a Staff Machine Learning Engineer to tackle complex problems, ...
Location
Location
Canada , Toronto
Salary
Salary:
156000.00 - 232000.00 CAD / Year
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience building, designing, and evolving data architecture for large-scale systems
  • Excellent communication skills
  • Experience working with Product teams, ensuring and driving a timely delivery
  • Have a deep understanding of the trade-offs to be considered when designing and delivering machine learning solutions to production
  • Experience leading cross-team architecture discussions, building technical prototypes, and driving the adoption of best practices across diverse teams
  • Demonstrated experience with data engineering processes, working with unstructured data and cloud-based data infrastructures
  • Passionate about ML engineering and interested in driving discussions with stakeholders and executives
Job Responsibility
Job Responsibility
  • Build and improve the capabilities of the data platform that enable and accelerate the production of ML/AI-based solutions
  • Drive and define standards for AI/ML across the organization
  • Provide guidance, technical leadership, and mentoring to other members of the team
  • Mentor junior members and participate in scaling up the existing team
  • Proactively recommend improvements and new approaches addressing potential systemic pain points and technical debt
  • Anticipate technical demands on the data platform based on the organization’s roadmap and systematically drive the evolution of the architecture toward those ends
  • Develop a long-term plan for ML/AI investments
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits package from day one
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer

As a Senior Machine Learning Engineer in the Central AI team, you will build and...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master or PhD in a quantitative subject (Statistics, Mathematics, Computer Science, Operations Research, or relevant work experience)
  • 3+ years of related industry experience in the data science domain
  • Expertise in Python or Java with and the ability to write performant production-quality code, familiarity with SQL, knowledge of Spark and cloud data environments (e.g. AWS, Databricks)
  • Experience building and scaling machine learning models in business applications using large amounts of data
  • Ability to communicate and explain data science concepts to diverse audiences, craft a compelling story
  • Focus on business practicality and the 80/20 rule
  • very high bar for output quality, but recognize the business benefit of "having something now" vs "perfection sometime in the future"
  • Agile development mindset, appreciating the benefit of constant iteration and improvement
Job Responsibility
Job Responsibility
  • Build and maintain the core infrastructure to allow machine learning engineers and data scientists to develop, train, evaluate, deploy, and operate Machine Learning models and pipelines
  • Use software development expertise to solve difficult problems, tackling complex infrastructure and architecture challenges
  • Design system and model architectures, conducting rigorous experimentation and model evaluations, and providing guidance to junior ML engineers
  • Lead other engineers to drive involved projects from technical design to launch
  • Collaborate with other teams and internal customers to set expectations, gather input and communicate results
What we offer
What we offer
  • Health and wellbeing resources
  • Paid volunteer days
  • Fulltime
Read More
Arrow Right

Principal Machine Learning Engineer

As a Principal Engineer on the ITSM team, you will get the opportunity to work o...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of total experience
  • Fluency in at least 1 scripting, OOP language
  • Solid understanding of machine learning concepts and algorithms, including supervised and unsupervised learning, deep learning, and NLP
  • Familiarity with popular ML libraries like sci-kit-learn, Keras/TensorFlow/PyTorch, numpy, pandas
  • Good Understanding of Machine Learning project lifecycle
  • Familiarity with MLOps and experience with scaling and deploying Machine Learning models
Job Responsibility
Job Responsibility
  • Work on cutting-edge AI and ML algorithms that help modernize IT Operations by reducing MTTR (mean time to resolve), and MTTI (Mean time to identify)
  • Use software development expertise to solve difficult problems, tackling complex infrastructure and architecture challenges
  • Lead engineers to drive involved projects from technical design to launch
  • Collaborate with other teams and internal customers to set expectations, gather input, and communicate results
  • Work with a distributed, world-class team shaping the future of AIOps
  • Master Generative AI
  • Become a machine learning maestro
  • Collaborate with diverse minds
  • Make a tangible impact
  • Routinely tackle complex architectural challenges
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Senior AI and Machine Learning Engineer

We are seeking Senior AI/ML & Innovation Engineer who will be leading initiative...
Location
Location
United States , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 7-10 years’ experience
  • Deep understanding of machine learning algorithms, such as linear regression, decision trees, support vector machines, random forests, deep learning models (e.g., neural networks), and reinforcement learning
  • A strong foundation in mathematics and statistics
  • Proficiency in programming languages such as Python, R, or Java
  • Strong understanding of GitHub CoPilot, Cursor, N8N, vibe coding, Windsurf, and similar technologies
  • Experience in Cloud Infrastructure (AWS, Azure, etc)
  • Knowledge of Open Source, Linux, etc
  • Understanding of Devops, SRE
  • Advanced knowledge and experience in deep learning
Job Responsibility
Job Responsibility
  • Conducts research and stays up to date with the latest advancements in AI and machine learning technologies, frameworks, and algorithms
  • Collaborates with cross-functional teams to understand business requirements and design AI and machine learning solutions
  • Develops, implements, and optimizes machine learning models and algorithms
  • Deploys machine learning models into production environments
  • Monitors the performance of deployed models
  • Organizes and leads comprehensive design review sessions
  • Works collaboratively with the engineering manager and team lead to set design and implementation standards
  • Regularly leads meetings
  • Has experience in providing technical leadership, mentorship, and guidance to junior team members
  • Develops and delivers strategic presentations and reports to senior stakeholders
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

AI and Machine Learning Engineer

AI and Machine Learning Engineer role at Hewlett Packard Enterprise, responsible...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Master's degree is desirable
  • Typically 5-7 years' experience
  • Deep understanding of machine learning algorithms (linear regression, decision trees, support vector machines, random forests, deep learning models, reinforcement learning)
  • Strong foundation in mathematics and statistics (linear algebra, calculus, probability theory)
  • Proficiency in programming languages such as Python, R, or Java
  • Experience with version control systems (e.g., Git)
  • Knowledge of libraries and frameworks like TensorFlow, PyTorch, sci-kit, Keras
  • Proficiency in using agentic frameworks like langGraph
  • Knowledgeable in lineage tracking of agentic architectures
Job Responsibility
Job Responsibility
  • Conduct advanced research in AI and machine learning
  • Stay up to date with latest advancements in the field
  • Explore emerging technologies
  • Identify opportunities to apply cutting-edge techniques
  • Evaluate traditional AI/ML and Gen-AI based applications
  • Design solutions considering scalability, performance, and maintainability
  • Provide technical guidance and mentorship to junior team members
  • Work with stakeholders to understand requirements and translate into technical solutions
  • Collaborate with cross-functional teams
  • Drive continuous improvement and innovation
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive benefits suite supporting physical, financial and emotional wellbeing
  • Career development programs
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer (AI Agent)

At Cresta, the AI Agent team is on a mission to create state-of-the-art AI Agent...
Location
Location
United States; Canada
Salary
Salary:
Not provided
cresta.com Logo
Cresta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Mathematics, or a related field
  • Master’s or Ph.D. preferred, or equivalent professional experience
  • 7+ years of hands-on industry experience with AI and machine learning
  • 3+ years of experience working with LLMs in large-scale production environments
  • Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs
  • Proven leadership in designing and deploying AI solutions at scale
  • Extensive practical knowledge of modern machine learning frameworks and technologies (e.g., PyTorch, Tensorflow, Hugging Face, NumPy)
  • Experience with distributed systems and cloud-based AI infrastructure
  • Strong problem-solving and strategic thinking abilities
  • Proven ability to lead cross-functional teams and work collaboratively to deliver innovative AI solutions in production
Job Responsibility
Job Responsibility
  • Design, develop, and deploy Cresta’s AI Agent solutions and proprietary models
  • Focus on practical AI challenges such as improving reasoning, planning capabilities, and evaluation in real-world scenarios
  • Collaborate with cross-functional teams including front-end and back-end software engineers to integrate AI Agents into Cresta’s customer solutions
  • Lead initiatives to scale AI systems for production environments, ensuring performance and reliability across use cases
  • Contribute to solving cutting-edge problems in AI and help define the future roadmap for Cresta’s AI Agents
  • Innovate and research ways to improve security, cost-efficiency, and reliability of AI systems
What we offer
What we offer
  • Variety of medical, dental, and vision plans
  • Paid parental leave
  • Monthly Health & Wellness allowance
  • Work from home office stipend
  • Lunch reimbursement for in-office employees
  • PTO: 3 weeks in Canada
  • Base salary, equity, and a variety of benefits
  • Fulltime
Read More
Arrow Right

Machine Learning Engineering Intern

At Cresta, the Knowledge Assist (KA) team develops AI solutions for the contact ...
Location
Location
Canada , Toronto
Salary
Salary:
45.00 - 70.00 USD / Hour
cresta.com Logo
Cresta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently pursuing a Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related technical field
  • Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow)
  • Strong understanding of machine learning fundamentals and generative modeling
  • Ability to design and analyze experiments involving large-scale datasets
  • Work authorization in the country of employment at the time of hire
Job Responsibility
Job Responsibility
  • Design, develop, and deploy Cresta’s KA solutions and proprietary models
  • Focus on practical AI challenges such as improving reasoning, and evaluation in real-world scenarios
  • Collaborate with cross-functional teams including front-end and back-end software engineers to integrate KA solutions into Cresta’s customer solutions
  • Lead initiatives to scale AI systems for production environments, ensuring performance and reliability across use cases
  • Contribute to solving cutting-edge problems in AI and help define the future roadmap for Cresta’s KA
  • Innovate and research ways to improve security, cost-efficiency, and reliability of AI systems
What we offer
What we offer
  • Lunch can be expensed (up to $25) while working in the office
  • PTO: 4 days
  • Compensation for this position includes a base salary, equity, and a variety of benefits
Read More
Arrow Right