CrawlJobs Logo

Reinforcement learning intern

enchanted.tools Logo

Enchanted Tools

Location Icon

Location:
France, Paris

Category Icon
Category:
Research and Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

As a Reinforcement Learning Intern, you will help develop and implement learning-based navigation and control algorithms for the Mirokai humanoid robot, which balances dynamically on a ball. You will work closely with the team to extend our simulation environments, train agents, and validate policies on real hardware. This internship offers deep hands-on experience in RL for real-world robotics — from simulation to deployment.

Job Responsibility:

  • Develop, debug, and test reinforcement learning algorithms for locomotion and navigation on a dynamically balancing base
  • Extend simulation environments (Isaac Sim / Isaac Lab) to support training and evaluation of RL policies
  • Integrate trained policies into the Mirokai software stack and validate them on physical robots
  • Analyze performance, stability, and sim-to-real transfer aspects
  • Stay up to date with recent research in reinforcement learning for robotics

Requirements:

  • BSc holder in Robotics, Engineering, Computer Science, or related field
  • Coursework or project experience in reinforcement learning or learning-based control
  • Strong Python skills and knowledge of a deep learning framework PyTorch, JAX, or TensorFlow
  • Familiarity with simulation environments such as Isaac Sim, Mujoco, or Gazebo
  • Solid analytical and problem-solving abilities

Additional Information:

Job Posted:
December 08, 2025

Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Reinforcement learning intern

New

PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning)

Skydio builds the world’s most advanced autonomous drones used across inspection...
Location
Location
Switzerland , Zurich
Salary
Salary:
50.00 EUR / Hour
skydio.com Logo
Skydio
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD student in Robotics, Machine Learning, Controls, or related field
  • Strong fundamentals in RL, control theory, and motion planning
  • comfort with safety/robustness concepts
  • Proficient in Python (PyTorch/JAX/Ray RLlib) and at least one of C++ or CUDA
  • Hands-on experience with robotics simulation (Isaac Lab/MuJoCo/PyBullet) and sim2real techniques
  • Experience training/deploying policies for navigation, manipulation, or locomotion on real robots or autonomous vehicles
Job Responsibility
Job Responsibility
  • Develop and deploy reinforcement learning (and adjacent policy-learning methods) that make Skydio aircraft plan, navigate, and control themselves more intelligently—safely, reliably, and efficiently—across our ecosystem: handheld apps, ground control, cloud autonomy services, and fleet workflows
  • Navigation & avoidance in the wild: Train policies that adapt online to cluttered 3D scenes (forests, bridges, urban canyons), complementing our geometric stack for robust obstacle avoidance and dynamic goal-seeking
  • RL-augmented planning: Fuse learned cost shaping / value functions with trajectory optimization for smooth, agile flight with tight safety envelopes and mission constraints
  • Sim → Real at scale: Build scalable datasets and training loops with Isaac Lab, domain randomization, residual learning, and safety filters
  • validate on real drones weekly
  • Human-in-the-loop shared control: Learn assistive policies that blend pilot intent, autonomy priors, and uncertainty-aware behaviors for intuitive control handoffs
  • Fleet & multi-agent: Explore decentralized coordination for coverage, pursuit, and collaborative mapping with minimal comms
Read More
Arrow Right
New

AI Research Engineer - Reinforcement Learning

At Helsing we deliver AI-based capabilities and the enabling infrastructure that...
Location
Location
Germany , Munich
Salary
Salary:
Not provided
helsing.ai Logo
Helsing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hold MSc in machine learning with a speciality in either reinforcement learning, multi-agent systems, automation and control, or robotics
  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently both internally and externally
  • Are passionate about keeping up-to-date with current research and enjoy reimplementing / extending papers on state-of-the-art Deep Learning-based approaches
  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust, Java, or modern C++, and experience deploying AI software to production including testing, QA, and monitoring
Job Responsibility
Job Responsibility
  • Design, train and deploy agents in complex multi-agent environments
  • Contribute to our reinforcement learning stack by implementing, improving and extending the current state of the art in multi-agent reinforcement learning
  • Be a part of impactful projects and will collaborate with people across several teams and backgrounds to integrate cutting edge ML/AI in our production systems
What we offer
What we offer
  • Competitive compensation and stock options
  • Relocation support
  • Social and education allowances
  • Regular company events and all-hands to bring together employees as one team across Europe
  • A hands-on onboarding program (affectionately labelled “AI-duction”), in which you will be familiarising yourself with our tools and ML pipelines used across the company
  • Fulltime
Read More
Arrow Right

Machine Learning Research Associate

The Machine Learning research team at Hewlett Packard Labs seeks highly motivate...
Location
Location
United States , Milpitas
Salary
Salary:
43.27 - 93.15 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Pursuing a Ph.D. degree (with significant research and innovation experience) in a relevant discipline (e.g. machine learning, computer science, electrical engineering, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in machine learning
  • Experience in deep learning, LLM, Agentic AI, and reinforcement learning research
  • Experience in developing deep learning software with high proficiency in data structures and algorithms
  • Experience in Machine Learning frameworks like PyTorch - required
  • Strong programming skills and experience with Python
  • Software development experience in Deep Learning, GPU acceleration, and Model Optimization
  • Demonstrated effective communication and collaboration skills
  • Demonstrated ability for original research papers published in top-tier conferences or journals.
Job Responsibility
Job Responsibility
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Work on cutting-edge machine learning research focusing on Large Language Models, Agentic AI, and Reinforcement Learning
  • Contribute along the full range from initial novel ideas to design, development, implementation, evaluation, and technology transfer
  • Publish in top AI conferences and workshops, including NeurIPS, AAAI, and ICML.
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist

This role focuses on cutting-edge research and development in Artificial Intelli...
Location
Location
United States , Milpitas
Salary
Salary:
117500.00 - 270000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning for the dissertation
  • extensive experience in deep learning research, preferably in Large Language Models or Reinforcement Learning
  • experience developing applications with deep learning frameworks like PyTorch with a high software proficiency
  • strong programming skills in Python, data structures, and algorithms are required
  • experience with ML model optimization, GPU acceleration, heterogeneous computation, system software, and performance optimization desired
  • experience in Python Web Frameworks – Django, Flask - a plus but not required.
Job Responsibility
Job Responsibility
  • conducting research, developing solutions, and creating intellectual property in emerging fields like reinforcement learning, LLMs, digital twins, clean energy, data center optimization, and sustainability
  • developing advanced technologies for analysis, optimization, time series forecasting, uncertainty quantification, and control
  • providing thought leadership, collaborating internally and externally, and contributing to HPE’s strategy by identifying emerging technologies
  • publishing in top conferences like NeurIPS, AAAI, and ACL
  • developing patent applications
  • software development, GPU acceleration, model optimization, and real-time data streaming to create robust AI solutions for real-world use cases.
What we offer
What we offer
  • a competitive salary and extensive social benefits
  • diverse and dynamic work environment
  • work-life balance and support for career development
  • health and wellbeing programs
  • personal and professional development programs
  • diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right

Engineering Director

We are seeking a seasoned Engineering Director who thrives in challenging and fa...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant work experience as a director or similar position working across multiple stakeholder organizations, with at least 10+ years of people leadership experience specific to SW and Cloud engineering
  • Solid experience leading SW development across storage, networking, on-prem, and SaaS is a must
  • Experience in setting up geographically distributed sites
  • Must have a strong background in software development lifecycle including cloud infrastructure
  • Familiarity with agile methodologies and tools like JIRA
  • Prior experience in cloud product development and deployments
  • end to end ownership and accountability
  • Solid understanding of fundamental AI and machine learning concepts, including supervised and unsupervised learning, deep learning, reinforcement learning, natural language processing, computer vision, and statistical modeling
  • Extensive business acumen, technical knowledge, and industry experience encompassing one or more engineering, technology, and product domains
  • Demonstrated abilities to drive transformation across a business with exceptional skills in the management of change
Job Responsibility
Job Responsibility
  • Oversee the Puerto Rico Site daily operations, strategic planning and cross-functional team leadership for Hybrid Cloud
  • Recruit, mentor, and manage teams of AI/ML engineers, QA Engineers, Design Engineers and innovation specialists to deliver cutting-edge solutions
  • Continuously evaluate new tools, platforms, and frameworks in AI/ML to drive competitive advantage and operational efficiency
  • Ensure alignment with corporate goals while fostering a high-performance culture, operational efficiency, and employee engagement
  • Lead the development and execution of AI/ML strategies that align with business goals and drive innovation across products, services, or operations
  • Create strategic and tactical operations and resource plans, goals, and priorities for assigned organization based on business and technology roadmap and functional objectives
  • Engage with various senior leaders across the organization, program managers, R&D, support, Quality, product managers, technical leaders and executives to communicate program status, escalate issues, and guide and influence strategic decision-making
  • Manage senior relationships and escalated issues with outsourced partners and suppliers, including setting expectations regarding deliverables, product quality, schedules, and costs
  • ensures that organization is effectively leveraging outsourced resources
  • Identify opportunities for and drive organizational initiatives and programs to support business process improvements and cost reductions
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Distinguished Technologist, Deep Learning

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Oversee build of OpsRamp’s CoPilot for Autonomous Operations for the Hybrid Cloud
  • Understand latest in GenAI/ML for ITOM
  • Understand cloud-native architecture concepts and have knowledge of best practices for high availability, scalability, resilience, performance, and security requirements in the cloud
  • Act as a cross-functional product and technical expert for GenAI within engineering with close working relationships with customers, product management, support, and marketing supporting edge-to-cloud services offering
  • Provides consultation, design input, and feedback for product development and design reviews across multiple organizations and architectures
  • Help transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Creates technical content such as designs, specifications, and initial software implementations
  • Guides and mentors less-experienced staff members to set an example of software systems design and development innovation and excellence, helping to grow engineers into more senior technical roles
  • Collect product feedback from field interactions to provide input into Engineering and Product Management to influence product roadmap direction
  • Maintain a high level of knowledge of OpsRamp SaaS product and product road maps, as well as that of the competition and prospective strategic partners
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Distinguished Technologist, Cloud Development (AI/ML)

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Lead strategy and innovation across OpsRamp’s Intelligent Observability portfolio
  • Champion HPE OpsRamp’s position with HPE customers and GTM partners externally and HPE internal cross-functional stakeholders
  • Drive technical strategy for emerging GenAI trends across Hybrid Observability and AIOps for cloud-scale modern applications
  • Design and introduce new products to the market
  • Provide consultation, design input, and feedback for product development and design reviews
  • Transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Guide and mentor less-experienced staff members.
What we offer
What we offer
  • Health and wellbeing benefits
  • Career development programs
  • Diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right
New

Senior ML Engineer

As a Senior ML Engineer at Provectus, you'll be responsible for designing, devel...
Location
Location
Colombia , Medellín; Bogotá; Cali; Barranquilla; Bucaramanga
Salary
Salary:
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • ML Fundamentals: supervised, unsupervised, and reinforcement learning
  • Model Development: feature engineering, model training, evaluation, hyperparameter tuning, and validation
  • ML Frameworks: classical ML libraries, TensorFlow, PyTorch, or similar frameworks
  • Deep Learning: CNNs, RNNs, Transformers
  • LLM Applications: Experience building production LLM-based applications
  • Prompt Engineering: Ability to design effective prompts and chain-of-thought strategies
  • RAG Systems: Experience building retrieval-augmented generation architectures
  • Vector Databases: Familiarity with embedding models and vector search
  • LLM Evaluation: Experience with evaluation metrics and techniques for LLM outputs
  • Python: Advanced proficiency in Python for ML applications
Job Responsibility
Job Responsibility
  • Design and implement end-to-end ML solutions from experimentation to production
  • Build scalable ML pipelines and infrastructure
  • Optimize model performance, efficiency, and reliability
  • Write clean, maintainable, production-quality code
  • Conduct rigorous experimentation and model evaluation
  • Troubleshoot and resolve complex technical challenges
  • Mentor junior and mid-level ML engineers
  • Conduct code reviews and provide constructive feedback
  • Share knowledge through documentation, presentations, and workshops
  • Collaborate with cross-functional teams (DevOps, Data Engineering, SAs)
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.