Wayve
Company
Research Scientist Intern, Reinforcement Learning
Job Description
At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.
About us
Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.
Our vision is to create autonomy that propels the world forward.  Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.  
In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.
At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.
Make Wayve the experience that defines your career!
The role
We’re looking for a curious and motivated Reinforcement Learning Intern to help advance the next generation of decision-making systems for autonomous driving. In this role, you’ll work embedded in a research team to develop scalable RL algorithms that enable vehicles to learn complex behaviors directly from experience — both in simulation and the real world.
The ideal candidate has experience in some combination of reinforcement learning, imitation learning, offline RL, or world modelling, and is motivated to apply cutting-edge research ideas to real-world embodied AI challenges. We’re particularly interested in temporal credit assignment and large-scale policy optimization for driving. Familiarity with representation learning and reward modelling is a plus, but not required.
If you’re curious, hands-on, and eager to explore new ideas, you’ll thrive here. We’re looking for people who enjoy experimenting, learning from failure, and turning research into impact — those who want to see their algorithms come alive in real-world autonomy challenges. If this all speaks to you, we can’t wait to meet you!
The role
What you will bring to Wayve
Essential:
- Currently pursuing a PhD or Masters in Computer Science, Robotics, Electrical Engineering, or a related field, with a focus on Machine Learning, AI, or Computer Vision.
 - Experience in research in Reinforcement Learning.
 - Interest in one or more: synthetic data, representation learning, and Offline RL.
 - Comfortable working in Python and libraries like PyTorch, NumPy, and Pandas.
 - A principled mindset: you enjoy brainstorming, making assumptions, building, testing, and iterating on ideas to see what works.
 
Desirable:
- Experience collaborating on research projects or contributing to shared codebases.
 - Publications or submissions at venues such as CVPR, ICCV, CoRL, NeurIPS, ICML, ICRA, or RSS are a nice bonus, but not required!
 
What we offer:
- The chance to be part of a truly mission driven organisation and an opportunity to shape the future of autonomous driving. Unlike our competitors, Wayve is still relatively small and nimble, giving you the chance to make a huge impact
 - Competitive compensation and benefitsA dynamic and fast-paced work environment in which you will grow every day - learning on the job, from the a diverse team of the brightest researchers and engineers in this space
 - A culture that is ego-free, respectful and welcoming!
 - Potential to publish your research work at a top flight conference
 
We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.
For more information visit Careers at Wayve.
To learn more about what drives us, visit Values at Wayve
DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.
Wayve
29 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 27 days ago
Research Scientist Intern, Reinforcement Learning (PhD)
Meta
Paris, FranceView details - 3 hours ago
Research Scientist Intern, Reinforcement Learning (PhD)
Meta
Paris, FranceView details - 21 days ago
Research Scientist Intern, Representation Learning
Meta
Paris, FranceView details - 3 hours ago
Research Scientist Intern, Representation Learning
Meta
Paris, FranceView details - 27 days ago
Research Scientist Intern, Reinforcement Learning and Large Language Models, PhD
Meta
Paris, FranceView details - 27 days ago
Research Scientist Intern, AI Core Machine Learning
Meta
Paris, FranceView details - 28 days ago
2026 Intern - Research Scientist
Adobe
LondonView details - 29 days ago
2026 Intern - Research Scientist
Adobe
ParisView details - 21 days ago
Research Scientist Intern, Representation Learning (PhD)
Meta
Paris, FranceView details - 28 days ago
2026 Intern - Research Scientist
Adobe
LondonView details 
Looking for something different?
Browse all AI jobs