Research Scientist, RL for Autonomous Planning & World Modeling
Posted 47 days ago
Job Description
This job posting has expired and no longer accepting applications.
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.
The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.
In this hybrid role, you will report to a Principal Scientist.
You will:
- Participate in Waymo’s Foundation World Model post-training and evaluation
- Research and develop cutting edge RL and Distillation techniques for Autonomous Vehicle Trajectory Planning
- Integrate emerging research from the broader AI community into Waymo’s internal RL infrastructure, conducting rigorous ablations to identify and scale the most promising methods
- Partner with engineering and research teams across Waymo to share recipes, techniques, and post-training best practices to accelerate our collective know-how
You have:
- PhD or Masters in Computer Science, Machine Learning, Robotics, or a similar technical field; with 3+ years of industry or post-doc research experience in Reinforcement Learning or Foundation Models
- Demonstration of original contributions to the field through high-impact publications (ArXiv, peer-reviewed conferences like NeurIPS/ICLR/CVPR), technical blog posts, or significant open-source contributions
- Proficiency in implementing model training flows in a scalable, distributed and performant manner such as Data parallel, FSDP and other sharding approaches
- A willingness to work with complexity of globally distributed inference infrastructure
We prefer:
- PhD in Computer Science, Machine Learning, or Robotics, with a research focus on Reinforcement Learning, Foundation Models, or Multi-Modal learning
- Extensive experience designing and deploying Reinforcement Learning infrastructure, specifically for on-policy learning or alignment with human preferences
- A consistent history of original contributions to the AI community, evidenced by first-author publications at top-tier venues (e.g., NeurIPS, ICLR, ICRA) or maintaining significant open-source ML projects
- Experience with large scale (many-machine) training infrastructure and techniques for inference with large models such as model sharding/tensor-parallel
((Remote jobs only - Please note that Waymo may not be able to employ remotely in all locations. Please speak with your recruiter about your preferred location for remote work when you begin the interview process.))
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.
Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
This job posting has expired and no longer accepting applications. Please check out our latest AI jobs.
Waymo
12 jobs posted
About the job
Jan 16, 2026
Feb 15, 2026
Similar Jobs
DeepMind
7 days agoResearch Scientist, Autonomous Agents — RL
London, United KingdomView detailsAdobe
24 days agoResearch Scientist - (applied research)
NoidaView detailsDeepMind
13 days agoResearch Scientist, Autonomous Agents
London, United KingdomView detailsOpenAI
6 days agoResearch Scientist, PhD
San Francisco, CAView detailsAnthropic
4 days agoResearch Engineer / Research Scientist, Tokens
New York City, NYSeattle, WASan Francisco, CA$350K - $500K/yrView detailsTikTok
22 days agoResearch Scientist, Intelligent Editing (Multimodality)
San Jose, CAView detailsWaymo
7 days agoSenior Applied Research Scientist, Perception
Mountain View, CASan Francisco, CA$204K - $259K/yrView detailsTikTok
2 days agoResearch Scientist, Intelligent Editing (Multimodality)
San Jose, CAView detailsAbridge
1 day agoResearch Scientist (Measurement and Evaluation)
NYC OfficeView detailsDeepMind
23 days agoResearch Scientist, AI for Extreme Weather Forecasting
London, United KingdomView details
Looking for something different?
Browse all AI jobsNever miss a new AI job
Get the latest AI jobs delivered to your inbox every week. Free, no spam.
