DeepMind
Company
Research Engineer, Gemini Post Training, Model Behavior
Job Description
Snapshot
Artificial Intelligence could be one of humanity’s most useful inventions. At DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
About Us
The Model Behavior team focuses on aligning Gemini models with human needs. We ensure that Gemini is safe, robust, and reliable, while optimizing its helpfulness to be context-aware and deeply aligned with user intent.
The Role
We are looking for a Research Engineer with proven leadership experience building large-scale machine learning systems, with expertise in modeling, data engineering and evaluations, to help us develop the best models as part of the Gemini Post-Training team, with a focus on model behavior. In this role, you will help execute the team’s strategic roadmap, identify critical modeling needs and collaborate cross-functionally and across the stack to land improvements that truly make a difference to the user.
Key responsibilities:
- Develop techniques and data pipelines to optimize frontier Gemini models on key capabilities important for model behavior
- Build robust evaluations motivated by real-world use
- Partner with feature teams to prioritize modeling needs and ship Gemini-powered features.
- Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development.
- Mentor and guide junior engineers, fostering their professional growth and development.
About You
To set you up for success as a Research Engineer at DeepMind, we look for the following skills and experience:
- BS, MS or Ph.D. in Computer Science, Artificial Intelligence, or a related field, or equivalent practical experience
- Fluency in Python or C++ and JAX or similar ML framework
- Software engineering, ML and data engineering experience (e.g. model training / deployment, efficiency optimization, data pipeline design, ML infra)
- Experience building robust automatic or human evaluations for ML models
- Proven ability to lead engineering teams in fast-moving environments, under ambiguity
- Excellent communication and cross-functional collaboration skills, willingness to unblock others and pick up slack
Additionally, this role may be a good fit if you:
- Have hands-on experience with training and debugging LLMs. Experience with RL fine-tuning methods is a plus.
- Enjoy the end-to-end process of productionizing cutting-edge research and taking projects from proof-of-concept to launch
- Are passionate about applying AI for learning use cases
- Obsess over the user and are excited about incorporating user feedback into the model development process
- Are well versed in Google infrastructure, including for fine-tuning and serving Gemini models
The US base salary range for this full-time position is $166,000 to $244,000, plus bonus, equity, and benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.