Job Description
Snapshot
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The Gemini Safety team is accountable for the safety and fairness behavior of GDM’s latest Gemini models. The role of the Research Scientist / Research Engineer will be to apply and develop data and algorithmic cutting edge solutions to advance GDM’s latest user-facing models. The workstyle is fast paced, and highly collaborative. The team has a strong culture of support, dedication and collaboration.
About Us
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The Role
We’re looking for a versatile Research Scientist at ease both with figuring out how to approach new research questions, and the technical implementation of research ideas. Our team focuses on advancing the safety and fairness behavior of state of the art AI models. We drive the development of the foundational technology adopted by numerous product areas including Gemini App, Cloud API, and Search.
Key responsibilities:
- Post-training / instruction tuning state of the art LLMs, focusing on text-to-text, image/video/audio-to-text modalities and agentic capabilities
- Exploring data, reasoning and algorithmic solutions to make sure Gemini Models are safe, maximally helpful, and work for everyone.
- Improve Gemini’s adversarial robustness, with a focus on high-stakes abuse risks.
- Design and maintain high quality evaluation protocols to assess model behavior gaps and headroom related to safety and fairness.
- Develop and execute experimental plans to address known gaps, or construct entirely new capabilities
- Drive innovation and enhance understanding of Supervised Fine Tuning and Reinforcement Learning fine-tuning at scale
About You
In order to set you up for success as a Research Scientist on the Gemini Safety team we look for the following skills and experience:
- PhD in Computer Science, a related field, or equivalent practical experience.
- Significant LLM post-training experience
In addition, the following would be an advantage:
- Experience in Reward modeling and Reinforcement Learning for LLMs Instruction tuning
- Experience with Long-range Reinforcement learning
- Experience in areas such as Safety, Fairness and Alignment
- Track record of publications at NeurIPS, ICLR, ICML, RL/DL, EMNLP, AAAI, UAI
- Experience taking research from concept to product
- Experience with collaborating or leading an applied research project
- Experience with JAX
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
DeepMind
45 jobs posted
About the job
Similar Jobs
16h
Research Scientist, Gemini Diffusion
NewDeepMind
London, United KingdomResearch Scientist, Gemini Diffusion
NewDeepMind
London, United Kingdom16h19d
Research Scientist, AQUA
DeepMind
Bangalore, IndiaResearch Scientist, AQUA
DeepMind
Bangalore, India19d17d
Senior Research Scientist
MongoDB
Sydney, NSW, AustraliaSenior Research Scientist
MongoDB
Sydney, NSW, Australia17d11h
Research Scientist, Audio
NewDeepMind
$147K - $211KNew York CityResearch Scientist, Audio
NewDeepMind
$147K - $211KNew York City11h3d
Research Scientist, Multimodal Alignment, Safety, and Fairness
DeepMind
$147K - $211KKirklandMountain ViewNew York CityResearch Scientist, Multimodal Alignment, Safety, and Fairness
DeepMind
$147K - $211KKirklandMountain ViewNew York City3d21d
Research Scientist, Autonomous Agents
DeepMind
London, United KingdomResearch Scientist, Autonomous Agents
DeepMind
London, United Kingdom21d15d
Research Engineer / Research Scientist, Pretraining
Anthropic
Zürich, SwitzerlandResearch Engineer / Research Scientist, Pretraining
Anthropic
Zürich, Switzerland15d14d
Research Scientist, PhD
OpenAI
San Francisco, CAResearch Scientist, PhD
OpenAI
San Francisco, CA14d9d
Research Scientist, Post-AGI Research
DeepMind
London, United KingdomResearch Scientist, Post-AGI Research
DeepMind
London, United Kingdom9d9d
Senior Research Scientist - Personalization
Yahoo
$128K - $267KUnited StatesSenior Research Scientist - Personalization
Yahoo
$128K - $267KUnited States9d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.