DeepMind
Research Scientist/Engineer, Model Threat Defense
Job Description
Snapshot
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
About Us
Model distillation is a key innovation enabling the acceleration of AI, turning large general models into small and specialized models used across the industry. However, distillation techniques can also be used to steal critical model capabilities, representing a significant threat to the intellectual property and integrity of our foundational models.
The Role
As part of the Security & Privacy Research Team at Google DeepMind, you will take on a holistic role in securing our AI assets. You will both identify unauthorized distillation attempts and actively harden our models against distillation. This is a unique opportunity to contribute to the full lifecycle of defense for the Gemini family of models. You will be at the forefront of detecting threats in the wild and building resilience into our models.
Key Responsibilities
- Research Defense Strategies: Research techniques to detect distillation and to actively defend against it.
- Deploy Detection & Mitigation Systems: Design and build systems that detect and mitigate unauthorized capability extraction.
- Evaluate Impact: Rigorously measure the effectiveness of defense mechanisms, balancing the trade-offs between model robustness, defensive utility, and core model performance.
- Collaborate and Publish: Work closely with world-class researchers across GDM, Google, and the industry to publish groundbreaking work, establish new benchmarks, and set the standard for responsible AI defense.
About You
We are looking for a creative and rigorous research scientist, research engineer, or software engineer who is passionate about trailblazing the critical field of model defense. You thrive on ambiguity and are comfortable working across the spectrum of security—from thinking like an adversary to building proactive protections. You are driven to build robust systems that protect the future of AI development.
Minimum qualifications:
- Ph.D. in Computer Science or a related quantitative field, or a B.S./M.S. in a similar field with 2+ years of relevant industry experience.
- Demonstrated research or product expertise in a field related to model security, adversarial ML, post-training, or model evaluation.
- Experience designing and implementing large-scale ML systems or counter-abuse infrastructure.
Preferred qualifications:
- Deep expertise in one or more of the following areas: model distillation, model stealing, security, memorization, Reinforcement Learning, Supervised Fine-Tuning, or Embeddings.
- Proven experience in Adversarial Machine Learning, with a focus on designing and implementing model defenses.
- Strong software engineering skills and experience with ML frameworks like JAX, PyTorch, or TensorFlow.
- A track record of landing research impact or shipping production systems in a multi-team environment.
- Current or prior US security clearance.
The US base salary range for this full-time position is $166,000 to $244,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.