DeepMind
Company
Research Engineer (or Scientist), Speech and Language
Job Description
Snapshot
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
About Us
We're looking for scientists and engineers to push forward fundamental research and technology in Artificial Intelligence, specifically at the intersection of Speech, Language, and deep learning.
The Role
We are looking for a top-notch Research Scientist or Research Engineer to join a fast-paced audio and language team that is fundamental to Gemini audio, music / speech generation, representation learning, and diffusion theory projects. We're looking for someone hungry to dive into both theory and code: someone who will drive independent research initiatives, work with teams on large scale AI, and develop solutions to fundamental questions in machine learning and AI.
Key responsibilities:
- Design, rapidly implement in code, and rigorously evaluate cutting-edge deep learning algorithms and data curation for multimodal generative AI, with a particular emphasis on audio and video synthesis.
- Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing.
- Thriving under uncertainty, driving both team collaborations to meet ambitious research goals, as well as significant individual contributions.
About You
In order to set you up for success as a Research Engineer at Google DeepMind, we look for the following skills and experience:
- MS or PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, Speech Processing, or equivalent practical experience.
- Proven experience in deep learning research and development, particularly in generative AI and related to video and audio synthesis. This includes diffusion models and autoregressive generative models.
- Exceptional engineering skills in Python and deep learning frameworks (e.g., JAX, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems. Self-motivated to pick up technologies to adapt and move quickly.
- Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV).
In addition, the following would be an advantage:
- Knowledge of probabilistic machine learning and generative modeling (e.g. Diffusion, autoregressive models, GANs, flows, hierarchical VAEs, DDPMs).
- Demonstrated experience in large-scale training of multimodal generative models.
- Sequence processing experience with TensorFlow, PyTorch, or JAX.
- Bonus: knowledge of speech processing and language understanding, in particular text-to-speech synthesis and prosody modeling.
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
DeepMind
76 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 15 days ago
Research Engineer, Language
DeepMind
London, UKView details - 24 days ago
Research Scientist Intern, Language and Agents (PhD)
Meta
Bellevue, WA, Menlo Park, CA, Seattle, WA, New York, NYView details - 17 days ago
Research Scientist Intern, Language and Agents (PhD)
Meta
Bellevue, WA, Menlo Park, CA, Seattle, WA, New York, NYView details - 17 days ago
Research Scientist Intern, Vision-Language and Embodied AI (PhD)
Meta
Redmond, WAView details - 24 days ago
Research Scientist
DeepMind
Mountain View, California, USView details - 24 days ago
Research Scientist Intern, Rendering and Reconstruction (PhD)
Meta
Zurich, SwitzerlandView details - 24 days ago
Research Engineer, Language - Content and User Understanding Team
Meta
Menlo Park, CA, Seattle, WA, New York, NYView details - 22 days ago
Research Engineer (FICCO)
DRW
ChicagoView details - 22 days ago
Research Engineer (FICCO)
DRW
LondonView details - 17 days ago
Research Scientist Intern, Rendering and Reconstruction (PhD)
Meta
Zurich, SwitzerlandView details
Looking for something different?
Browse all AI jobs