Research Scientist / Engineer — Multimodal Agent
Posted 14 hours ago
Job Description
About Luma AI:
Luma’s mission is to build multimodal AGI. Our research on video, 3D, and now multimodal models has led us to believe that AI needs to be jointly trained over all signal modalities (text, video, audio, images), analogous to the human brain.
To advance our mission, we build and operate the full stack end-to-end, spanning foundation models, inference systems, and products. This integrated approach powers technologies like Ray3, which is seeing rapidly growing adoption among Fortune 500 companies across media, entertainment, and advertising. Backed by a recent $900M Series C and our partnership with Humain to build a 2 GW compute supercluster (Project Halo), our models and the Dream Machine platform are now enabling creatives worldwide to tell some of the most impactful stories of our time.
Where You Come In:
This is a rare and foundational opportunity to define the future of multimodal AI. You will be at the forefront of building and training large-scale multimodal models, directly impacting how users interact with pixels. This role offers the chance to bridge cutting-edge research with magical, shipped products, working end-to-end on novel problems with no existing playbook.
What You'll Do:
This role involves both the “science” and the “engineering” sides of research, which are equally important.
It is a multi-stack role in which you will work at the intersection of modeling, data, systems, and evaluation.
- Modeling: Architect large-scale multimodal agentic models that use reasoning, planning, coding, and tool calling to achieve complex, multi-step multimodal work.
- Data: Hill-climb on existing tasks and formulate new tasks through data. Design, implement, and run robust data pipelines for constructing, enriching, and filtering massive pixel datasets.
- Systems: Train large-scale multimodal models on massive datasets and GPU clusters.
- Evaluation: Define and build novel evaluation frameworks to measure multimodal agents.
Who You Are:
- Strong foundation in machine learning, foundation models, and agentic systems.
- Deep understanding of agentic systems and of approaches to LLM/VLM reasoning, coding models, and LLM/VLM tool calling.
- Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets).
What Sets You Apart (Bonus Points):
Experience with data, modeling, or evaluation for any of the following:
- State-of-the-art foundation models in reasoning
- State-of-the-art foundation models in coding
- State-of-the-art foundation models in tool calling
- State-of-the-art multimodal agents
Your application is reviewed by real people.