Job Description
In this role, you will:
Build, pre-train, and evaluate large-scale multi-modality foundation models from the ground up, successfully aligning diverse data streams (e.g., Vision, LiDAR, Radar, Language, Audio).
Define and execute the ML roadmap for deploying these multi-modality representations to the vehicle.
Architect and implement Knowledge Distillation pipelines to compress large-capacity multi-modal teacher models into highly efficient, production-ready student models.
Build high-quality training and evaluation datasets, applying advanced data-centric techniques to maximize cross-modal representation learning and student model convergence.
Collaborate with downstream perception teams to integrate and validate the performance, robustness, and latency of your models in on-board production systems.
Qualifications:
MS or PhD in Computer Science, Machine Learning, or a related technical field with demonstrated professional experience.
Deep, proven expertise in building and training large-scale multi-modality foundation models (e.g., Vision-Language Models (VLMs), Vision-Audio-Text, or Vision-LiDAR-Radar architectures).
Strong understanding of cross-modal alignment, multi-modal attention mechanisms, and large-scale pre-training techniques.
Proven experience in Knowledge Distillation (KD), model compression, and training highly efficient student models for production environments.
Proficiency in ML frameworks (e.g., PyTorch) and experience building large-scale ML training and evaluation pipelines.
Bonus Qualifications:
Experience in the Autonomous Driving or robotics industry.
Experience with model deployment, optimization, and hardware constraints (e.g., C++ for inference, TensorRT, quantization, pruning).
Publications in top-tier conferences (CVPR, ICCV, NeurIPS, ICLR, ACL) related to multi-modality foundation models, cross-modal learning, or model compression.
Zoox
7 jobs posted
About the job
Mar 12, 2026
Apr 11, 2026
Similar Jobs
7d
Machine Learning Engineer, Foundation Models
Grab
HCMC, VietnamMachine Learning Engineer, Foundation Models
Grab
HCMC, Vietnam7d
9dSenior Machine Learning Engineer - AI Foundation
XPENG
$175K - $296KSanta Clara, CA
Senior Machine Learning Engineer - AI Foundation
XPENG
$175K - $296KSanta Clara, CA9d
9dStaff Machine Learning Engineer - AI Foundation
XPENG
$215K - $364KSanta Clara, CA
Staff Machine Learning Engineer - AI Foundation
XPENG
$215K - $364KSanta Clara, CA9d23d
Machine Learning Engineer
Otter
$155K - $207KMountain View, CAMachine Learning Engineer
Otter
$155K - $207KMountain View, CA23d23d
Machine Learning Engineer
Faculty
LondonMachine Learning Engineer
Faculty
London23d24d
Machine Learning Engineer
Paypal
$118K - $174KChicago, IllinoisMachine Learning Engineer
Paypal
$118K - $174KChicago, Illinois24d21d
Machine Learning Engineer
Grab
Petaling Jaya, Selangor, MalaysiaMachine Learning Engineer
Grab
Petaling Jaya, Selangor, Malaysia21d15d
Machine Learning Engineer
Grab
Petaling Jaya, Selangor, MalaysiaMachine Learning Engineer
Grab
Petaling Jaya, Selangor, Malaysia15d14d
Machine Learning Engineer
Visa
Bengaluru, IndiaMachine Learning Engineer
Visa
Bengaluru, India14d14d
Machine Learning Engineer
Yahoo
$111K - $231KUnited StatesMachine Learning Engineer
Yahoo
$111K - $231KUnited States14d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.