AI Inference Engineer - Model Optimization & Deployment
Posted 2 days ago
Job Description
The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.
In this role, you will:
Architect and implement model conversion and compilation pipelines using TensorRT and TensorRT-LLM for edge deployment.
Perform rigorous parity checking, accuracy recovery, and latency benchmarking between PyTorch frameworks and compiled edge binaries.
Qualifications:
Bonus Qualifications:
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Accommodations
If you need an accommodation to participate in the application or interview process please reach out to accommodations@zoox.com or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
Zoox
9 jobs posted
About the job
Posted on
Apr 11, 2026
Apply before
May 11, 2026
Job typeFull-time
CategoryOther AI jobs
Location
Foster City, CASan Diego, CASeattle, WA
Similar Jobs
19d
AI Deployment Engineer
OpenAI
Tokyo, JapanAI Deployment Engineer
OpenAI
Tokyo, Japan19d13d
AI Inference Engineer
Perplexity
LondonAI Inference Engineer
Perplexity
London13d10d
Multimodal AI Model Optimization Research Engineer
Tavus
OKMultimodal AI Model Optimization Research Engineer
Tavus
OK10d3d
AI Systems Engineer – AI Model (Training & Inference)
AMD
MARKHAM, CanadaAI Systems Engineer – AI Model (Training & Inference)
AMD
MARKHAM, Canada3d23d
Generative AI Inference Engineer
Stability AI
United StatesGenerative AI Inference Engineer
Stability AI
United States23d19d
AI Deployment Engineer, Startups
OpenAI
SingaporeAI Deployment Engineer, Startups
OpenAI
Singapore19d18d
AI Deployment Engineer, Codex | Tokyo
OpenAI
Tokyo, JapanAI Deployment Engineer, Codex | Tokyo
OpenAI
Tokyo, Japan18d16d
Principal GenAI Inference Optimization Engineer
AMD
San Jose, CAPrincipal GenAI Inference Optimization Engineer
AMD
San Jose, CA16d16d
AI deployment engineer (UK)
Writer
London, United KingdomAI deployment engineer (UK)
Writer
London, United Kingdom16d13d
AI deployment engineer (US)
Writer
$132K - $185KSan Francisco, CAAI deployment engineer (US)
Writer
$132K - $185KSan Francisco, CA13d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.