Nuro
Company
Technical Lead, ML Training Infrastructure
Job Description
Who We Are
Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives the automakers and mobility platforms a clear path to AVs at commercial scale—empowering a safer, richer, and more connected future.
About the Role
Nuro is seeking an experienced Technical Lead to work on our ML Training stack. You will lead the work to optimize distributed training, job scheduling, model component libraries, and improve upon our performance analysis tools. In this role you will enable models to train faster and more efficiently – accelerating our self-driving roadmap of commercial and personal mobility.
About the Work
As TL for Nuro's ML Training Infrastructure, you will help define and execute the ML Framework roadmap, and drive initiatives in distributed training and efficiency optimizations to scale deep learning models. This will include:
- Help define the ML framework roadmap for the Training Infrastructure team.
- Build and maintain a scalable, distributed training platform with an emphasis on efficiency, determinism, and reproducibility for large-scale training jobs.
- Detect, diagnose, and resolve performance bottlenecks across training workflows, including input data pipelines and distributed training loops.
- Optimize scheduling, training performance, resource utilization, and ensure consistent, reproducible model training outcomes.
- Mentor and grow a high-performing team, fostering technical excellence and collaboration.
- Drive improvements in software quality that measurably raise reliability, efficiency, reproducibility, and determinism.
About You
- 6+ years of professional or research experience in ML infrastructure, distributed training, or ML systems engineering.
- Experience driving complex technical initiatives with stakeholder engagement.
- Expertise in PyTorch; familiarity with TensorFlow, and experience optimizing training performance (e.g., host offloading, quantization, reduced-precision training).
- Strong collaboration and communication skills, with a passion for exploring and promoting new approaches and technology.
Bonus Points
- Hands-on experience with CUDA, Triton, XLA, TPUs.
- Familiarity with ML compilers, ONNX, and intermediate representations.
- Experience with containerization (Docker), orchestration (Kubernetes), and ML pipeline tools (Airflow).
- Practical experience with JAX.
At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $222,775 and $333,925 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.
At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics. #LI-DNP
Nuro
16 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 24 days ago
Technical Lead Manager, ML Training Infrastructure
Nuro
Mountain View, California (HQ)View details - 3 days ago
AI & Technical Training Lead
Nasdaq
USA - New York City - New YorkView details - 22 days ago
Engineering Manager II, Ads ML Training Infrastructure
Pinterest
San Francisco, CA, US; Palo Alto, CA, US; Seattle, WA, USView details - 29 days ago
Senior AI/ML Infrastructure Engineer
AMD
Austin, TexasView details - 29 days ago
Senior AI/ML Infrastructure Engineer
AMD
Austin, TexasView details - 17 days ago
Technical Program Manager, Applied ML
ScaleAI
San Francisco, CA; New York, NYView details - 2 days ago
Lead Product Manager - AI/ML
Paypal
San Jose, California, United States of AmericaView details - 22 days ago
Full-Cycle Technical Recruiter, AI/ML
Waymo
Mountain View, CA, USAView details - 23 days ago
Software Development Engineer, ML Infrastructure Team
Amazon
US, WA, SeattleView details - 23 days ago
Technical Solutions Director - Spark/ML/AI
Databricks
Sao Paulo, BrazilView details
Looking for something different?
Browse all AI jobs