Amazon
4 days ago
Machine Learning Engineer - AI/ML, AWS Neuron Inference
US, WA, Seattle
Full-time
Job Description
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer on the Machine Learning Inference Applications team. The engineer is responsible for developing and optimizing the performance of the core building blocks of LLM inference: attention, MLP, quantization, speculative decoding, Mixture of Experts, and others.
The team works side by side with chip architects, compiler engineers, and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models, including Llama 3.3 70B, Llama 3.1 405B, DBRX, and Mixtral.
Key job responsibilities
Responsibilities include adapting the latest research in LLM optimization to Neuron chips to extract the best performance from both open-source and internally developed models. Working across teams and organizations is key.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise, so you feel empowered to take on more complex tasks in the future.