Software Engineer-AI/ML, AWS Neuron Inference
Posted 59 days ago
Job Description
This job posting has expired and no longer accepting applications.
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine
learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.
The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Llama 3.3 70B, 3.1 405B, DBRX, Mixtral, and so on.
Key job responsibilities
Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.
The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models such as Llama 3.3 70B, 3.1 405B, DBRX, Mixtral, and so on.
Key job responsibilities
Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
This job posting has expired and no longer accepting applications. Please check out our latest AI jobs.
Amazon
146 jobs posted
About the job
Posted on
Feb 17, 2026
Apply before
Mar 19, 2026
Job typeFull-time
CategoryOther AI jobs
Location
US, WA
Skills
llmllama
Similar Jobs
1d
Staff Software Engineer AI/ML
Samsung Semiconductor
$141K - $219KSan Jose, CaliforniaStaff Software Engineer AI/ML
Samsung Semiconductor
$141K - $219KSan Jose, California1d3d
AI/ML Framework Software Development Engineer
AMD
Austin, TexasAI/ML Framework Software Development Engineer
AMD
Austin, Texas3d27d
AI/ML Engineer
Epic Games
$141K - $206KBLANK,BLANK,Multiple LocationsAI/ML Engineer
Epic Games
$141K - $206KBLANK,BLANK,Multiple Locations27d27d
AI/ML Engineer
Epic Games
Cary, North CarolinaAI/ML Engineer
Epic Games
Cary, North Carolina27d17d
ML/AI Engineer
AMD
Austin, TexasML/AI Engineer
AMD
Austin, Texas17d16d
AI Inference Engineer
Perplexity
LondonAI Inference Engineer
Perplexity
London16d8d
AI-ML Engineer
Dell Technologies
Bratislava, SKAI-ML Engineer
Dell Technologies
Bratislava, SK8d1d
AI/ML Engineer
Hitachi
Chennai, Tamil Nadu, IndiaAI/ML Engineer
Hitachi
Chennai, Tamil Nadu, India1d28d
Software Engineer, ML (Training and Inference)
Isomorphic Labs
LondonSoftware Engineer, ML (Training and Inference)
Isomorphic Labs
London28d29d
Distinguished, Software Engineer -AI/ML Engineer – Agentic Systems
Walmart
$169K - $338KUnited StatesDistinguished, Software Engineer -AI/ML Engineer – Agentic Systems
Walmart
$169K - $338KUnited States29d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.