AMD
Company
Staff AI Engineer - LLM, Gen AI, Deep Learning Inference Solutions
San Jose, California
Job Description
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE: We are looking for an AI Product Development Engineer to join our AI Solution Engineering organization, supporting the AMD Ryzen AI product line. In this role, you will operate at the intersection of AI software development, productization, and ecosystem enablement to help design, optimize, and scale AI workloads on the AMD Ryzen AI platform. You will act as a trusted technical advisor, enabling customers to adopt the Ryzen AI Software stack, optimize performance, and accelerate deployment. This position offers a unique opportunity to work at the crossroads of cutting-edge hardware and AI software, influencing AMD’s future roadmap for Edge and Endpoint AI through deep customer engagement. KEY RESPONSIBILITIES: Drive end-to-end productization of AI inferencing solutions across AMD CPU, NPU, and AI PC platforms, with a strong focus on flow design, usability, and performance analysis. Partner with software developers to co-own AI software and model optimization efforts, provide early feedback on usability and specifications, and identify potential pain points through holistic product usage analysis. Engage with internal and external stakeholders to troubleshoot workflows, meet performance or accuracy goals, and create reproducible use cases that drive adoption of AMD inference solutions. Collaborate with sales and marketing teams to support strategic business engagements and customer success. Develop state-of-the-art Deep Learning and Generative AI/LLM model use cases, applications, methodologies, and technical guides covering inference software components such as the Compiler, Quantizer, Optimizer, Runtime, Profiler, Visualizer, and supporting libraries. PREFERRED EXPERIENCE: MSEE with 4–12 years of demonstrated experience in AI/ML software development. Strong expertise in Python and C++. Hands-on experience with AI frameworks such as PyTorch, ONNX, llama.cpp, etc. Work experience with one or more popular Deep Learning or Generative AI model architectures such as CNNs, Transformers, LLMs, Stable Diffusion, etc. Proficiency in model analysis, optimization, performance benchmarking, and accuracy measurement. Experience with quantization and pruning of neural networks is preferred. Familiarity with latest Generative AI architectures such as LLM, VLM, VLA, MoE, with exposure to models like Llama, Gemma, Qwen, DeepSeek, Mistral, Phi, etc. Experience working on or contributing to open-source AI projects is a plus. Experience with modern inference frameworks such as vLLM, SGLang, Dynamo, TensorRT-LLM, etc. is preferred. Familiarity with any AI accelerator SDK such as NVIDIA TensorRT, Intel OpenVINO, Qualcomm Neural Processing SDK, AWS Inferentia, etc. will set you apart from other candidates. Strong documentation and presentation skills for clear and concise communication. ACADEMIC CREDENTIALS: Master’s in computer engineering or computer science or electrical engineering, or comparable disciplines LOCATION: San Jose, CA Longmont, CO #LI-TC1 Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
THE ROLE: We are looking for an AI Product Development Engineer to join our AI Solution Engineering organization, supporting the AMD Ryzen AI product line. In this role, you will operate at the intersection of AI software development, productization, and ecosystem enablement to help design, optimize, and scale AI workloads on the AMD Ryzen AI platform. You will act as a trusted technical advisor, enabling customers to adopt the Ryzen AI Software stack, optimize performance, and accelerate deployment. This position offers a unique opportunity to work at the crossroads of cutting-edge hardware and AI software, influencing AMD’s future roadmap for Edge and Endpoint AI through deep customer engagement. KEY RESPONSIBILITIES: Drive end-to-end productization of AI inferencing solutions across AMD CPU, NPU, and AI PC platforms, with a strong focus on flow design, usability, and performance analysis. Partner with software developers to co-own AI software and model optimization efforts, provide early feedback on usability and specifications, and identify potential pain points through holistic product usage analysis. Engage with internal and external stakeholders to troubleshoot workflows, meet performance or accuracy goals, and create reproducible use cases that drive adoption of AMD inference solutions. Collaborate with sales and marketing teams to support strategic business engagements and customer success. Develop state-of-the-art Deep Learning and Generative AI/LLM model use cases, applications, methodologies, and technical guides covering inference software components such as the Compiler, Quantizer, Optimizer, Runtime, Profiler, Visualizer, and supporting libraries. PREFERRED EXPERIENCE: MSEE with 4–12 years of demonstrated experience in AI/ML software development. Strong expertise in Python and C++. Hands-on experience with AI frameworks such as PyTorch, ONNX, llama.cpp, etc. Work experience with one or more popular Deep Learning or Generative AI model architectures such as CNNs, Transformers, LLMs, Stable Diffusion, etc. Proficiency in model analysis, optimization, performance benchmarking, and accuracy measurement. Experience with quantization and pruning of neural networks is preferred. Familiarity with latest Generative AI architectures such as LLM, VLM, VLA, MoE, with exposure to models like Llama, Gemma, Qwen, DeepSeek, Mistral, Phi, etc. Experience working on or contributing to open-source AI projects is a plus. Experience with modern inference frameworks such as vLLM, SGLang, Dynamo, TensorRT-LLM, etc. is preferred. Familiarity with any AI accelerator SDK such as NVIDIA TensorRT, Intel OpenVINO, Qualcomm Neural Processing SDK, AWS Inferentia, etc. will set you apart from other candidates. Strong documentation and presentation skills for clear and concise communication. ACADEMIC CREDENTIALS: Master’s in computer engineering or computer science or electrical engineering, or comparable disciplines LOCATION: San Jose, CA Longmont, CO #LI-TC1
Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
THE ROLE: We are looking for an AI Product Development Engineer to join our AI Solution Engineering organization, supporting the AMD Ryzen AI product line. In this role, you will operate at the intersection of AI software development, productization, and ecosystem enablement to help design, optimize, and scale AI workloads on the AMD Ryzen AI platform. You will act as a trusted technical advisor, enabling customers to adopt the Ryzen AI Software stack, optimize performance, and accelerate deployment. This position offers a unique opportunity to work at the crossroads of cutting-edge hardware and AI software, influencing AMD’s future roadmap for Edge and Endpoint AI through deep customer engagement. KEY RESPONSIBILITIES: Drive end-to-end productization of AI inferencing solutions across AMD CPU, NPU, and AI PC platforms, with a strong focus on flow design, usability, and performance analysis. Partner with software developers to co-own AI software and model optimization efforts, provide early feedback on usability and specifications, and identify potential pain points through holistic product usage analysis. Engage with internal and external stakeholders to troubleshoot workflows, meet performance or accuracy goals, and create reproducible use cases that drive adoption of AMD inference solutions. Collaborate with sales and marketing teams to support strategic business engagements and customer success. Develop state-of-the-art Deep Learning and Generative AI/LLM model use cases, applications, methodologies, and technical guides covering inference software components such as the Compiler, Quantizer, Optimizer, Runtime, Profiler, Visualizer, and supporting libraries. PREFERRED EXPERIENCE: MSEE with 4–12 years of demonstrated experience in AI/ML software development. Strong expertise in Python and C++. Hands-on experience with AI frameworks such as PyTorch, ONNX, llama.cpp, etc. Work experience with one or more popular Deep Learning or Generative AI model architectures such as CNNs, Transformers, LLMs, Stable Diffusion, etc. Proficiency in model analysis, optimization, performance benchmarking, and accuracy measurement. Experience with quantization and pruning of neural networks is preferred. Familiarity with latest Generative AI architectures such as LLM, VLM, VLA, MoE, with exposure to models like Llama, Gemma, Qwen, DeepSeek, Mistral, Phi, etc. Experience working on or contributing to open-source AI projects is a plus. Experience with modern inference frameworks such as vLLM, SGLang, Dynamo, TensorRT-LLM, etc. is preferred. Familiarity with any AI accelerator SDK such as NVIDIA TensorRT, Intel OpenVINO, Qualcomm Neural Processing SDK, AWS Inferentia, etc. will set you apart from other candidates. Strong documentation and presentation skills for clear and concise communication. ACADEMIC CREDENTIALS: Master’s in computer engineering or computer science or electrical engineering, or comparable disciplines LOCATION: San Jose, CA Longmont, CO #LI-TC1
AMD
204 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 9 days ago
Staff AI Engineer - LLM, Gen AI, Deep Learning Inference Solutions
AMD
San Jose, CaliforniaView details - 24 days ago
Staff Engineer- Gen AI
Coupang
BengaluruView details - 14 days ago
Staff Machine Learning Engineer, AI Authoring
Unity
San Francisco, CA, USAView details - 10 days ago
Staff Machine Learning Engineer - Responsible AI
Pinterest
RemoteView details - 5 days ago
Staff Software Engineer - Deep Learning Acceleration
Aurora
Pittsburgh, PennsylvaniaView details - 5 days ago
Staff Software Engineer - Deep Learning Acceleration
Aurora
Seattle, WashingtonView details - 5 days ago
Staff Software Engineer - Deep Learning Acceleration
Aurora
Mountain View, CaliforniaView details - 27 days ago
Staff Machine Learning Engineer, AI Experience
Airbnb
United StatesView details - 23 days ago
Staff Software Engineer Generative AI-Machine Learning
Extreme Networks
View details - 7 days ago
Machine Learning Engineer, AI Safety - LLM MLOps
NVIDIA
US, CA, Santa ClaraView details
View all AI Engineer jobs
Looking for something different?
Browse all AI jobs