Staff AI Engineer - LLM, Gen AI, Deep Learning Inference Solutions

San Jose, California

Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

THE ROLE:

We are looking for an AI Product Development Engineer to join our AI Solution Engineering organization, supporting the AMD Ryzen AI product line. In this role, you will operate at the intersection of AI software development, productization, and ecosystem enablement to help design, optimize, and scale AI workloads on the AMD Ryzen AI platform. You will act as a trusted technical advisor, enabling customers to adopt the Ryzen AI Software stack, optimize performance, and accelerate deployment. This position offers a unique opportunity to work at the crossroads of cutting-edge hardware and AI software, influencing AMD’s future roadmap for Edge and Endpoint AI through deep customer engagement.

KEY RESPONSIBILITIES:

Drive end-to-end productization of AI inferencing solutions across AMD CPU, NPU, and AI PC platforms, with a strong focus on flow design, usability, and performance analysis.
Partner with software developers to co-own AI software and model optimization efforts, provide early feedback on usability and specifications, and identify potential pain points through holistic product usage analysis.
Engage with internal and external stakeholders to troubleshoot workflows, meet performance or accuracy goals, and create reproducible use cases that drive adoption of AMD inference solutions.
Collaborate with sales and marketing teams to support strategic business engagements and customer success.
Develop state-of-the-art Deep Learning and Generative AI/LLM model use cases, applications, methodologies, and technical guides covering inference software components such as the Compiler, Quantizer, Optimizer, Runtime, Profiler, Visualizer, and supporting libraries.

PREFERRED EXPERIENCE:

MSEE with 4–12 years of demonstrated experience in AI/ML software development.
Strong expertise in Python and C++.
Hands-on experience with AI frameworks such as PyTorch, ONNX, llama.cpp, etc.
Work experience with one or more popular Deep Learning or Generative AI model architectures such as CNNs, Transformers, LLMs, Stable Diffusion, etc.
Proficiency in model analysis, optimization, performance benchmarking, and accuracy measurement.
Experience with quantization and pruning of neural networks is preferred.
Familiarity with the latest Generative AI architectures such as LLM, VLM, VLA, and MoE, with exposure to models like Llama, Gemma, Qwen, DeepSeek, Mistral, Phi, etc.
Experience working on or contributing to open-source AI projects is a plus.
Experience with modern inference frameworks such as vLLM, SGLang, Dynamo, TensorRT-LLM, etc. is preferred.
Familiarity with any AI accelerator SDK such as NVIDIA TensorRT, Intel OpenVINO, Qualcomm Neural Processing SDK, AWS Inferentia, etc. will set you apart from other candidates.
Strong documentation and presentation skills for clear and concise communication.

ACADEMIC CREDENTIALS:

Master’s in computer engineering, computer science, electrical engineering, or a comparable discipline.

LOCATION:

San Jose, CA
Longmont, CO

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

