AMD logo

AMD

Company

AI Inference Engineer

Beijing, China

Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. AI推理加速研发工程师/HPC高性能优化架构师 岗位职责: 1. 设计、开发和实现高效的大型模型推理系统,以提高计算性能,提升算力利用率; 2. 进行模型性能分析和调优,识别和解决瓶颈问题,提高模型推理速度; 3. 跟踪最新的研究进展和技术趋势,提出改进和创新的想法,推动团队的技术发展; 岗位要求: 1. 深入理解大模型算法原理,熟悉模型结构,包括常见的GPT系列、llama系列、deepseek系列等模型; 2. 熟悉至少一种LLM主流推理引擎,如vllm、sglang等,掌握其底层技术原理,如如FlashAtention、PageAttention、Continuous Batching、Speculative Decoding等,具备开发优化经验; 3. 了解分布式推理框架原理,如pd分离、Expert Parallel等; 4. 熟悉python/C/C++编程,熟练掌握pytorch等至少一种深度学习框架 5. 有算子优化经验,包括不限于CUDA/Triton; 加分项: 1. 有大模型推理加速落地经验者优先; 2. 熟悉分布式推理加速框架,有超大模型分布式加速经验优先 #LI-FL1 Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AI推理加速研发工程师/HPC高性能优化架构师 岗位职责: 1. 设计、开发和实现高效的大型模型推理系统,以提高计算性能,提升算力利用率; 2. 进行模型性能分析和调优,识别和解决瓶颈问题,提高模型推理速度; 3. 跟踪最新的研究进展和技术趋势,提出改进和创新的想法,推动团队的技术发展; 岗位要求: 1. 深入理解大模型算法原理,熟悉模型结构,包括常见的GPT系列、llama系列、deepseek系列等模型; 2. 熟悉至少一种LLM主流推理引擎,如vllm、sglang等,掌握其底层技术原理,如如FlashAtention、PageAttention、Continuous Batching、Speculative Decoding等,具备开发优化经验; 3. 了解分布式推理框架原理,如pd分离、Expert Parallel等; 4. 熟悉python/C/C++编程,熟练掌握pytorch等至少一种深度学习框架 5. 有算子优化经验,包括不限于CUDA/Triton; 加分项: 1. 有大模型推理加速落地经验者优先; 2. 熟悉分布式推理加速框架,有超大模型分布式加速经验优先 #LI-FL1

Please mention that you found this job on MoAIJobs, this helps us grow. Thank you!

AMD logo

AMD

101 jobs posted

View all AMD jobs

About the job

Posted on

Dec 15, 2025

Apply before

Jan 14, 2026

Job typeFull-time
CategoryML Engineer

Share this job opportunity