AMD
22 hours ago

Sr. Software Development Engineer - GPU Performance Tools

MARKHAM, Canada

Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ SENIOR SOFTWARE DEVELOPMENT ENGINEER - GPU PERFORMANCE TOOLS THE TEAM:  AI is defining the next era of computing, and this is just the beginning. We see the benefits of AI every day—enabling medical research, curbing credit card fraud, reducing congestion in cities, or simply making life easier.  In the ever-evolving landscape of artificial intelligence, we are a powerhouse – a cutting-edge 'AI Software Solutions Team'. Specialized in AI optimization, fine-tuning large language models to unlock unprecedented Generative AI efficiency. Our expertise extends beyond the hardware realm, encompassing 3P enablement, where we develop custom AI Software Solutions for Industry leading AI customers. Are you excited to work with one of Top 1% of the AI companies in the world?  THE ROLE: As a GPU Performance Tools Developer, you will build and maintain a world-class suite of tools to automate critical performance engineering workflows. Your primary focus will be providing tools to our technical users, who rely on this automation platform for benchmarking of large-scale, GPU-enabled data center applications. This role requires developing a foundational knowledge of our team's processes, diverse workloads, and complex infrastructure to ensure maximum stability, reliability, and user success. You will be a key contributor, learning and growing at the cutting edge of Data Center GPU. THE PERSON: We are seeking a motivated and collaborative software engineer with a strong sense of accountability and a passion for innovation and user enablement. The ideal candidate is a problem-solver who is eager to provide direct support and learn to translate user needs into robust, reliable, performant solutions. You must possess strong communication skills and sound judgment to help drive issues to resolution, communicate effectively with stakeholders, and work collaboratively with different teams across AMD to ensure platform stability and success. A process-oriented mindset and a commitment to understanding and bringing innovation to established systems are essential. KEY RESPONSIBILITIES: Develop and maintain a foundational understanding of our automation framework, infrastructure, and user workflows to grow into a subject matter expert. Assist in enhancing and maintaining a web-based front-end and workload submission portal, prioritizing stability and features that directly address user-reported needs. Contribute to building a highly reliable, self-service reporting system that automates the creation of informative tables and graphs for engineers, business units, and management. Support the scalability, performance, and monitoring of our Azure Cloud-based Web-App deployments. Work closely with the GPU Performance Team to provide user support, troubleshoot issues and gather feedback to drive platform improvements. Learn from senior engineers and contribute to establishing best practices for development, testing, and user support. Assist in developing scripted automation and investigating AI-agents to improve front-end interactions and user efficiency. PREFERRED EXPERIENCE: Up to 5 years of software development experience with a desire to support production systems and interact directly with a technical user base. Strong proficiency in Python, and familiarity with Ansible, Structured and Unstructured DB’s (SQL/Mongo), Linux, Docker, Azure Demonstrated experience in a role requiring ownership, accountability, and the ability to work effectively within established team processes. Experience with or exposure to MS Azure, including Web-Apps, Functions, and Azure Monitor for logging, monitoring, and troubleshooting. Interest in using AI development tools (cursor, cline, windsurf…). Proficiency in JavaScript (React) for front end work and troubleshooting. Knowledge of web application development frameworks (CSS/HTML). Must have: A proactive and inquisitive mindset focused on root cause analysis, reliability, and user-centric solutions. Good to have: o Experience with data visualization tools (Plotly, Matplotlib, Pandas). o Experience with REST API development and containerization (Flask, Kubernetes). o Familiarity with infrastructure and workload automation tools (e.g., Ansible). Nice to have: o Experience with Grafana, Jupyter/JupyterHub, Docker Swarm. o Familiarity with CI/CD pipelines and GitHub Actions. ACADEMIC CREDENTIALS: Bachelor’s or Master’s degree in Computer Science or equivalent relevant professional experience. #LI-RL1 Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. SENIOR SOFTWARE DEVELOPMENT ENGINEER - GPU PERFORMANCE TOOLS THE TEAM:  AI is defining the next era of computing, and this is just the beginning. We see the benefits of AI every day—enabling medical research, curbing credit card fraud, reducing congestion in cities, or simply making life easier.  In the ever-evolving landscape of artificial intelligence, we are a powerhouse – a cutting-edge 'AI Software Solutions Team'. Specialized in AI optimization, fine-tuning large language models to unlock unprecedented Generative AI efficiency. Our expertise extends beyond the hardware realm, encompassing 3P enablement, where we develop custom AI Software Solutions for Industry leading AI customers. Are you excited to work with one of Top 1% of the AI companies in the world?  THE ROLE: As a GPU Performance Tools Developer, you will build and maintain a world-class suite of tools to automate critical performance engineering workflows. Your primary focus will be providing tools to our technical users, who rely on this automation platform for benchmarking of large-scale, GPU-enabled data center applications. This role requires developing a foundational knowledge of our team's processes, diverse workloads, and complex infrastructure to ensure maximum stability, reliability, and user success. You will be a key contributor, learning and growing at the cutting edge of Data Center GPU. THE PERSON: We are seeking a motivated and collaborative software engineer with a strong sense of accountability and a passion for innovation and user enablement. The ideal candidate is a problem-solver who is eager to provide direct support and learn to translate user needs into robust, reliable, performant solutions. You must possess strong communication skills and sound judgment to help drive issues to resolution, communicate effectively with stakeholders, and work collaboratively with different teams across AMD to ensure platform stability and success. A process-oriented mindset and a commitment to understanding and bringing innovation to established systems are essential. KEY RESPONSIBILITIES: Develop and maintain a foundational understanding of our automation framework, infrastructure, and user workflows to grow into a subject matter expert. Assist in enhancing and maintaining a web-based front-end and workload submission portal, prioritizing stability and features that directly address user-reported needs. Contribute to building a highly reliable, self-service reporting system that automates the creation of informative tables and graphs for engineers, business units, and management. Support the scalability, performance, and monitoring of our Azure Cloud-based Web-App deployments. Work closely with the GPU Performance Team to provide user support, troubleshoot issues and gather feedback to drive platform improvements. Learn from senior engineers and contribute to establishing best practices for development, testing, and user support. Assist in developing scripted automation and investigating AI-agents to improve front-end interactions and user efficiency. PREFERRED EXPERIENCE: Up to 5 years of software development experience with a desire to support production systems and interact directly with a technical user base. Strong proficiency in Python, and familiarity with Ansible, Structured and Unstructured DB’s (SQL/Mongo), Linux, Docker, Azure Demonstrated experience in a role requiring ownership, accountability, and the ability to work effectively within established team processes. Experience with or exposure to MS Azure, including Web-Apps, Functions, and Azure Monitor for logging, monitoring, and troubleshooting. Interest in using AI development tools (cursor, cline, windsurf…). Proficiency in JavaScript (React) for front end work and troubleshooting. Knowledge of web application development frameworks (CSS/HTML). Must have: A proactive and inquisitive mindset focused on root cause analysis, reliability, and user-centric solutions. Good to have: o Experience with data visualization tools (Plotly, Matplotlib, Pandas). o Experience with REST API development and containerization (Flask, Kubernetes). o Familiarity with infrastructure and workload automation tools (e.g., Ansible). Nice to have: o Experience with Grafana, Jupyter/JupyterHub, Docker Swarm. o Familiarity with CI/CD pipelines and GitHub Actions. ACADEMIC CREDENTIALS: Bachelor’s or Master’s degree in Computer Science or equivalent relevant professional experience. #LI-RL1

Please mention that you found this job on MoAIJobs, this helps us grow. Thank you!

WorkdayWorkday
4 weeks ago

Sr Software Development Engineer

Ireland, Dublin
AMDAMD
4 weeks ago

Sr. Software Development Engineer

Shanghai, China
WorkdayWorkday
3 weeks ago

Sr Software Development Engineer

USA, GA, Atlanta
AMDAMD
2 weeks ago

Sr. Software Development Engineer

MARKHAM, Canada
AMDAMD
1 week ago

GPU Software Development Engineer

Warsaw, Poland