Amazon
3 weeks ago
Sr. Technical Program Manager - AI/ML Hardware, Annapurna Labs - Server & Rack Delivery - Trainium Hardware
US, TX, Austin
Full-time
Job Description
At Annapurna Labs, we are at the forefront of hardware/software accelerator solutions, not only for Amazon Web Services (AWS) but across the industry. The Machine Learning Component, Server and Rack delivery team is looking for candidates interested in diving deep into our designs of Machine Learning servers and developing world-class hardware to support current and future generations of accelerator silicon.
Our team designs and builds Annapurna's fleet of accelerated servers using internally designed silicon. We solve systemic hardware issues, and we build hardware and software systems to detect and mitigate future recurrences so that our customers can experience the highest quality of service possible!
You will be responsible for program managing the design and implementation of a new generation of servers or a core reusable component used across multiple generations of servers in the Annapurna fleet. You will be responsible for the development of individual boards, managing third-party ODMs, integrating components from within and outside of Amazon, and integrating firmware/software stacks on top of the hardware to produce high-quality designs for our customers.
We are seeking an experienced TPM to help drive solutions in the highly technical machine learning server hardware space. Our team has end-to-end ownership of some of the most advanced server hardware in the world. Come join us!
Key job responsibilities
You will work closely with our customers to understand their technical needs and business goals and partner with development engineers within the org to architect the solutions that we will deploy at scale.
To deliver your products, you will work with an interdisciplinary team of hardware design, silicon design, component, firmware, test, qualification, and integration engineers.
This is a fast-paced, intellectually challenging position, and you’ll work with thought leaders in multiple technology areas. You’ll have high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve your product’s performance, quality and cost. Using data and key metrics, you will also drive and measure process improvements that enhance our operational effectiveness.
You will work independently in a dynamic, challenging, and fast-changing organization. We’re changing an industry, and we need individuals who are ready for this challenge and who want to reach beyond what is possible today.
A day in the life
Your day-to-day responsibilities will include interfacing with our internal and external customers to understand project requirements and facilitate system development on top of your server design. You will be responsible for learning about the operational challenges facing our existing fleet, with the goal of improving the current customer experience as well as developing improved systems for future designs. You will work directly with vendors and ODM/JDM design teams to develop and manufacture your product at scale.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, design reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future.