Staff Engineer - AI Compute Blade and Rack Validation
Posted 12 hours ago
Job Description
About us
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of a best-in-class family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from a diverse group of backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
We are seeking a senior validation lead engineer to lead at-scale rack validation efforts for next-generation AI hyperscale systems. This role focuses on post-silicon system validation across the full lifecycle, ensuring functional, electrical, and thermal performance meets product objectives. You will own end-to-end blade and rack validation including planning, development, execution, and debug while collaborating across firmware, systems, and hardware teams.
The Team
The Rack Validation team is responsible for ensuring system readiness and quality at scale. The team works cross-functionally with firmware, silicon, and system engineering teams to validate complex AI compute platforms.
Responsibilities and Duties
- Lead post-silicon validation of AI compute blades and racks including test planning, development, and automation.
- Drive provisioning and integration of system components (SoC FW, BMC, RMC, OS) for rack-level readiness.
- Own execution against program achievements and report validation progress and risks.
- Triage test failures, collect debug data, and collaborate on root cause analysis.
- Track validation coverage and continuously improve test processes and infrastructure.
- Collaborate with ODM/JDM partners on validation and quality.
- Mentor engineers and drive engineering excellence.
Candidate Profile
Essential:
- Bachelor’s or Master’s degree or equivalent experience in Computer Engineering, Electrical Engineering, Computer Science, or related field.
- Proven track record in system, rack, or embedded validation with leadership experience.
- Strong experience in large-scale hardware validation environments.
- Expertise in CPU/GPU, memory, IO, and firmware validation.
- Experience with Linux/server OS and automation using Python/Bash.
- Knowledge of IPMI, Redfish, PLDM.
- Experience with CI/CD pipelines and hardware interfaces.
Desirable:
- Experience in hyperscale environments.
- Familiarity with OpenBMC and processes for verifying firmware functionality.
- Knowledge of firmware security and HIL testing.
- Experience with test management tools.
Graphcore
12 jobs posted
About the job
Similar Jobs
Today
Staff Engineer - AI Compute Blade and Rack Validation
Graphcore
Austin, TexasStaff Engineer - AI Compute Blade and Rack Validation
Graphcore
Austin, TexasToday7d
Staff AI Engineer
Paypal
Bangalore, Karnataka, IndiaStaff AI Engineer
Paypal
Bangalore, Karnataka, India7d4d
Staff AI Engineer
Paypal
Bangalore, Karnataka, IndiaStaff AI Engineer
Paypal
Bangalore, Karnataka, India4d22d
Staff Data Engineer - Data AI & Users and Contextualization Data
Airbnb
$204K - $255KUnited StatesStaff Data Engineer - Data AI & Users and Contextualization Data
Airbnb
$204K - $255KUnited States22d28d
Staff AI Research Engineer
AMD
Santa Clara, CaliforniaStaff AI Research Engineer
AMD
Santa Clara, California28d20d
AI Cluster Validation Engineer
AMD
Santa Clara, CaliforniaAI Cluster Validation Engineer
AMD
Santa Clara, California20d21d
AI and Data Engineer
Hitachi
$22 - $29Santa Clara, CaliforniaAI and Data Engineer
Hitachi
$22 - $29Santa Clara, California21d2d
Staff AI Engineer, Payments
GoodLeap
Roseville, CAAustin, TXBentonville, ARDallas, TXIrvine, CALehi, UTPlano, TXSan Francisco, CAWest Palm Beach, FLStaff AI Engineer, Payments
GoodLeap
Roseville, CAAustin, TXBentonville, ARDallas, TXIrvine, CALehi, UTPlano, TXSan Francisco, CAWest Palm Beach, FL2d22d
Staff AI Engineer - Cloud FinOps
Datadog
$234K - $300KUnited StatesStaff AI Engineer - Cloud FinOps
Datadog
$234K - $300KUnited States22d
22dStaff Software Engineer - AI Tools
PlayStation
RemoteUnited Kingdom
Staff Software Engineer - AI Tools
PlayStation
RemoteUnited Kingdom22d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.