Staff Engineer - AI Compute Blade and Rack Validation
About us
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of a best-in-class family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from a diverse group of backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
We are seeking a senior validation lead engineer to lead at-scale rack validation efforts for next-generation AI hyperscale systems. This role focuses on post-silicon system validation across the full lifecycle, ensuring functional, electrical, and thermal performance meets product objectives. You will own end-to-end blade and rack validation including planning, development, execution, and debug while collaborating across firmware, systems, and hardware teams.
The Team
The Rack Validation team is responsible for ensuring system readiness and quality at scale. The team works cross-functionally with firmware, silicon, and system engineering teams to validate complex AI compute platforms.
Responsibilities and Duties
- Lead post-silicon validation of AI compute blades and racks including test planning, development, and automation.
- Drive provisioning and integration of system components (SoC FW, BMC, RMC, OS) for rack-level readiness.
- Own execution against program achievements and report validation progress and risks.
- Triage test failures, collect debug data, and collaborate on root cause analysis.
- Track validation coverage and continuously improve test processes and infrastructure.
- Collaborate with ODM/JDM partners on validation and quality.
- Mentor engineers and drive engineering excellence.
Candidate Profile
Essential:
- Bachelor’s or Master’s degree or equivalent experience in Computer Engineering, Electrical Engineering, Computer Science, or related field.
- Proven track record in system, rack, or embedded validation with leadership experience.
- Strong experience in large-scale hardware validation environments.
- Expertise in CPU/GPU, memory, IO, and firmware validation.
- Experience with Linux/server OS and automation using Python/Bash.
- Knowledge of IPMI, Redfish, PLDM.
- Experience with CI/CD pipelines and hardware interfaces.
Desirable:
- Experience in hyperscale environments.
- Familiarity with OpenBMC and processes for verifying firmware functionality.
- Knowledge of firmware security and HIL testing.
- Experience with test management tools.
Create a Job Alert
Interested in building your career at Graphcore? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field