Software Engineer
About Graphcore
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
As a Software Engineer in the Collectives Simulator team, you will participate in the development of a large-scale collective communication simulator that enables the analysis of network parameters and the efficient implementation of communication algorithms. The ideal candidate will have experience in designing, developing, and maintaining complex software systems involving custom hardware.
The Team
The Collectives Simulator team is responsible for building a large-scale collective communication simulator for the new AI hardware Graphcore is working on. The simulator allows users to analyse communication algorithms in depth across various network topologies.
Responsibilities and Duties
- Implementing, testing and documenting the Collectives Simulator for new AI hardware
- Collaborating with other teams to design, implement and test new features
- Troubleshooting and resolving complex technical issues
- Participating in agile development as part of a scrum team
Candidate Profile
Essential:
- Experience in software development using the C++ programming language
- Experience with Python and C programming
- Good problem-solving skills and ability to debug and resolve complex issues
- Experience with unit testing frameworks such as Boost.Test and Google Test
- Experience with build tools such as CMake, Make and Ninja
- Strong understanding of version control systems (preferably Git)
Desirable:
- Experience in developing software simulators
- Experience with RDMA networking libraries (for example libibverbs, libfabric)
- Knowledge of multithreading and parallel computing concepts, including experience with parallel algorithms and optimization for AI/ML and HPC systems
- Knowledge of multithreading and inter-process communication (IPC) techniques for development of efficient concurrent applications
- Experience with Continuous Integration/Continuous Delivery (CI/CD) pipelines, including setting up automated workflows and deployments (for example GitHub Actions, GitLab CI)
- Experience with communication libraries (for example NCCL, MPI)
- Knowledge of machine learning frameworks (for example PyTorch)
- Knowledge of modern C++ standards (C++17/20)
Benefits
In addition to a competitive salary, Graphcore offers an annual leave policy, medical and dental health plans, a gym card, and an employee pension (matched up to 4%). We review our benefits on a yearly basis to ensure we offer a valuable and rewarding benefits programme to our employees.
We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interviews and encourage you to talk to us if you require any reasonable adjustments.