Infrastructure and MLOps Engineer
Posted 61 days ago
Job Description
This job posting has expired and no longer accepting applications.
About Graphcore
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
Join our dynamic Software Infrastructure team and take a pivotal role in scaling and managing our infrastructure. You will develop essential tools and services that empower our broader software team. Your contributions will enhance the build, test, deployment, and productisation processes of our Machine Learning Software components. Work with our High-Performance Computing (HPC) AI platforms and gain invaluable experience in distributed systems
The Team
The Software Infrastructure team provides critical platforms and services for software development teams across the business. Our responsibilities include managing the CI platform and services, build engineering, component integration, and packaging and release systems. We operate in squads, fostering a culture of service ownership and empowerment for our engineers. We focus on long-term engineering solutions and strive to eliminate toil wherever possible.
Responsibilities and Duties
- Develop, own, and maintain tools and services to support AI research and engineering teams
- Deploy and maintain services with Kubernetes and Docker
- Manage our Cloud Infrastructure using tools such as Terraform
Candidate Profile
Essential:
- Knowledge of Python
- Familiarity with cloud services (e.g. AWS)
- Experience managing or developing in Linux environments
- Understanding of CI/CD principles
- Experience using Kubernetes (k8s)
Desirable
- Experience maintaining machine learning applications
- Experience deploying ML orchestration tools (e.g. NV Ray, KFP, SkyPilot)
- Experience managing ML accelerator hardware (e.g. DCGM)
- Experience with Infrastructure as Code (IaC) tools (e.g. Terraform/OpenTofu)
- Experience with GitHub Actions
- Experience with modern observability tooling (e.g. Prometheus)
- Experience with Grafana
- Knowledge of Go/Java/C++ (or similar language)
Benefits
In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.
Applicants for this position must hold the right to work in the UK. Unfortunately at this time, we are unable to provide visa sponsorship or support for visa applications
This job posting has expired and no longer accepting applications. Please check out our latest AI jobs.
Graphcore
12 jobs posted
About the job
Feb 12, 2026
Mar 14, 2026
Similar Jobs
22d
Software Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$185K - $275KMountain View, CASoftware Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$185K - $275KMountain View, CA22d18d
Solutions Engineer (Data and AI)
Databricks
Paris, FranceSolutions Engineer (Data and AI)
Databricks
Paris, France18d14d
Data and AI Engineer (Manager)
Visa
Bengaluru, IndiaData and AI Engineer (Manager)
Visa
Bengaluru, India14d22d
Staff Software Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$210K - $275KMountain View, CAStaff Software Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$210K - $275KMountain View, CA22d22d
Senior Software Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$185K - $230KMountain View, CASenior Software Engineer, Infrastructure (ML and Real-Time Speech)
Otter
$185K - $230KMountain View, CA22d26d
Software Engineer, ML (Training and Inference)
Isomorphic Labs
LondonSoftware Engineer, ML (Training and Inference)
Isomorphic Labs
London26d15d
Data Engineer
Tavus
RemoteData Engineer
Tavus
Remote15d14d
Senior Solutions Engineer (Data and AI)
Databricks
London, United KingdomSenior Solutions Engineer (Data and AI)
Databricks
London, United Kingdom14d6d
Research Engineer, Infrastructure
Cognition
San Francisco, CAResearch Engineer, Infrastructure
Cognition
San Francisco, CA6d5d
Senior MLOps Engineer
Deep Genomics
$175K - $200KToronto, OntarioSenior MLOps Engineer
Deep Genomics
$175K - $200KToronto, Ontario5d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.