Member of Technical Staff – Backend
Job Description
Inflection AI is a public benefit corporation leveraging our world-class large language model to build the first AI platform focused on the needs of the enterprise.
Who we are:
Inflection AI was re-founded in March 2024, and our leadership team has assembled a group of kind, innovative, and collaborative individuals focused on building enterprise AI solutions. We are passionate about what we are building, enjoy working together, and strive to hire people with diverse backgrounds and experience.
Our first product, Pi, is an empathetic, conversational chatbot. Pi is a public instance built on our 350B+ frontier model and our sophisticated fine-tuning (10M+ examples), inference, and orchestration platform. We are now building new systems that directly support the needs of enterprise customers using this same approach.
Want to work with us? Have questions? Learn more below.
About the Role
As a backend engineer at Inflection, you will own the platforms, systems, and services that bring our conversational AI to life at scale. You’ll collaborate across research, product, and infrastructure teams to enable rapid iteration, high reliability, and secure delivery of novel AI features to millions of users. Your work will directly impact both the pace of product development and the stability of our production systems.
This is a good role for you if you:
- Have 5+ years of experience building and scaling backend systems for high-throughput applications
- Are fluent in building distributed systems with Python, Go, Rust, or similar languages, and are comfortable with cloud-native architectures (e.g., Kubernetes, gRPC, Postgres, Redis, Kafka)
- Have owned backend services end-to-end—from design and implementation to deployment, monitoring, and debugging
- Thrive in fast-paced environments where you can move quickly without sacrificing engineering rigor
- Proactively improve tooling and infrastructure to support your teammates’ workflows and reliability goals
- Communicate clearly across disciplines and take pride in solving user-facing problems with clean backend solutions
Responsibilities include:
- Design and implement scalable backend systems and APIs that power production LLM experiences, including agentic workflows, memory systems, and tool integrations
- Build and operate high-availability infrastructure to support real-time inference, retrieval, and conversation pipelines
- Develop internal platforms to improve engineering productivity—CI/CD pipelines, service templates, observability frameworks, and rollout tooling
- Collaborate closely with applied research and frontend teams to rapidly prototype, ship, and iterate on end-user features
- Ensure systems meet our high bar for security, uptime, and latency—through incident response, load testing, monitoring, and automation
- Participate in on-call rotations to maintain the reliability of the services you build
Employee Pay Disclosures
At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates that the starting annual base salary will fall in the range of approximately $175,000 - $350,000, depending on experience. This estimate can vary based on the factors described above, so the actual starting annual base salary may be above or below this range.
Benefits
Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include:
- Diverse medical, dental and vision options
- 401k matching program
- Unlimited paid time off
- Parental leave and flexibility for all parents and caregivers
- Support for country-specific visa needs for international employees living in the Bay Area
Interview Process
Apply: Please apply on LinkedIn or our website for a specific role.
After speaking with one of our recruiters, you’ll enter our structured interview process, which includes the following stages:
- Hiring Manager Conversation – An initial discussion with the hiring manager to assess fit and alignment.
- Technical Interview – A deep dive with an Inflection Engineer to evaluate your technical expertise.
- Onsite Interview – A comprehensive assessment, including:
  - A domain-specific interview
  - A system design interview
  - A final conversation with the hiring manager
Depending on the role, we may also ask you to complete a take-home exercise or deliver a presentation.
For non-technical roles, be prepared for a role-specific interview, such as a portfolio review.
Decision Timeline
We aim to provide feedback within one week of your final interview.