Software Development Engineer, AGI Data Services
Posted 2 days ago
Job Description
AGI Data Services strives to be best in class at acquiring, creating and ground-truth data, with the highest standards of privacy and trust, to power the best AI models on Earth.
We are seeking a Senior Software Development Engineer (Sr. SDE) who is passionate about Generative AI and has strong engineering fundamentals to own and accelerate the next generation of GenAI-powered tooling within AGI Data Services. The Sr. SDE will design, build, and maintain LLM-as-a-Judge evaluation pipelines that leverage large language models to assess data quality at scale — including judge architectures, evaluation rubrics, scoring models, and calibration mechanisms that align with the standards set by core scientist teams developing Amazon Nova models. The Sr. SDE will also design and build GenAI-powered workflow tools — such as conversational diagnostic agents, automated quality assessment systems, and guided remediation workflows — that streamline data collection and quality assurance processes, enabling cross-functional teams to rapidly identify issues, reduce resolution time, and continuously improve data throughput.
The Sr. SDE's work will directly improve Amazon Nova models. Our team has built a strong foundation of GenAI-powered engineering practices — this senior role will accelerate and scale that momentum. This role offers direct visibility to VP and SVP leadership.
Key job responsibilities
The Sr. SDE will own the LLM-as-a-Judge evaluation pipeline — designing, building, and scaling automated evaluation systems that leverage large language models to assess data quality. The Sr. SDE will architect judge pipelines, develop evaluation rubrics and scoring frameworks, build calibration and agreement mechanisms, and ensure judge outputs align with quality standards defined by core scientist teams.
The Sr. SDE will design and build GenAI-powered diagnostic and workflow tools — including conversational troubleshooting agents, automated quality assessment tools, guided remediation systems, and workflow copilots. The Sr. SDE will leverage and extend agent orchestration frameworks such as LangChain, LangGraph, Amazon Bedrock Agents, or design custom orchestration layers tailored to AGI Data Services workflows.
The Sr. SDE will build upon the team's existing GenAI-forward practices — introducing advanced patterns for prompt engineering, RAG, agent orchestration, and LLM evaluation into production systems. The Sr. SDE will design and implement robust backend services, APIs, and data pipelines on AWS leveraging Amazon Bedrock, SageMaker, Lambda, ECS/EKS, Step Functions, DynamoDB, OpenSearch, and S3.
The Sr. SDE will collaborate with Applied Scientists, Technical Program Managers, domain experts, and vendor teams — bridging technology, process, and operations.
A day in the life
The Sr. SDE will review LLM-as-a-Judge pipeline metrics — monitoring judge accuracy, calibration drift, and agreement rates — and collaborate with Applied Scientists to refine evaluation rubrics. The Sr. SDE will design new judge architectures, build and iterate on conversational troubleshooting agents, fine-tune prompt chains, and expand RAG knowledge bases. The Sr. SDE will dive deep into data quality anecdotes to find patterns and root causes, propose tooling solutions that automate manual processes, and share new GenAI integration patterns that build on existing team practices. The Sr. SDE will communicate impact and roadmaps to cross-functional partners and VP leadership.
We are seeking a Senior Software Development Engineer (Sr. SDE) who is passionate about Generative AI and has strong engineering fundamentals to own and accelerate the next generation of GenAI-powered tooling within AGI Data Services. The Sr. SDE will design, build, and maintain LLM-as-a-Judge evaluation pipelines that leverage large language models to assess data quality at scale — including judge architectures, evaluation rubrics, scoring models, and calibration mechanisms that align with the standards set by core scientist teams developing Amazon Nova models. The Sr. SDE will also design and build GenAI-powered workflow tools — such as conversational diagnostic agents, automated quality assessment systems, and guided remediation workflows — that streamline data collection and quality assurance processes, enabling cross-functional teams to rapidly identify issues, reduce resolution time, and continuously improve data throughput.
The Sr. SDE's work will directly improve Amazon Nova models. Our team has built a strong foundation of GenAI-powered engineering practices — this senior role will accelerate and scale that momentum. This role offers direct visibility to VP and SVP leadership.
Key job responsibilities
The Sr. SDE will own the LLM-as-a-Judge evaluation pipeline — designing, building, and scaling automated evaluation systems that leverage large language models to assess data quality. The Sr. SDE will architect judge pipelines, develop evaluation rubrics and scoring frameworks, build calibration and agreement mechanisms, and ensure judge outputs align with quality standards defined by core scientist teams.
The Sr. SDE will design and build GenAI-powered diagnostic and workflow tools — including conversational troubleshooting agents, automated quality assessment tools, guided remediation systems, and workflow copilots. The Sr. SDE will leverage and extend agent orchestration frameworks such as LangChain, LangGraph, Amazon Bedrock Agents, or design custom orchestration layers tailored to AGI Data Services workflows.
The Sr. SDE will build upon the team's existing GenAI-forward practices — introducing advanced patterns for prompt engineering, RAG, agent orchestration, and LLM evaluation into production systems. The Sr. SDE will design and implement robust backend services, APIs, and data pipelines on AWS leveraging Amazon Bedrock, SageMaker, Lambda, ECS/EKS, Step Functions, DynamoDB, OpenSearch, and S3.
The Sr. SDE will collaborate with Applied Scientists, Technical Program Managers, domain experts, and vendor teams — bridging technology, process, and operations.
A day in the life
The Sr. SDE will review LLM-as-a-Judge pipeline metrics — monitoring judge accuracy, calibration drift, and agreement rates — and collaborate with Applied Scientists to refine evaluation rubrics. The Sr. SDE will design new judge architectures, build and iterate on conversational troubleshooting agents, fine-tune prompt chains, and expand RAG knowledge bases. The Sr. SDE will dive deep into data quality anecdotes to find patterns and root causes, propose tooling solutions that automate manual processes, and share new GenAI integration patterns that build on existing team practices. The Sr. SDE will communicate impact and roadmaps to cross-functional partners and VP leadership.
Amazon
110 jobs posted
About the job
Posted on
Mar 27, 2026
Apply before
Apr 26, 2026
Job typeFull-time
CategoryArtificial General Intelligence
Location
US, WA
Similar Jobs
13d
Data Associate(韓国語 한국어팀), AGI Data Services
Amazon
JapanData Associate(韓国語 한국어팀), AGI Data Services
Amazon
Japan13d13d
Data Associate(中国語 中文团队 ─ 约聘), AGI Data Services
Amazon
JapanData Associate(中国語 中文团队 ─ 约聘), AGI Data Services
Amazon
Japan13d24d
AI Data Associate (Dutch), AGI-Data Services
Amazon
NetherlandsAI Data Associate (Dutch), AGI-Data Services
Amazon
Netherlands24d10d
Advanced Packaging Data Analytics Engineer
AMD
Hsinchu, TaiwanAdvanced Packaging Data Analytics Engineer
AMD
Hsinchu, Taiwan10d16d
AI Data Associate with Dutch, Artificial General Intelligence Data Services
Amazon
NetherlandsAI Data Associate with Dutch, Artificial General Intelligence Data Services
Amazon
Netherlands16d25d
Research Scientist, Post-AGI Research
DeepMind
London, United KingdomResearch Scientist, Post-AGI Research
DeepMind
London, United Kingdom25d11d
Research Scientist, Post-AGI Research
DeepMind
London, United KingdomResearch Scientist, Post-AGI Research
DeepMind
London, United Kingdom11d25d
AI Data Associate - French, Artificial General Intelligence
Amazon
United KingdomAI Data Associate - French, Artificial General Intelligence
Amazon
United Kingdom25d24d
Member of Technical Staff - Applied Science , AGI Autonomy
Amazon
US, CAMember of Technical Staff - Applied Science , AGI Autonomy
Amazon
US, CA24d2d
Applied AI and Simulation Engineer (Packaging Engineering)
Dell Technologies
Austin, TexasApplied AI and Simulation Engineer (Packaging Engineering)
Dell Technologies
Austin, Texas2d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.