AI Benchmarking Specialist - Chinese, International Seller Growth
Posted 52 days ago
Job Description
This job posting has expired and is no longer accepting applications.
The Seller AI team within the International Seller Services organization focuses on giving sellers the right set of Gen-AI/LLM-powered tools and agentic solutions to accelerate their business growth on Amazon. Our primary focus lies in handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior experience to sellers worldwide. The AI Benchmarking Associate supports the evaluation of AI systems by designing and executing benchmarking and audit activities to assess model quality, compliance, robustness, and fairness. The role combines elements of AI auditing and quality assurance with traditional audit-style documentation and stakeholder communication. By joining us, you will play a pivotal role in shaping the future of selling on Amazon for sellers worldwide.
Key job responsibilities
As part of your role, you will have the opportunity to:
• Assist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability
• Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, and by escalating potential regulatory risks
• Validate data based on specific annotation guidelines, ensuring the accuracy and quality of the collected information
• Prepare clear audit and benchmarking reports, including error ratings, root-cause analysis, and recommendations, and contribute to presentations for senior stakeholders
• Maintain organized audit documentation, evidence, and benchmarking datasets to support internal review
• Work closely with your team members and managers to drive process efficiencies and explore opportunities for automation
• Enhance the productivity and effectiveness of data generation by contributing to the development and continuous improvement of AI audit methodologies, checklists, and test frameworks as regulations and best practices evolve
About the team
Millions of small and medium businesses across international stores such as India, LatAm, Europe, the Middle East, and Japan sign up as sellers on Amazon. Our primary focus lies in handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior experience to sellers worldwide.
Amazon
About the job
Posted on: Jan 6, 2026
Apply before: Feb 5, 2026
Job type: Full-time
Category: AI Internships
Location: China
