AI Benchmarking Spec. - Chinese, International Seller Growth
Posted 6 days ago
Job Description
The Seller AI team within International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior seller experience to our sellers worldwide. The AI Benchmarking Associate supports the evaluation of AI systems by designing and executing benchmarking and audit activities to assess model quality, compliance, robustness, and fairness. The role combines elements of AI auditing, quality assurance, and traditional audit-style documentation and stakeholder communication. By joining us, you will play a pivotal role in shaping the future of selling on Amazon for sellers worldwide.
Key job responsibilities
As part of your role, you will have the opportunity to,
• Assist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability
• Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, escalating potential regulatory risks.
• Validate data based on specific annotation guidelines, ensuring the accuracy and quality of the collected information
• Prepare clear audit and benchmarking reports, including error ratings, root-cause analysis, and recommendations, and contribute to presentations for senior stakeholders
Maintain organized audit documentation, evidence, and benchmarking datasets to support internal review
• You will work closely with your team members and managers to drive process efficiencies and explore opportunities for automation
• You will strive to enhance the productivity and effectiveness of the data generation by contributing to the development and continuous improvement of AI audit methodologies, checklists, and test frameworks as regulations and best practices evolve
Key job responsibilities
As part of your role, you will have the opportunity to,
• Assist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability
• Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, escalating potential regulatory risks.
• Validate data based on specific annotation guidelines, ensuring the accuracy and quality of the collected information
• Prepare clear audit and benchmarking reports, including error ratings, root-cause analysis, and recommendations, and contribute to presentations for senior stakeholders
Maintain organized audit documentation, evidence, and benchmarking datasets to support internal review
• You will work closely with your team members and managers to drive process efficiencies and explore opportunities for automation
• You will strive to enhance the productivity and effectiveness of the data generation by contributing to the development and continuous improvement of AI audit methodologies, checklists, and test frameworks as regulations and best practices evolve
Amazon
129 jobs posted
About the job
Posted on
Mar 19, 2026
Apply before
Apr 18, 2026
Job typeFull-time
CategoryAI Internships
Location
China
Skills
Similar Jobs
6d
Program Manager, AI Model Evaluation, International Seller Growth
Amazon
ChinaProgram Manager, AI Model Evaluation, International Seller Growth
Amazon
China6d8d
AI PM Intern
OpusClip
$25Palo AltoAI PM Intern
OpusClip
$25Palo Alto8d19d
AI Software Archtiect Intern
d-Matrix
Santa ClaraAI Software Archtiect Intern
d-Matrix
Santa Clara19d7d
Undergraduate Intern -- AI Engineering
Dell Technologies
Singapore, SingaporeUndergraduate Intern -- AI Engineering
Dell Technologies
Singapore, Singapore7d16d
Senior Applied Scientist, Alexa International
Amazon
US, WASenior Applied Scientist, Alexa International
Amazon
US, WA16d14d
Summer 2026 Intern - AI Research
Salesforce
$49 - $68California - Palo AltoSummer 2026 Intern - AI Research
Salesforce
$49 - $68California - Palo Alto14d19d
Manager, Internal AI Enablement & Agentic Systems
HubSpot
Remote$139K - $222KUnited StatesManager, Internal AI Enablement & Agentic Systems
HubSpot
Remote$139K - $222KUnited States19d6d
Summer 2026 Intern - Applied AI Strategy & Research
Salesforce
$44 - $53San Francisco, CASummer 2026 Intern - Applied AI Strategy & Research
Salesforce
$44 - $53San Francisco, CA6d2d
Data & AI Innovation Intern - 2026 Summer Internship
Nasdaq
United StatesData & AI Innovation Intern - 2026 Summer Internship
Nasdaq
United States2d25d
AI & Management Consulting Intern (Value Engineering - DACH Market)
Celonis
Munich, GermanyAI & Management Consulting Intern (Value Engineering - DACH Market)
Celonis
Munich, Germany25d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.