Data Scientist, AWS Quick Data
Posted 1 day ago
Job Description
Amazon Quick Suite is an enterprise AI platform that transforms how organizations work with their data and knowledge. Combining generative AI-powered search, deep research capabilities, intelligent agents and automations, and comprehensive business intelligence, Quick Suite serves tens of thousands of users. Our platform processes thousands of queries monthly, helping teams make faster, data-driven decisions while maintaining enterprise-grade security and governance. From natural language interactions with complex datasets to automated workflows and custom AI agents, Quick Suite is redefining workplace productivity at unprecedented scale.
We are seeking a Data Scientist II to join our Quick Data team, focusing on evaluation and benchmarking data development for Quick Suite features, with particular emphasis on Research and other generative AI capabilities. Our mission is to engineer high-quality datasets that are essential to the success of Amazon Quick Suite. From human evaluations and Responsible AI safeguards to Retrieval-Augmented Generation and beyond, our work ensures that Generative AI is enterprise-ready, safe, and effective for users at scale.
As part of our diverse team—including data scientists, engineers, language engineers, linguists, and program managers—you will collaborate closely with science, engineering, and product teams. We are driven by customer obsession and a commitment to excellence.
Key job responsibilities
In this role, you will leverage data-centric AI principles to assess the impact of data on model performance and the broader machine learning pipeline. You will apply Generative AI techniques to evaluate how well our data represents human language and conduct experiments to measure downstream interactions.
Specific responsibilities include:
* Design and develop comprehensive evaluation and benchmarking datasets for Quick Suite AI-powered features
* Leverage LLMs for synthetic data corpora generation; data evaluation and quality assessment using LLM-as-a-judge settings
* Create ground truth datasets with high-quality question-answer pairs across diverse domains and use cases
* Lead human annotation initiatives and model evaluation audits to ensure data quality and relevance
* Develop and refine annotation guidelines and quality frameworks for evaluation tasks
* Conduct statistical analysis to measure model performance, identify failure patterns, and guide improvement strategies
* Collaborate with ML scientists and engineers to translate evaluation insights into actionable product improvements
* Build scalable data pipelines and tools to support continuous evaluation and benchmarking efforts
* Contribute to Responsible AI initiatives by developing safety and fairness evaluation datasets
About the team
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Hybrid Work
We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices.
We are seeking a Data Scientist II to join our Quick Data team, focusing on evaluation and benchmarking data development for Quick Suite features, with particular emphasis on Research and other generative AI capabilities. Our mission is to engineer high-quality datasets that are essential to the success of Amazon Quick Suite. From human evaluations and Responsible AI safeguards to Retrieval-Augmented Generation and beyond, our work ensures that Generative AI is enterprise-ready, safe, and effective for users at scale.
As part of our diverse team—including data scientists, engineers, language engineers, linguists, and program managers—you will collaborate closely with science, engineering, and product teams. We are driven by customer obsession and a commitment to excellence.
Key job responsibilities
In this role, you will leverage data-centric AI principles to assess the impact of data on model performance and the broader machine learning pipeline. You will apply Generative AI techniques to evaluate how well our data represents human language and conduct experiments to measure downstream interactions.
Specific responsibilities include:
* Design and develop comprehensive evaluation and benchmarking datasets for Quick Suite AI-powered features
* Leverage LLMs for synthetic data corpora generation; data evaluation and quality assessment using LLM-as-a-judge settings
* Create ground truth datasets with high-quality question-answer pairs across diverse domains and use cases
* Lead human annotation initiatives and model evaluation audits to ensure data quality and relevance
* Develop and refine annotation guidelines and quality frameworks for evaluation tasks
* Conduct statistical analysis to measure model performance, identify failure patterns, and guide improvement strategies
* Collaborate with ML scientists and engineers to translate evaluation insights into actionable product improvements
* Build scalable data pipelines and tools to support continuous evaluation and benchmarking efforts
* Contribute to Responsible AI initiatives by developing safety and fairness evaluation datasets
About the team
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Hybrid Work
We value innovation and recognize this sometimes requires uninterrupted time to focus on a build. We also value in-person collaboration and time spent face-to-face. Our team affords employees options to work in the office every day or in a flexible, hybrid work model near one of our U.S. Amazon offices.
Amazon
127 jobs posted
About the job
Posted on
Apr 1, 2026
Apply before
May 1, 2026
Job typeFull-time
CategoryData Science
Location
US, CA
Skills
Similar Jobs
30d
Data Scientist
Paypal
Central Singapore, SingaporeData Scientist
Paypal
Central Singapore, Singapore30d30d
Data Scientist
Paypal
Dublin, County Dublin, IrelandData Scientist
Paypal
Dublin, County Dublin, Ireland30d27d
Data Scientist
Visa
Melbourne, AustraliaData Scientist
Visa
Melbourne, Australia27d28d
Data Scientist
Paypal
Bangalore, Karnataka, IndiaData Scientist
Paypal
Bangalore, Karnataka, India28d27d
Data Scientist
Hitachi
Chennai, Tamil Nadu, IndiaData Scientist
Hitachi
Chennai, Tamil Nadu, India27d23d
Data Scientist
Anthropic
$275K - $370KSan Francisco, CANew York City, NYData Scientist
Anthropic
$275K - $370KSan Francisco, CANew York City, NY23d22d
Data Scientist
Visa
Paris, FranceData Scientist
Visa
Paris, France22d19d
Data Scientist
Visa
$145KWashington, DCData Scientist
Visa
$145KWashington, DC19d17d
Data Scientist
Anthropic
$275K - $370KSan Francisco, CANew York City, NYData Scientist
Anthropic
$275K - $370KSan Francisco, CANew York City, NY17d14d
Data Scientist
Reddit
$197K - $230KSan Francisco, CAData Scientist
Reddit
$197K - $230KSan Francisco, CA14d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.