Credit: Glynis Condon

Seven Amazon scientists shaping the future of AI

To commemorate International Women’s Day, we spoke to women scientists across a variety of research areas at Amazon.

To commemorate International Women’s Day (IWD), during Women's History Month, we asked scientists across a variety of Amazon research areas about their backgrounds and the most exciting innovations in their fields. Here’s what they had to say.

Xin Luna Dong, principal scientist


Dong is a principal scientist, leading the efforts to develop the Amazon Product Knowledge Graph. Dong received her PhD in computer science, with a focus on data integration, from the University of Washington. The personal information management system in Dong’s dissertation (which won the Best Demo award at SIGMOD 2005) is a personal knowledge graph developed at least five years before the phrase “knowledge graph” was coined. After graduation, Dong led the development of the Knowledge-based Trust project at Google. Dong has co-authored the book Big Data Integration. She is an ACM Distinguished Member, and has received the VLDB Early Career Research Contribution Award for “advancing the state of the art of knowledge fusion”. Dong was program committee co-chair for SIGMOD 2018, and is program committee co-chair for VLDB 2020. She also serves on the VLDB Endowment and PVLDB advisory committees.

Innovations I find exciting

A recent innovation that I’m most excited about is the graph neural network (GNN). Unlike recurrent neural networks (RNNs) and convolutional neural networks (CNNs), which focus much more on regular data such as word sequences, 2-D images, and 3-D videos, GNNs allow us to leverage graphs to capture much more complex relationships. A graph represents elements as nodes and the relationships between them as edges. Examples of graphs that GNNs can be applied to include social networks, world wide web (WWW) topology, knowledge graphs, and molecular graphs. As we build knowledge graphs for products, it is amazing how many different ways we can benefit from GNNs.
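To make the nodes-and-edges idea concrete, here is a minimal sketch of one round of neighbor aggregation, the basic building block that GNNs repeat and combine with learned weights. It is an illustration only, not the method used for the Amazon Product Knowledge Graph: the node names, edges, and feature values are hypothetical, and the learned weight matrices and nonlinearities of a real GNN are deliberately left out.

```python
# Hypothetical toy graph: nodes are entities, edges are relationships.
edges = [("album", "artist"), ("artist", "label"), ("album", "genre")]

# Hypothetical two-dimensional feature vector for each node.
features = {
    "album": [1.0, 0.0],
    "artist": [0.0, 1.0],
    "label": [1.0, 1.0],
    "genre": [0.0, 0.0],
}


def build_neighbors(edge_list):
    """Turn an undirected edge list into a node -> neighbors mapping."""
    neighbors = {}
    for u, v in edge_list:
        neighbors.setdefault(u, []).append(v)
        neighbors.setdefault(v, []).append(u)
    return neighbors


def message_passing_step(node_features, neighbors):
    """One round of mean aggregation: each node averages its own
    features with those of its direct neighbors. A real GNN layer
    would also apply learned weights and a nonlinearity here."""
    updated = {}
    for node, own in node_features.items():
        group = [own] + [node_features[n] for n in neighbors.get(node, [])]
        updated[node] = [sum(vals) / len(group) for vals in zip(*group)]
    return updated


neighbors = build_neighbors(edges)
updated = message_passing_step(features, neighbors)
for node, vector in updated.items():
    print(node, vector)
```

After one step, each node's vector reflects its immediate neighborhood; repeating the step lets information travel across longer paths in the graph.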

Naturally, we can apply GNNs on the knowledge graphs we have built to discern interesting patterns, for example to find popular artists in the music domain. We also model webpage layouts as graphs, and model customer behaviors as graphs, so GNNs help us extract relevant knowledge and enrich our knowledge graphs. This new technique enables us to be so much more creative in the practice of constructing knowledge graphs, and applying the findings to real-world applications.

Claire Law, senior technical program manager


Law is senior technical program manager on Amazon’s physical retail team, the team behind the Just Walk Out technology used in Amazon Go and Amazon Go Grocery stores. She studied nanotechnology engineering at the University of Waterloo. Early on, she realized that she didn’t enjoy the type of lab work expected from a researcher in material science. She leveraged a university program, and interned as a software developer, marketer, hardware test engineer, project control officer in consulting, and software program manager. These experiences, coupled with work experiences at Microsoft and Research in Motion, led Law to pursue a career in software.

After a stint in Amazon’s international organization, Law joined the physical retail team to work on machine vision initiatives. On this team, Law is able to leverage her experience in cloud computing and knowledge of optics and photography to build new experiences for physical retail.

Innovations I find exciting

We are only now reaching a level where computer vision can solve real-world problems in a meaningful way. While we still need to be creative in where we look for simplifiers, algorithms are able to solve more and more problems every day. Challenges that looked insurmountable just a couple of years ago are now part of production systems across the industry. Checkout-free stores seemed like science fiction before Amazon Go was launched, and now customers are loving this effortless shopping experience in the 25 Amazon Go stores and the new Amazon Go Grocery store we have open today.

Yoelle Maarek, vice president

Yoelle Maarek, vice president of research and science for Alexa Shopping

Maarek is vice president of research and science, Alexa Shopping. Prior to Amazon, Maarek served in engineering and research leadership roles at Yahoo, Google and IBM. Maarek has regularly served as program committee (PC) chair and senior PC member at leading academic research conferences related to Web search and data mining, such as SIGIR, The Web Conference, and Web Search and Data Mining (WSDM). She is currently serving on the steering committees of WSDM and The Web Conference series.

She is a member of the Technion Board of Governors and was inducted as an ACM Fellow in 2013. She holds an engineering degree from the Ecole des Ponts et Chaussées, and a DEA (graduate degree) in computer science from Paris VI university. Maarek obtained her PhD in computer science from the Technion, Israel, in 1989 and was a visiting student at Columbia University. She played a pioneering role within industry in researching the field of information retrieval, the computer science discipline behind search, in the pre-Web era, and led the launch of Google Suggest, the query auto-completion capability. As such, she jokingly refers to herself as a “search dinosaur”.

Innovations I find exciting

We are on the verge of making ambient computing happen, and Alexa is pioneering this long-awaited revolution. It forces us to revisit all our assumptions across multiple domains. I see this especially in search and question answering, topics close to my heart. I have been following progress in these areas since I got my PhD thirty years ago. The focus on ambient computing is also a unique opportunity for us at Amazon to demonstrate what we mean by customer-obsessed science. As humans are learning to interact with machines, their behavior is evolving and we need to follow suit. It not only challenges scientists to keep inventing on behalf of customers but also forces all of us to remain humble. We are not here to teach customers how to speak to a machine, but rather to do everything in our power to understand, satisfy and predict their needs so as to constantly wow and delight them.

Angeliki Metallinou, applied science manager


Metallinou is an applied science manager within the Amazon Alexa AI Natural Understanding group. She received both her PhD and master’s degree in electrical engineering from the University of Southern California. Her interests and experience lie in the areas of spoken and natural language understanding, dialogue systems, machine learning, deep learning, affective computing and applications for education and healthcare.

She has published papers in the areas of speech, language, dialogue, artificial intelligence and multimodal human-computer interaction at leading science conferences such as Interspeech, the AAAI Conference on Artificial Intelligence, and the Association for Computational Linguistics (ACL). She served as an area chair for Interspeech 2016, and has reviewed papers for several science conferences.

Innovations I find exciting

It is exciting to see how new techniques in deep learning continuously push the boundaries of the state of the art in the fields of dialogue and spoken language processing. I’m very interested in advances around unsupervised, semi-supervised and transfer learning, which allow deep learning models to leverage the power of large corpora without relying on costly and time-consuming manual annotations. Pre-trained language models like BERT and GPT-2 and their use in downstream applications are just a few examples. These innovations are particularly relevant for industry applications where scalability is key.

I am also excited about recent literature in deep learning that is allowing us to develop models to perform complex tasks like higher-level reasoning, for example, over the contents of a document or an image or both, as opposed to simpler classification tasks. I’m also excited to see how these methods can have a positive impact on people through their deployment in products, especially in applications in healthcare, accessibility and education.

Priya Ponnapalli, principal deep learning scientist


Ponnapalli is a senior manager and principal deep learning scientist within the Amazon ML Solutions Lab, where she leads a global team of data scientists who help AWS customers accelerate their adoption of ML and cloud technologies across industries, from healthcare and finance to sports. As the leader of Amazon ML Solutions Lab’s sports business, Ponnapalli works with customers including the National Football League (NFL), Six Nations Rugby, and Formula 1 (F1), to name just a few, to enhance the fan experience and transform sports using ML.

Ponnapalli is also a senior research affiliate at the Genomic Signal Processing Lab at the University of Utah, and a faculty member at Rutgers Business School, where she teaches ML to business leaders, and works to inspire the next generation of leaders. Prior to joining AWS, she co-founded Eigengene, a data-driven personalized medicine startup, and helped companies like Genentech and Roche establish and build data science teams. For her PhD in electrical and computer engineering at the University of Texas at Austin, Ponnapalli defined and demonstrated the higher-order generalized singular value decomposition (HO GSVD), the only framework that can create a single coherent model from multiple two-dimensional datasets by extending the GSVD from two to more than two matrices.

Innovations I find exciting

As an Amazon ML Solutions Lab scientist, I’m most excited about real-world applications of ML across industries. I’m interested in innovations to overcome challenges with the small, limited datasets that companies often have to contend with. I’m also intrigued by model interpretability and explainability, which are key to earning trust and spurring broad adoption. I’m passionate about making ML accessible to all, so it can be used to solve some of the most important problems we are facing, from fighting climate change to treating cancer.

Ana Pinheiro Privette, senior program manager


Ana Pinheiro Privette is a senior program manager with Amazon's Sustainability group. She joined the Sustainability Science and Innovation team in September 2017 as the program lead for the Amazon Sustainability Data Initiative (ASDI), a program that seeks to leverage Amazon’s scale, technology, and infrastructure to help create more global innovation for sustainability. ASDI is a Tech-for-Good project and is a joint effort between Amazon Sustainability and the AWS Open Data team focusing on democratizing access to key data and analytical capabilities to anyone working in the sustainability space.

Privette was trained as an environmental engineer and as an earth sciences researcher at the New University of Lisbon (Portugal) and at MIT. She did her doctoral research work at NASA in the Washington, D.C., area, and as part of her project she spent a couple of years running scientific field work sites in Africa to support a NASA international field campaign. After spending most of her career at NASA and NOAA as a scientist, Privette led projects for the White House climate portfolio, including the Obama Climate Data Initiative and the Partnership for Resilience and Preparedness (PREP), both focused on delivering better access to and use of US Federal climate data to support decision makers.

Innovations I find exciting

As part of ASDI, I work very closely with AWS customers developing applications in the space of sustainability to understand what challenges they may be experiencing and how we may accelerate sustainability research and innovation by minimizing the cost and time required to acquire and analyze large sustainability datasets. The ASDI currently works with scientific organizations like NOAA, NASA, the UK Met Office and Government of Queensland to identify, host, and deploy key datasets on the AWS Cloud, including weather observations, weather forecasts, climate projection data, satellite imagery, hydrological data, air quality data, and ocean forecast data. These datasets are publicly available to anyone.

In addition, ASDI provides cloud grants to those interested in exploring the use of AWS’ technology and scalable infrastructure to solve big, long-term sustainability challenges with this data. The dual-pronged approach allows sustainability researchers to analyze massive amounts of data in mere minutes, regardless of where they are in the world or how much local storage space or computing capacity they can access.

Nashlie Sephus, manager, applied science

Nashlie Sephus, applied scientist, Amazon Web Services machine learning team.
Credit: Terrence Wells@PoetWilliamsPhotography

Sephus is an applied scientist on AWS’s artificial intelligence team, focusing on computer vision. In this role, Sephus focuses on the fairness and accuracy of the team’s algorithms. Sephus formerly led the Amazon Visual Search team in Atlanta, which launched visual search for replacement parts on the Amazon Shopping app in June 2018. This technology resulted from Amazon's acquisition of Partpic, an Atlanta startup where she was the chief technology officer (CTO). Prior to working at Partpic, she received her PhD in 2014 from the School of Electrical and Computer Engineering at the Georgia Institute of Technology. She received her bachelor’s degree in computer engineering in 2007 from Mississippi State University.

Innovations I find exciting

Since the onset of machine learning and artificial intelligence, neural networks (such as convolutional neural networks (CNNs) and generative adversarial networks (GANs)) and learning algorithms have always excited me. It’s being able to quickly and automatically draw patterns from data, whether it be images, video, or audio at scale, that fascinates me. Since music was my first love (along with karaoke!), music information retrieval has always been a passion of mine. These innovations, when used responsibly and fairly, are able to benefit people in their everyday activities.

Research areas

Related content

US, WA, Bellevue
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 
Besides theoretical analysis and quality framework development, an Applied Scientist will also work closely with talented engineers, domain experts, and vendor teams to put quality strategies and automated judging systems into practice.
US, MA, Boston
The Artificial General Intelligence (AGI) team is seeking a dedicated, skilled, and innovative Applied Scientist with a robust background in machine learning, statistics, quality assurance, auditing methodologies, and automated evaluation systems to ensure the highest standards of data quality, to build industry-leading technology with Large Language Models (LLMs) and multimodal systems. Key job responsibilities As part of the AGI team, an Applied Scientist will collaborate closely with core scientist team developing Amazon Nova models. They will lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows. This includes designing auditing strategies with detailed SOPs, quality metrics, and sampling methodologies that help Nova improve performances on benchmarks. The Applied Scientist will perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities. A critical aspect of this role involves developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment. The Applied Scientist will also set up the configuration of data collection workflows and communicate quality feedback to stakeholders. An Applied Scientist will also have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services. A day in the life An Applied Scientist with the AGI team will support quality solution design, conduct root cause analysis on data quality issues, research new auditing methodologies, and find innovative ways of optimizing data quality while setting examples for the team on quality assurance best practices and standards. 