Amazon Science homepage

How AI is changing the nature of mathematical research

Animation shows a map of the United States and each of the 8 individual regions that resulted from Amazon's regionalization effort

How Amazon reworked its fulfillment network to meet customer demand

Graviton5’s improved design increases speed and energy efficiency — beyond Moore’s law

Amazon News

The fuel of the future is already here: Why TRISO matters

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Amazon and University of Michigan give robots a sense of touch

July 10, 2026

5 min read

HydroShear, a new physics-based simulator, teaches robots how to use their sense of touch to perform complex manipulation tasks, in a way that transfers seamlessly to the real world.

Robotics
Capturing token IDs during agentic interactions for better reinforcement learning

July 9, 2026

10 min read

Machine learning
EC2’s formally verified “isolation engine” provides mathematical assurance of virtual-machine isolation

June 10, 2026

7 min read

Automated reasoning
Real-world grounding in agentic AI

June 8, 2026

7 min read

Machine learning
Bridging intent and execution in agentic systems

June 8, 2026

18 min read

Cloud and systems

View all

Coming soon: Season 2

Hosted by Danielle Perszyk, cognitive scientist at Amazon's AGI Lab, the podcast features researchers tackling the hardest problems in agentic AI — from building reliable perception systems to designing training environments that mirror human learning.

AWS and Hopkins Engineering announce database for AI/ML antibody design

The Antibody Developability Benchmark is powered by one of the most diverse antibody datasets in public literature, enabling transparent performance evaluation for AI-guided antibody design.

2026 Amazon Nova AI Challenge: Trusted Software Agents track

Challenge pushes teams to demonstrate measurable gains in secure-coding performance while building AI agents that advance real-world utility and reliability at scale.

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Stress tests REVEAL fragile temporal and visual grounding in video-language models

Sethuraman T V, Savya Khosla, Aditi Tiwari, Vidya Ganesh, Rakshana Jayaprakash, Aditya Jain, Vignesh Srinivasakumar, Onkar Kishor Susladkar, Srinidhi Sunkara , Aditya Shanmugham, Abbaas Alif Mohamed Nishar, Rakesh Vaideeswaran Mahesh, Simon Jenni, Derek Hoiem

ICML 2026 Workshop on Mechanistic Interpretability

2026

Video-Language Models (VidLMs) achieve strong benchmark scores, yet these scores often hide whether models use the video at all. We show that VidLM failures follow two pathways: some visual signals are never reliably encoded, while others are encoded but overridden by model priors. We introduce REVEAL, a diagnostic stress-test benchmark for quantifying when and why VidLMs under-use visual evidence. REVEAL

Computer vision
Seekable OCI: Lazy-loading container images via range-request indexing

James Thompson, Jesse Butler, Sri Saran Balaji Rajakumar, Henry Wang

arXiv

2026

Container image pulling accounts for the majority of pod startup time in Kubernetes environments. Standard pull down loads the entire image before the container can start, even when the application accesses only a fraction of the image content at startup. We present SOCI (Seekable OCI), a lazy-loading architecture that enables containers to start without downloading the full image. SOCI builds an external

Cloud and systems
Confidence-aware multi-agent orchestration for evaluating multimodal rule compliance

Jie Zhou, Zhenyu Zhang, Xiaolong Kuang, XINYANG SHEN

KDD 2026 Workshop on Evaluation and Trustworthiness of Agentic AI

2026

Evaluating rule compliance in industry requires assessing products against complex regulatory standards using multimodal data sources—a task where both correctness and trustworthiness of automated judgments are critical. Existing approaches either rely on costly human audits, supervised classifiers that demand large-scale labeled training data, or monolithic multimodal models that apply uniform reasoning

Computer vision
Composite-attribute person re-identification via pose-guided disentanglement

Kartik Patwari, Sol Vesdapunt, ChienYi Wang, Dawei Li, Cong Phuoc Huynh, Ning Zhou, Chen-Nee Chuah, Kah Kuen Fu

CVPR 2026

2026

Recent advancement in vision-language models have enabled multi-modal person re-identification (Re-ID), where the system takes both an image and a text query to identify matching individuals. While previous state-of-the-art methods perform well with detailed, sentence-level descriptions, we found that their Recall@1 drops by half when using short, keyword-based queries due to ambiguity, training biases,

Computer vision
Diagnostic knowledge graphs: Automated benchmark construction and deterministic evaluation for multi-step reasoning agents

Chuci Chen, Zhenyu Zhang, XINYANG SHEN

KDD 2026 Workshop on Evaluation and Trustworthiness of Agentic AI

2026

Evaluating multi-step diagnostic reasoning in LLM agents remains an open problem. When cause labels are extracted from resolved operational cases (customer-service tickets, incident reports, clinical notes), the resulting gold standards exhibit extreme vocabulary explosion—5,076 unique cause strings from 2,196 tickets on a single symptom, 92% appearing only once—making LLM-as-judge protocols variance-prone

Search and information retrieval

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us