Amazon Science homepage

In a keynote address at the latest Amazon Machine Learning Conference, Amazon academic research consultant, Stanford professor, and recent Nobel laureate Guido Imbens offered insights on the estimation of causal effects in “panel data” settings.

David Chang/Getty Images/iStockphoto

Amazon opens new AI lab in San Francisco focused on long-term research bets

The Amazon AGI SF Lab will focus on developing new foundational capabilities for enabling useful AI agents.

The Amazon Nova family of models: Technical report and model card

Training infrastructure, benchmarks, responsible-AI methodology, and more.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

QKD and authentication: Separating facts from myths

January 14, 2025

Key exchange protocols and authentication mechanisms solve distinct problems and must be integrated in a secure communication system.

Quantum technologies
The 10 most viewed publications of 2024

December 24, 2024
The 10 most viewed blog posts of 2024

December 24, 2024
New AWS tool recommends removal of unused permissions

December 19, 2024

Automated reasoning
Understanding the training dynamics of transformers

December 18, 2024

Machine learning

View all

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

An irregular polyhedron suspended in midair, with shadows projected onto each of three orthogonal surfaces: one shadow is a triangle, one a square, and the third a circle.

Preskill wins prize for work on learning and quantum computing

Caltech professor and Amazon Scholar John Preskill wins Bell Prize for applying both classical and quantum computing to the problem of learning from quantum experiments.

Improving lip-synchrony in direct audio-visual speech-to-speech translation

Lucas Goncalves, Prashant Mathur, Xing Niu, Brady Houston, Chandrashekhar Lavania, Srikanth Vishnubhotla, Lijia Sun, Anthony Ferritto

ICASSP 2025

2025

Audio-Visual Speech-to-Speech Translation (AVS2S) typically prioritizes improving translation quality and naturalness. However, an equally critical aspect in audio-visual content is lip-synchrony—ensuring that the movements of the lips match the spoken content—essential for maintaining realism in dubbed videos. Despite its importance, the inclusion of lip-synchrony constraints in AVS2S models has been largely

Conversational AI
Lightweight neural front-ends for low-resource on-device text-to-speech

Giulia Comini, Heereen Shim, Sam Ribeiro

ICASSP 2025

2025

We propose a lightweight neural front-end framework for on-device speech generation and highlight its benefits towards low-resource language scaling. While data-driven models have shown potential in front-end literature, especially since they can enable fast language expansion, they are often extremely large and of high latency. There is limited work focusing on their usability in real-time settings, and

Conversational AI
Learning rich speech representations with acoustic-semantic factorization

Sandy Niu, Najmeh Sadoughi, Abhishek Yanamandra, Pichao Wang, Zhu Liu, Vimal Bhat, Liz Norred

ICASSP 2025

2025

Self-supervised pretraining has transformed speech representation learning, enabling models to generalize across various downstream tasks. However, empirical studies have highlighted two notable gaps. First, different speech tasks require varying levels of acoustic and semantic information, which are encoded at different layers within the model. This adds the extra complexity of layer selection on downstream

Machine learning
SEAL: Speaker error correction using acoustic-conditioned large language models

Anurag Kumar, Rohit Paturi, Amber Afshan, Sundararajan Srinivasan

ICASSP 2025

2025

Speaker Diarization (SD) is a crucial component of modern end-to-end ASR pipelines. Traditional SD systems, which are typically audio-based and operate independently of ASR, often introduce speaker errors, particularly during speaker transitions and overlapping speech. Recently, language models including fine-tuned large language models (LLMs) have shown to be effective as a second-pass speaker error corrector

Conversational AI
V-MIND: Building versatile monocular indoor 3D detector with diverse 2D annotations

Jin-Cheng Jhang, Tao Tu, Fu-En Wang, Ke Zhang, Min Sun, Cheng-Hao Kuo

WACV 2025

2025

The field of indoor monocular 3D object detection is gaining significant attention, fueled by the increasing demand in VR/AR and robotic applications. However, its advancement is impeded by the limited availability and diversity of 3D training data, owing to the labor-intensive nature of 3D data collection and annotation processes. In this paper, we present V-MIND (Versatile Monocular INdoor Detector),

Computer vision

AAAI 2025

February 25 - March 4, 2025

Philadelphia, Pennsylvania

Machine learning

WACV 2025

February 28 - March 4, 2025

Tucson, Arizona

Computer vision

WSDM 2025

March 10 - 14, 2025

Hannover, Germany

Search and information retrieval

KDD 2025

August 3 - 7, 2025

Toronto, Ontario

Information and knowledge management

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

JORDAN STEAD/(JORDAN STEAD / Amazon)

Amazon Trusted AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and LLM coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Academia

Work with us