Customer-obsessed science

Amazon Science Fulfillment Center OAK4 in Tracy, CA

How task decomposition and smaller LLMs can make AI more affordable

September 19, 2024

“Agentic workflows” that use multiple, fine-tuned smaller LLMs — rather than one large one — can improve efficiency.

Machine learning
Accounting for cognitive bias in human evaluation of large language models

September 16, 2024

A position paper presented at ACL proposes a framework for more-accurate human evaluation of LLMs.

Conversational AI
Better-performing “25519” elliptic-curve cryptography

September 10, 2024

Automated reasoning and optimizations specific to CPU microarchitectures improve both performance and assurance of correct implementation.

Automated reasoning
Conference calendar
- ECCV 2024
  
  Computer vision
  
  September 29 - October 4, 2024
- IROS 2024
  
  Robotics
  
  October 14 - 18, 2024
- CIKM 2024
  
  Information and knowledge management
  
  October 21 - 25, 2024

AmazonScience_ARA_Fall2024_092424_MC_Fall 2024.jpg

Amazon Research Awards issues fall 2024 call for proposals

Amazon Research Awards team

September 25, 2024

Now open until November 6, Amazon Research Awards will be seeking proposals in the following research areas: AI for Information Security, Automated Reasoning, AWS AI, AWS Cryptography, and Sustainability.

DynaPipe: Optimizing multi-task training through dynamic pipelines

Chenyu Jiang, Zhen Jia, Shuai Zheng, Yida Wang, Chuan Wu

EuroSys 2023

2023

Multi-task model training has been adopted to enable a single deep neural network model (often a large language model) to handle multiple tasks (e.g., question answering and text summarization). Multi-task training commonly receives input sequences of highly different lengths due to the diverse contexts of different tasks. Padding (to the same sequence length) or packing (short examples into long sequences

Cloud and systems
Adversarial density ratio estimation for change point detection

Shreyas S, Prakash Mandayam Comar, Sivaramakrishnan (Siva) Kaveri

CIKM 2023

2023

Change Point Detection (CPD) models are used to identify abrupt changes in the distribution of a data stream and have a widespread practical use. CPD methods generally compare the distribution of data sequences before and after a given time step to infer if there is a shift in distribution at the said time step. Numerous divergence measures, which measure distance between data distributions of sequence

Machine learning
ForeSeer: Product aspect forecasting using temporal graph embedding

Zixuan Liu, Gaurush Hiranandani, Kun Qian, Eddie Huang, Yi Xu, Belinda Zeng, Karthik Subbian, sheng wang

CIKM 2023

2023

Developing text mining approaches to mine aspects from customer reviews has been well-studied due to its importance in understand-ing customer needs and product attributes. In contrast, it remains unclear how to predict the future emerging aspects of a new product that currently has little review information. This task, which we named product aspect forecasting, is critical for recommending new products

Machine learning
GeMQuAD: Generating multilingual question answering datasets from large language models using few shot learning

Amani Namboori, Shivam Mangale, Andy Rosenbaum, Saleh Soltan

NeurIPS 2023 Workshop on SyntheticData4ML

2023

The emergence of Large Language Models (LLMs) with capabilities like In-Context Learning (ICL) has ushered in new possibilities for data generation across various domains while minimizing the need for extensive data collection and modeling techniques. Researchers have explored ways to use this generated synthetic data to optimize smaller student models for reduced deployment costs and lower latency in downstream

Conversational AI
Tree-regularized tabular embeddings

Xuan Li, Yun Wang, Bo Li

NeurIPS 2023 Workshop on Table Representation Learning

2023

Tabular neural network (NN) has attracted remarkable attentions and its recent advances have gradually narrowed the performance gap with respect to tree-based models on many public datasets. While the mainstreams focus on calibrating NN to fit tabular data, we emphasize the importance of homogeneous embeddings and alternately concentrate on regularizing tabular inputs through supervised pretraining. Specifically

Machine learning

Career opportunities

We look for talent from around the world for applied scientists, data scientists, economists, research scientists, scholars, academics, PhDs, and interns.
Academic collaborations

We collaborate with leading academic organizations to drive innovation and to ensure that research is creating solutions whose benefits are shared broadly.
Photo by Zak Brickett

Awards and recognitions

Learn more about the awards and recognitions that Amazon researches from around the world have been honored with during their tenure.

Customer-obsessed science

Conference calendar

Publications

Resources

Work with us