Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Two-pass endpoint detection for speech recognition

Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Tranh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

ASRU 2023

2023

Endpoint (EP) detection is a key component of far-field speech recognition systems that assist the user through voice commands. The endpoint detector has to trade-off between accuracy and latency, since waiting longer reduces the cases of users being cut-off early. We propose a novel two-pass solution for endpointing, where the utterance endpoint detected from a first pass endpointer is verified by a 2nd-pass

Conversational AI
Supervised fine-tuning of large language models on human demonstrations through the lens of memorization

Yubin Ge, Devamanyu Hazarika, Yang Liu, Mahdi Namazifar

NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

2023

In recent years, the field of natural language processing (NLP) has witnessed remarkable advancements driven by the development of large language models (LLMs). Various techniques, such as instruction tuning, have emerged as crucial approaches, enhancing LLMs’ adaptability to new tasks guided by instructional prompts. Meanwhile, the phenomenon of memorization within LLMs has garnered considerable attention

Conversational AI
Faithful model evaluation for model-based metrics

Palash Goyal, Qian Hu, Rahul Gupta

EMNLP 2023

2023

Statistical significance testing is used in natural language processing (NLP) to determine whether the results of a study or experiment are likely to be due to chance or if they reflect a genuine relationship. A key step in significance testing is the estimation of confidence interval which is a function of sample variance. Sample variance calculation is straightforward when evaluating against ground truth

Conversational AI
Can unpaired textual data replace synthetic speech in ASR model adaptation?

Pasquale D'Alterio, Christian Hensel, Bashar Awwad Shiekh Hasan

2023 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

2023

To boost training and adaptation of end to end (E2E) automatic speech recognition (ASR) models, several approaches to use paired speech-text input together with unpaired text input have emerged. They aim at improving the model performance on rare words, personalisation, and long tail. In this work, we present a systematic study of the impact of such training/adaptation and compare it to training with synthetic

Conversational AI
Evaluating large language models on controlled generation tasks

Jiao Sun, Yufei Tian, Wangchunshu Zhou, Nan Xu, Qian Hu, Rahul Gupta, John Wieting, Nanyun Peng, Xuezhe Ma

EMNLP 2023

2023

While recent studies have looked into the abilities of large language models in various benchmark tasks, few studies have looked into the controllability of large language models on generation tasks. We present a systematic and extensive analysis of the controllability of large language models on ten benchmarks, including a new simple yet challenging numerical planning benchmark with different granularities

Conversational AI

Credit: Glynis Condon

Ensuring that new language-processing models don't backslide

Yi-An Lai

August 04, 2021

New approach corrects for cases when average improvements are accompanied by specific regressions.

Machine learning
Amazon at ACL: How to teach machines to reason

Larry Hardesty

July 29, 2021

Amazon’s Dan Roth on a hot new research topic — that he’s been studying for more than 25 years.

Conversational AI
Credit: Glynis Condon

Alexa Prize Socialbot Grand Challenge 4 finalists announced

Alexa Prize team

July 19, 2021

Five teams to compete for $500,000 first prize; winners will be announced in August 2021.

Conversational AI
Alexa & Friends features Amazon Scholar Julia Hirschberg

Staff writer

July 15, 2021

Hirschberg explains why mastering empathetic speech is critical for successful dialogue systems.

Conversational AI
Amazon paper exposes biases in unreliable-news datasets

Heba Elfardy, Christos Christodoulopoulos, Thomas Butler

July 15, 2021

The paper, which received honorable mention at EACL, presents guidelines for better analysis and construction of datasets.

Conversational AI
Searching video using natural-language descriptions

Mohsen Malmir

July 12, 2021

New method uses cross-attention and multitask training to improve the accuracy and training efficiency of video moment retrieval.

Search and information retrieval

Conversational AI

Publications

Related content

Work with us