Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Entity contrastive learning in a large-scale virtual assistant system

Jonathan Rubin, Jason Crowley, George Leung, Morteza Ziyadi, Maria Minakova

ACL 2023

2023

Conversational agents are typically made up of domain (DC) and intent classifiers (IC) that identify the general subject an utterance be-longs to and the specific action a user wishes to achieve. In addition, named entity recognition (NER) performs per token labeling to identify specific entities of interest in a spoken utterance. We investigate improving joint IC and NER models using entity contrastive

Conversational AI
Context-aware transformer pre-training for answer sentence selection

Luca Di Liello, Siddhant Garg, Alessandro Moschitti

ACL 2023

2023

Answer Sentence Selection (AS2) is a core component for building an accurate Question Answering pipeline. AS2 models rank a set of candidate sentences based on how likely they answer a given question. The state of the art in AS2 exploits pre-trained transformers by transferring them on large annotated datasets, while using local contextual information around the candidate sentence. In this paper, we propose

Conversational AI
Personalized predictive ASR for latency reduction in voice assistants

Andreas Schwarz, Di He, Maarten Van Segbroeck, Mohammed Hethnawi, Ariya Rastrow

Interspeech 2023

2023

Streaming Automatic Speech Recognition (ASR) in voice assistants can utilize prefetching to partially hide the latency of response generation. Prefetching involves passing a preliminary ASR hypothesis to downstream systems in order to prefetch and cache a response. If the final ASR hypothesis after endpoint detection matches the preliminary one, the cached response can be delivered to the user, thus saving

Conversational AI
“I’m fully who I am”: Towards centering transgender and non-binary voices to measure biases in open language generation

Anaelia Ovalle, Palash Goyal, Jwala Dhamala, Zachary Jaggers, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

ACM FAccT 2023

2023

Warning: This paper contains examples of gender non-affirmative language which could be offensive, upsetting, and/or triggering. Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life. Given the recent popularity and adoption of language generation technologies, the potential to further marginalize this population only grows. Although a multitude

Conversational AI
Learning answer generation using supervision from automatic question answering evaluators

Matteo Gabburo, Siddhant Garg, Rik Koncel-Kedziorski, Alessandro Moschitti

ACL 2023

2023

Recent studies show that sentence-level extractive QA, i.e., based on Answer Sentence Selection (AS2), is outperformed by Generationbased QA (GenQA) models, which generate answers using the top-k answer sentences ranked by AS2 models (a la retrieval-augmented generation style). In this paper, we propose a novel training paradigm for GenQA using supervision from automatic QA evaluation models (GAVA). Specifically

Conversational AI

Nine university teams selected to compete in the Alexa Prize Socialbot Grand Challenge 4

Alexa Prize team

November 03, 2020

Fourth challenge features four new teams.

Conversational AI
credit: Glynis Condon

More-natural prosody for synthesized speech

Shubhi Tyagi, Sri Karlapati

October 30, 2020

Prosody transfer technique addresses the problem of “source speaker leakage”, while prosody selection model better matches prosody to semantic content.

Conversational AI
Successes, challenges and opportunities for speech technology in conversational agents

Staff writer

October 29, 2020

Watch the replay of Shehzad Mevawalla's Interspeech 2020 keynote talk.

Conversational AI
Alexa scientists discuss relevant work in the field of conversational AI

Staff writer

October 29, 2020

Watch the replay of the Interspeech 2020 industry forum session.

Conversational AI
From "Efficient minimum word error rate training of RNN-transducer for end-to-end speech recognition"

Amazon’s new research on automatic speech recognition

Björn Hoffmeister

October 29, 2020

Interspeech papers include novel approaches to speaker identification and the training of end-to-end speech recognition models.

Conversational AI
How Alexa scientists are advancing speech science

Staff writer

October 28, 2020

Watch as four Amazon Alexa scientists talk about current state, new developments, and recent announcements surrounding advancements in Alexa speech technologies.

Conversational AI

Conversational AI

Publications

Related content

Work with us