Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Faithful low-resource data-to-text generation through cycle training

Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

ACL 2023

2023

Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained lan-guage models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not avail-able for specific domains, leading us to seek an unsupervised approach to improve

Conversational AI
Bias invariant approaches for improving word embedding fairness

Siyu Liao, Rongting Zhang, Barbara Poblete, Vanessa Murdock

CIKM 2023

2023

Many public pre-trained word embeddings have been shown to encode different types of biases. Embeddings are often obtained from training on large pre-existing corpora, and therefore resulting biases can be a reflection of unfair representations in the original data. Bias, in this scenario, is a challenging problem since current mitigation techniques require knowing and understanding existing biases in the

Conversational AI
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Guangyang Zhang, Tom Merritt, Sam Ribeiro, Biel Tura Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo Trueba

Interspeech 2023

2023

Neural text-to-speech systems are often optimized on L1/L2 losses, which make strong assumptions about the distributions of the target data space. Aiming to improve those assumptions, Normalizing Flows and Diffusion Probabilistic Models were recently proposed as alternatives. In this paper, we compare traditional L1/L2-based approaches to diffusion and flow-based approaches for the tasks of prosody and

Conversational AI
UseClean: Learning from complex noisy labels in named entity recognition

Jinjin Tian, Kun Zhou, Meiguo Wang, Yu Zhang, Benjamin Yao, Xiaohu Liu, Chenlei (Edward) Guo

ACL 2023 Workshop on Learning with Small Data

2023

We investigate and refine denoising methods for NER task on data that potentially contains extremely noisy labels from multi-sources. In this paper, we first summarized all possible noise types and noise generation schemes, based on which we built a thorough evaluation system. We then pinpoint the bottleneck of current state-of-art denoising methods using our evaluation system. Correspondingly, we propose

Conversational AI
Predicting interaction quality of conversational assistants with spoken language understanding model confidences

Yue Gao, Enrico Piovano, Tamer Soliman, Monir Moniruzzaman, Anoop Kumar, Melanie Bradford, Nandi Subhrangshu

CIKM 2023

2023

In conversational AI assistants, SLU models are part of a complex pipeline composed of several modules working in harmony. Hence, an update to the SLU model needs to ensure improvements not only in the model specific metrics but also in the overall conversational assistant. Specifically, the impact on user interaction quality metrics must be factored in, while integrating interactions with distal modules

Conversational AI

Context-aware deep-learning method boosts Alexa dialogue system’s ability to recognize conversation topics by 35%

Behnam Hedayatnia

December 4, 2018

Method factors in the utterances that immediately preceded the target utterance and its classification as a “dialogue act”

Conversational AI
Varying speaking styles with neural text-to-speech

Trevor Wood, Tom Merritt

November 19, 2018

Amazon scientists have shown that our latest text-to-speech (TTS) system, which uses a generative neural network, can learn to employ a newscaster style from just a few hours of training data.

Conversational AI
Reducing Customer Friction through Skill Selection

Young-Bum Kim

October 31, 2018

This year, we’ve started to explore ways to make it easier for customers to find and engage with Alexa skills.

Conversational AI
Photo credit: Sharaf Maksumov / Shutterstock.com

Amazon helps launch workshop on automatic fact verification

Larry Hardesty

October 25, 2018

At the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Amazon researchers and their colleagues at the University of Sheffield and Imperial College London will host the first Workshop on Fact Extraction and Verification, which will explore how computer systems can learn to recognize false assertions online.

Search and information retrieval
How an Echo device could locate snaps, claps, and taps

Jun Yang

October 4, 2018

Parallel processing of microphone inputs and separate detectors for periodicity and dynamics improve performance.

Conversational AI
Identifying sounds in audio streams

Chieh-Chi Kao, Weiran Wang

October 2, 2018

On September 20, Amazon unveiled a host of new products and features, including Alexa Guard, a smart-home feature available on select Echo devices later this year. When activated, Alexa Guard can send a customer alerts if it detects the sound of glass breaking or of smoke or carbon monoxide alarms in the home.

Conversational AI

Conversational AI

Publications

Related content

Work with us