Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Contextual query rewriting (CQR): Natural language as interface for dialog state tracking

Pushpendre Rastogi, Arpit Gupta, Lambert Mathias

NAACL 2019

2019

We present a novel approach to dialogue state tracking and referring expression resolution tasks. Successful contextual understanding of multi-turn spoken dialogues requires resolving referring expressions across turns and tracking the entities relevant to the conversation across turns. Tracking conversational state is particularly challenging in a multi-domain scenario when there exist multiple spoken

Related: Teaching Alexa to follow conversations

Conversational AI
Contextual phonetic pre-training for end-to-end utterance-level language and speaker recognition

Shaoshi Ling, Julian Salazar, Katrin Kirchhoff

Interspeech 2019

2019

Pretrained contextual word representations in NLP have greatly improved performance on various downstream tasks. For speech, we propose contextual frame representations that capture phonetic information at the acoustic frame level and can be used for utterance-level language, speaker, and speech recognition. These representations come from the frame-wise intermediate representations of an end-to-end, self-attentive

Conversational AI
Simple, fast, accurate intent classification and slot labeling for dialogue systems

Arshit Gupta, John Hewitt, Katrin Kirchhoff

SIGDIAL 2019

2019

With the advent of conversational assistants like Amazon Alexa, Google Now, etc., dialogue systems are gaining a lot of traction, especially in industrial settings. These systems typically consist of a Spoken Language Understanding component, which in turn consists of two tasks: Intent Classification (IC) and Slot Labeling (SL)...

Conversational AI
Multi-passage BERT: A globally normalized BERT model for open-domain question answering

Zhiguo Wang, Patrick Ng, Xiaofei Ma, Ramesh Nallapati, Bing Xiang

EMNLP 2019

2019

The BERT model has been successfully applied to open-domain QA tasks. However, previous work trains BERT by viewing passages corresponding to the same question as independent training instances, which may cause incomparable scores for answers from different passages. To tackle this issue, we propose a multi-passage BERT model to globally normalize answer scores across all passages of the same question, and

Conversational AI
Domain adaptation with BERT-based domain classification and data selection

Xiaofei Ma, Zhiguo Wang, Ramesh Nallapati, Bing Xiang

EMNLP 2019 Workshop on DeepLo

2019

The performance of deep neural models can deteriorate substantially when there is a domain shift between training and test data. For example, the pre-trained BERT model can be easily fine-tuned with just one additional output layer to create a state-of-the-art model for a wide range of tasks. However, the fine-tuned BERT model suffers considerably at zero-shot when applied to a different domain. In this

Conversational AI

Machine translation accelerates how Alexa learns new languages

Penny Karanasou

May 29, 2018

As Alexa-enabled devices continue to expand into new countries, we propose an approach for quickly bootstrapping machine-learning models in new languages, with the aim of more efficiently bringing Alexa to new customers around the world.

Conversational AI
Amazon scientists use transfer learning to accelerate development of new Alexa capabilities

Angeliki Metallinou

May 24, 2018

Amazon scientists are continuously expanding Alexa’s natural-language-understanding (NLU) capabilities to make Alexa smarter, more useful, and more engaging.

Conversational AI
Amazon scientist outlines multilayer system for smart speaker echo cancellation and voice enhancement

Jun Yang

May 11, 2018

Smart speakers, such as the Amazon Echo family of products, are growing in popularity among consumer and business audiences. To improve the automatic speech recognition (ASR) and full-duplex voice communication (FDVC) performance of these smart speakers, acoustic echo cancellation (AEC) and noise reduction systems are required. These systems reduce the noise and echoes that can impair operation, such as an Echo device's ability to accurately hear the wake word “Alexa.”

Conversational AI
Amazon and University of Sheffield researchers make large-scale fact extraction and verification dataset publicly available

Arpit Mittal

May 04, 2018

In recent years, the amount of textual information produced daily has increased exponentially. This information explosion has been accelerated by the ease with which data can be shared across the web. Most of this textual information is generated as free-form text, and only a small fraction is available in a structured format (Wikidata, Freebase, etc.) that can be processed and analyzed directly by machines.

Search and information retrieval
Making Alexa more friction-free

Ruhi Sarikaya

April 25, 2018

This morning, I am delivering a keynote talk at the World Wide Web Conference in Lyon, France, titled “Conversational AI for Interacting with the Digital and Physical World.”

Conversational AI
Alexa scientists present two new techniques that improve wake word performance

Minhua Wu

April 12, 2018

The Amazon Echo is a hands-free smart home speaker you control with your voice. The first important step in enabling a delightful customer experience with an Echo or other Alexa-enabled device is wake word detection, so accurate detection of “Alexa” or substitute wake words is critical. It is challenging to build a wake word system with low error rates when the device has limited computational resources and operates in the presence of background noise such as speech or music.
