Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Low-bit quantization and quantization-aware training for small-footprint keyword spotting

Yuriy Mishchenko, Yusuf Goren, Ming Sun, Chris Beauchene, Spyros Matsoukas, Oleg Rybakov, Shiv Naga Prasad Vitaladevuni

ICML 2019

2019

In this paper, we investigate novel quantization approaches to reduce memory and computational footprint of deep neural network (DNN) based keyword spotters (KWS). We propose a new method for KWS offline and online quantization, which we call dynamic quantization, where we quantize DNN weight matrices column-wise, using each column’s exact individual min-max range, and the DNN layers’ inputs and outputs
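As a rough illustration of the column-wise idea mentioned in the abstract above, the sketch below quantizes each column of a weight matrix against that column's own min-max range. It is only a minimal sketch under stated assumptions: the function names `quantize_columns` and `dequantize_columns`, the affine rounding scheme, and the default of 8 bits are hypothetical choices for this example and are not taken from the paper, whose full method (including how layer inputs and outputs are handled) is not described here.

```python
def quantize_columns(weights, bits=8):
    """Quantize a row-major matrix column by column (hypothetical helper).

    Each column gets its own (minimum, scale) pair, so a column with a
    narrow value range is not penalized by a wide-ranging neighbor.
    """
    levels = (1 << bits) - 1  # largest integer code, e.g. 255 for 8 bits
    n_rows, n_cols = len(weights), len(weights[0])
    codes = [[0] * n_cols for _ in range(n_rows)]
    ranges = []
    for j in range(n_cols):
        column = [weights[i][j] for i in range(n_rows)]
        lo, hi = min(column), max(column)
        # Guard against constant columns, where hi == lo.
        scale = (hi - lo) / levels if hi > lo else 1.0
        ranges.append((lo, scale))
        for i in range(n_rows):
            codes[i][j] = round((weights[i][j] - lo) / scale)
    return codes, ranges


def dequantize_columns(codes, ranges):
    """Map integer codes back to approximate floating-point weights."""
    return [
        [lo + code * scale for code, (lo, scale) in zip(row, ranges)]
        for row in codes
    ]


if __name__ == "__main__":
    weights = [[0.0, -1.0], [0.5, 1.0], [1.0, 0.0]]
    codes, ranges = quantize_columns(weights, bits=4)
    print(codes)
    print(dequantize_columns(codes, ranges))
```

With per-column ranges, the reconstruction error for any weight stays within half of that column's step size, which is the motivation for not sharing one range across the whole matrix.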

Conversational AI
Domain-Independent turn-level Dialogue Quality Evaluation via User Satisfaction Estimation

Praveen Kumar Bodigutla, Spyros Matsoukas, Longshaokan Marshall Wang, Kate Ridgeway, Joshua Levy, Swanand Joshi, Alborz Geramifard

SIGDIAL 2019 Workshop on Implications of Deep Learning for Dialog Modeling

2019

An automated metric to evaluate dialogue quality is vital for optimizing data-driven dialogue management. The common approach of relying on explicit user feedback during a conversation is intrusive and sparse. Current models to estimate user satisfaction use limited feature sets and rely on annotation schemes with low inter-rater reliability, limiting generalizability to conversations spanning multiple

Conversational AI
A Closer Look at Latent Space Data Augmentation for Few-Shot Intent Classification

Varun Kumar, Hadrien Glaude, William M. Campbell

EMNLP 2019 Workshop on DeepLo

2019

New conversation topics and functionalities are constantly being added to conversational AI agents like Amazon Alexa and Apple Siri. As data collection and annotation is not scalable and is often costly, only a handful of examples for the new functionalities are available, which results in poor generalization performance. We formulate it as a Few-Shot Integration (FSI) problem where a few examples are used

Machine learning
Bootstrapping Conversational Speech Recognition System using Neural Machine Translation

Surabhi Punjabi, Harish Arsikere, Sri Garimella

ASRU 2019

2019

Building a conversational speech recognition system for a new language is constrained by the availability of interaction style utterances. Data collection is often expensive and limited by the speed of manual transcription. In this work, we advocate the use of neural machine translation as a data augmentation technique for bootstrapping language models in factored speech recognition systems. Translation

Conversational AI
Graph-based Semi-Supervised Learning for Natural Language Processing

Chris Qiu, Eunah Cho, Xiaochun Ma, William M. Campbell

EMNLP 2019 Workshop on TextGraphs

2019

Semi-supervised learning is an efficient method to augment training data automatically from unlabeled data. Development of many natural language understanding (NLU) applications has a challenge where unlabeled data is relatively abundant while labeled data is rather limited. In this work, we propose transductive graph based semi-supervised learning models as well as their inductive variants for NLU. We evaluate

Conversational AI

Leveraging unannotated data to bootstrap Alexa functions more quickly

Anuj Goyal

January 22, 2019

Developing a new natural-language-understanding system usually requires training it on thousands of sample utterances, which can be costly and time-consuming to collect and annotate. That’s particularly burdensome for small developers, like many who have contributed to the library of more than 70,000 third-party skills now available for Alexa.

Conversational AI

New method for compressing neural networks better preserves accuracy

Anish Acharya, Rahul Goel

January 15, 2019

Neural networks have been responsible for most of the top-performing AI systems of the past decade, but they tend to be big, which means they tend to be slow. That’s a problem for systems like Alexa, which depend on neural networks to process spoken requests in real time.

Conversational AI
How Alexa may learn to retrieve stored "memories"

Rasool Fakoor

December 21, 2018

In May 2018, Amazon launched Alexa’s Remember This feature, which enables customers to store “memories” (“Alexa, remember that I took Ben’s watch to the repair store”) and recall them later by asking open-ended questions (“Alexa, where is Ben’s watch?”).

Search and information retrieval
How Alexa knows “peanut butter” is one shopping-list item, not two

Sanchit Agarwal

December 18, 2018

At a recent press event on Alexa's latest features, Alexa’s head scientist, Rohit Prasad, mentioned multistep requests in one shot, a capability that allows you to ask Alexa to do multiple things at once. For example, you might say, “Alexa, add bananas, peanut butter, and paper towels to my shopping list.” Alexa should intelligently figure out that “peanut butter” and “paper towels” name two items, not four, and that bananas are a separate item.

Conversational AI
With New Data Representation Scheme, Alexa Can Better Match Skills to Customer Requests

Young-Bum Kim

December 17, 2018

In recent years, data representation has emerged as an important research topic within machine learning.

Conversational AI
New Approach to Language Modeling Reduces Speech Recognition Errors by Up to 15%

Ankur Gandhe

December 13, 2018

Language models are a key component of automatic speech recognition systems, which convert speech into text. A language model captures the statistical likelihood of any particular string of words, so it can help decide between different interpretations of the same sequence of sounds.
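To make the idea of scoring competing interpretations concrete, here is a toy sketch of a bigram language model with add-one smoothing. It is an assumption-laden illustration only: the tiny training corpus, the function names `train_bigram` and `log_prob`, and the choice of a smoothed bigram model are all hypothetical and are not the approach described in the post. Words never seen in training are not added to the vocabulary, so the scores are useful for comparison rather than as exact probabilities.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count contexts and word pairs over whitespace-tokenized sentences."""
    contexts, bigrams = Counter(), Counter()
    vocab = {"</s>"}
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        vocab.update(tokens[1:])
        contexts.update(tokens[:-1])            # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))
    return contexts, bigrams, len(vocab)


def log_prob(sentence, model):
    """Log-likelihood of a word string under the add-one-smoothed bigram model."""
    contexts, bigrams, vocab_size = model
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    return sum(
        math.log((bigrams[(prev, word)] + 1) / (contexts[prev] + vocab_size))
        for prev, word in zip(tokens, tokens[1:])
    )


if __name__ == "__main__":
    model = train_bigram([
        "add peanut butter to my list",
        "add paper towels to my list",
        "play some music",
    ])
    # Two transcriptions that could plausibly match similar sounds.
    candidates = ["add peanut butter to my list", "add peanut but her to my list"]
    best = max(candidates, key=lambda text: log_prob(text, model))
    print(best)
```

A recognizer could combine a score like this with its acoustic evidence, preferring the word string that the language model considers more likely.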
