- Interspeech 2019: This paper proposes a Sub-band Convolutional Neural Network for spoken term classification. Convolutional neural networks (CNNs) have proven to be very effective in acoustic applications such as spoken term classification, keyword spotting, speaker identification, and acoustic event detection. Unlike applications in computer vision, the spatial-invariance property of 2D convolutional kernels does not fit…
- AAAI 2019: Deep learning models have become the state of the art for natural language processing (NLP) tasks; however, deploying these models in production systems poses significant memory constraints. Existing compression methods are either lossy or introduce significant latency. We propose a compression method that leverages low-rank matrix factorization during training to compress the word embedding layer, which represents…
- AAAI 2019: User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this paper, we explore techniques to efficiently transfer the knowledge from these unlabeled utterances to improve model performance on Spoken Language Understanding (SLU) tasks. We use Embeddings from Language Models (ELMo) to take advantage of unlabeled data by learning contextualized word representations. Additionally…
- Interspeech 2019: Any given classification problem can be modeled using a multi-class or One-vs-All (OVA) architecture. An OVA system consists of as many OVA models as there are classes, providing the advantage of asynchrony: each OVA model can be retrained independently of the other models. This is particularly advantageous in settings where scalable model training is a consideration (for instance, in an industrial environment…
- Interspeech 2019: This work aims at bootstrapping acoustic model training with a small amount of human-annotated speech data and a large amount of unlabeled speech data for automatic speech recognition. Semi-supervised learning techniques were investigated to select automatically transcribed training samples. Two semi-supervised learning methods were proposed: one is the local-global uncertainty based…
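The AAAI 2019 compression abstract above mentions low-rank matrix factorization of the word embedding layer. The paper's training-time procedure is not shown here; the following is only a minimal numpy sketch of the idea, using truncated SVD and illustrative shapes (the vocabulary size, embedding dimension, and rank are all made-up values):

```python
import numpy as np

# Hypothetical shapes: vocab size V, embedding dim D, chosen rank r.
V, D, r = 10000, 300, 32

rng = np.random.default_rng(0)
E = rng.standard_normal((V, D))    # stand-in for a dense V x D embedding table

# Truncated SVD yields the best rank-r approximation in the Frobenius norm.
U, s, Vt = np.linalg.svd(E, full_matrices=False)
A = U[:, :r] * s[:r]               # V x r factor
B = Vt[:r, :]                      # r x D factor; lookup becomes A[token] @ B

# Parameter count drops from V*D to r*(V + D).
full_params = V * D
factored_params = r * (V + D)
print(full_params, factored_params)  # 3000000 329600
```

The saving is substantial whenever r is much smaller than both V and D, at the cost of restricting the embedding table to rank r.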
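The One-vs-All abstract above describes training one independent binary model per class. As a hedged illustration of that structure (not the paper's actual models), here is a toy numpy sketch in which each class gets its own logistic-regression model that could be retrained without touching the others:

```python
import numpy as np

rng = np.random.default_rng(0)

def train_binary(X, y, steps=500, lr=0.3):
    """One independent OVA model: logistic regression via gradient descent."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-X @ w))
        w -= lr * X.T @ (p - y) / len(y)
    return w

# Toy 3-class data with well-separated centers; a bias column is appended.
centers = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]
X = np.concatenate([rng.normal(c, 0.5, size=(50, 2)) for c in centers])
X = np.hstack([X, np.ones((len(X), 1))])
labels = np.repeat([0, 1, 2], 50)

# One binary model per class, trained on "this class vs. everything else".
# Any single entry of `models` can be re-fit independently of the rest.
models = {k: train_binary(X, (labels == k).astype(float)) for k in range(3)}

# Prediction: pick the class whose OVA model gives the highest score.
pred = np.stack([X @ models[k] for k in range(3)], axis=1).argmax(axis=1)
print("accuracy:", (pred == labels).mean())
```

The asynchrony the abstract mentions falls out of the dictionary structure: adding or re-training one class's model leaves the other entries untouched.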
Related content
- January 23, 2020: New "Mad Libs" technique for replacing words in individual sentences is grounded in metric differential privacy.
- January 21, 2020: Self-learning system uses customers’ rephrased requests as implicit error signals.
- January 16, 2020: According to listener tests, whispers produced by a new machine learning model sound as natural as vocoded human whispers.
- December 11, 2019: Related data selection techniques yield benefits for both speech recognition and natural-language understanding.
- November 06, 2019: Today is the fifth anniversary of the launch of the Amazon Echo, so in a talk I gave yesterday at the Web Summit in Lisbon, I looked at how far Alexa has come and where we’re heading next.
- October 28, 2019: In a paper we’re presenting at this year’s Conference on Empirical Methods in Natural Language Processing, we describe experiments with a new data selection technique.