Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Two Tiered Distributed Training Algorithm for Acoustic Modeling

Pranav Ladkat, Oleg Rybakov, Radhika Arava, Sree Hari Krishnan Parthasarathi, I-Fan Chen, Nikko Ström

Interspeech 2019

2019

We present a hybrid approach for scaling distributed training of neural networks by combining the Gradient Threshold Compression (GTC) algorithm, a variant of stochastic gradient descent (SGD) which compresses gradients with thresholding and quantization techniques, and the Blockwise Model Update Filtering (BMUF) algorithm, a variant of model averaging (MA). In this proposed method we divide total number of

Related: Accelerating parallel training of neural nets

Conversational AI
Neural models for abusive language detection

Sravan Bodapati, Spandana Gella, Kasturi Bhattacharjee, Yaser Al-Onaizan

ACL 2019 Workshop on Abusive Language Online

2019

User-generated text on social media often suffers from a lot of undesired characteristics, including hate speech, abusive language, insults, etc. that are targeted to attack or abuse a specific group of people. Often such text is written differently compared to traditional text such as news, involving either explicit mention of abusive words, obfuscated words and typological errors or implicit abuse i.e

Conversational AI
Improving long distance slot carryover in spoken dialogue systems

Tongfei Chen, Chetan Naik, Hua He, Pushpendre Rastogi, Lambert Mathias

ACL 2019 Workshop on NLP for Conversational AI

2019

Tracking the state of the conversation is a central component in task-oriented spoken dialogue systems. One such approach for tracking the dialogue state is slot carryover, where a model makes a binary decision if a slot from the context is relevant to the current turn. Previous work on the slot carryover task used models that made independent decisions for each slot. A close analysis of the results show

Related: Who’s on First? How Alexa Is Learning to Resolve Referring Terms

Conversational AI
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning

Ladislav Mosner, Minhua Wu, Sree Hari Krishnan Parthasarathi, Roland Maas, Anirudh Raju, Kenichi Kumatani, Shiva Sundaram, Björn Hoffmeister

ICASSP 2019

2019

For real-world speech recognition applications, noise robustness is still a challenge. In this work, we adopt the teacher-student (T/S) learning technique using a parallel clean and noisy corpus for improving automatic speech recognition (ASR) performance under multimedia noise. On top of that, we apply a logits selection method which only preserves the k highest values to prevent wrong emphasis of knowledge

Related: Machine-labeled data + artificial noise = better speech recognition
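
The logits selection step mentioned in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: the function names are hypothetical, and renormalizing the k retained teacher logits with a softmax before computing a cross-entropy loss against the student is an assumption made only for illustration.

```python
import math


def top_k_soft_targets(teacher_logits, k):
    """Keep the k highest teacher logits and turn them into soft targets.

    Assumption for this sketch: discarded classes get probability 0 and the
    retained logits are renormalized with a softmax.
    """
    if not 1 <= k <= len(teacher_logits):
        raise ValueError("k must be between 1 and the number of classes")
    # Indices of the k largest logits, largest first.
    keep = sorted(range(len(teacher_logits)),
                  key=lambda i: teacher_logits[i], reverse=True)[:k]
    peak = teacher_logits[keep[0]]  # subtracted for numerical stability
    exps = {i: math.exp(teacher_logits[i] - peak) for i in keep}
    total = sum(exps.values())
    return [exps[i] / total if i in exps else 0.0
            for i in range(len(teacher_logits))]


def soft_cross_entropy(targets, student_logits):
    """Cross-entropy between soft targets and the student's softmax output."""
    peak = max(student_logits)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in student_logits))
    return -sum(t * (x - log_norm)
                for t, x in zip(targets, student_logits) if t > 0.0)


# Teacher sees clean audio, student sees the parallel noisy audio.
teacher_logits = [2.0, 1.0, 0.1, -1.0]
student_logits = [1.5, 1.2, 0.3, -0.5]
targets = top_k_soft_targets(teacher_logits, k=2)
loss = soft_cross_entropy(targets, student_logits)
```

With k=2, only the two strongest teacher classes contribute to the loss, so low-confidence teacher outputs no longer influence the student.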

Machine learning
Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity

Eunah Cho, He Xie, John Lalor, Varun Kumar, William M. Campbell

ASRU 2019

2019

Expanding new functionalities efficiently is an ongoing challenge for single-turn task-oriented dialogue systems. In this work, we explore functionality-specific semi-supervised learning via self-training. We consider methods that augment training data automatically from unlabeled data sets in a functionality-targeted manner. In addition, we examine multiple techniques for efficient selection of augmented

Related: Alexa’s ASRU papers concentrate on extracting high-value training data

Conversational AI

Adapting Alexa to regional language variations

Young-Bum Kim

June 11, 2019

As Alexa expands into new countries, she usually has to be trained on new languages. But sometimes, she has to be re-trained on languages she’s already learned. British English, American English, and Indian English, for instance, are different enough that for each of them, we trained a new machine learning model from scratch.

Conversational AI

Teaching Alexa to follow conversations

Arpit Gupta

June 06, 2019

New approach to reference resolution rewrites queries to clarify ambiguous references.

Conversational AI
Amazon Unveils Novel Alexa Dialog Modeling for Natural, Cross-Skill Conversations

Alexa Science Team

June 05, 2019

Today, customer exchanges with Alexa are generally either one-shot requests, like “Alexa, what’s the weather?”, or interactions that require multiple requests to complete more complex tasks.

Conversational AI
Using adversarial training to recognize speakers’ emotions

Viktor Rozgic

May 21, 2019

A person’s tone of voice can tell you a lot about how they’re feeling. Not surprisingly, emotion recognition is an increasingly popular conversational-AI research topic.

Conversational AI
Should Alexa read “2/3” as “two-thirds” or “February Third”?: The science of text normalization

Ming Sun

May 16, 2019

Text normalization is an important process in conversational AI. If an Alexa customer says, “book me a table at 5:00 p.m.”, the automatic speech recognizer will transcribe the time as “five p m”. Before a skill can handle this request, “five p m” will need to be converted to “5:00PM”. Once Alexa has processed the request, it needs to synthesize the response — say, “Is 6:30 p.m. okay?” Here, 6:30PM will be converted to “six thirty p m” for the text-to-speech synthesizer. We call the process of converting “5:00PM” to “five p m” text normalization and its counterpart — converting “five p m” to “5:00PM” — inverse text normalization.
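
The two directions described above can be made concrete with a toy, rule-based sketch for clock times. It is only an illustration under stated assumptions: the function names and lookup tables are hypothetical, only a handful of minute values are covered, and nothing here reflects how Alexa actually performs either step.

```python
import re

# Hypothetical lookup tables that cover only what this toy example needs.
HOURS = ["one", "two", "three", "four", "five", "six",
         "seven", "eight", "nine", "ten", "eleven", "twelve"]
MINUTES = {0: "", 15: "fifteen", 30: "thirty", 45: "forty five"}


def normalize_time(written):
    """Text normalization: written form to spoken form, e.g. "6:30PM"."""
    match = re.fullmatch(r"(\d{1,2}):(\d{2})\s*([AP])M", written)
    if match is None:
        raise ValueError(f"unsupported time format: {written!r}")
    hour, minute = int(match.group(1)), int(match.group(2))
    if not 1 <= hour <= 12 or minute not in MINUTES:
        raise ValueError(f"unsupported time value: {written!r}")
    words = [HOURS[hour - 1]]
    if MINUTES[minute]:
        words.append(MINUTES[minute])
    words += [match.group(3).lower(), "m"]
    return " ".join(words)


def denormalize_time(spoken):
    """Inverse text normalization: spoken form to written form."""
    tokens = spoken.split()
    if len(tokens) < 3 or tokens[-1] != "m" or tokens[-2] not in ("a", "p"):
        raise ValueError(f"unsupported spoken time: {spoken!r}")
    minute_lookup = {words: value for value, words in MINUTES.items()}
    minute_words = " ".join(tokens[1:-2])  # empty string means "on the hour"
    if tokens[0] not in HOURS or minute_words not in minute_lookup:
        raise ValueError(f"unsupported spoken time: {spoken!r}")
    hour = HOURS.index(tokens[0]) + 1
    return f"{hour}:{minute_lookup[minute_words]:02d}{tokens[-2].upper()}M"


print(denormalize_time("five p m"))  # recognizer output to skill input
print(normalize_time("6:30PM"))      # skill response to synthesizer input
```

Hand-written rules like these only handle the patterns they were written for, which is why the ambiguity in the title ("2/3" as a fraction or a date) is hard to resolve without context.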

Conversational AI
Training a Machine Learning Model in English Improves Its Performance in Japanese

Judith Gaspers

May 13, 2019

Recently, we published a paper showing that training a neural network to do language processing in English, then retraining it in German, drastically reduces the amount of German-language training data required to achieve a given level of performance.
