Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Augmented natural language for generative sequence labeling

Ben Athiwaratkun, Cicero Nogueira dos Santos, Jason Krone, Bing Xiang

EMNLP 2020

2020

We propose a generative framework for joint sequence labeling and sentence-level classification. Our model performs multiple sequence labeling tasks at once using a single, shared natural language output space. Unlike prior discriminative methods, our model naturally incorporates label semantics and shares knowledge across tasks. Our framework is general purpose, performing well on few-shot, low-resource

Conversational AI
Improve transformer models with better relative position embeddings

Zhiheng Huang, Davis Liang, Peng Xu, Bing Xiang

Findings of EMNLP 2020

2020

Transformer architectures rely on explicit position encodings in order to preserve a notion of word order. In this paper, we argue that existing work does not fully utilize position information. For example, the initial proposal of a sinusoid embedding is fixed and not learnable. In this paper, we first review absolute position embeddings and existing methods for relative position embeddings. We then propose

Conversational AI
Beyond [CLS] through ranking by generation

Cicero Nogueira dos Santos, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang

EMNLP 2020

2020

Generative models for Information Retrieval, where ranking of documents is viewed as the task of generating a query from a document’s language model, were very successful in various IR tasks in the past. However, with the advent of modern deep neural networks, attention has shifted to discriminative ranking functions that model the semantic similarity of documents and queries instead. Recently, deep generative

Conversational AI
Converting the point of view of messages spoken to virtual assistants

Isabelle G. Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Purva Kulkarni, Jack G. M. FitzGerald

EMNLP 2020

2020

Virtual Assistants can be quite literal at times. If a user says tell Bob I love him, most virtual assistants will extract the message I love him and send it to the user’s contact named Bob, rather than properly converting the message to I love you. We designed a system that takes a voice message from one user, converts the point of view of the message, and then delivers the result to its target user. We

Conversational AI
An empirical investigation towards efficient multi-domain language model pre-training

Kristjan Arumae, Qing Sun, Parminder Bhatia

EMNLP 2020

2020

Pre-training large language models has become a standard in the natural language processing community. Such models are pretrained on generic data (e.g. BookCorpus and English Wikipedia) and often fine-tuned on tasks in the same domain. However, in order to achieve state-of-the-art performance on out of domain tasks such as clinical named entity recognition and relation extraction, additional in domain pre-training

Conversational AI

3 questions about Interspeech 2018 with Björn Hoffmeister

Larry Hardesty

August 24, 2018

This year’s Interspeech — the largest conference in speech technology — will take place in Hyderabad, India, the first week of September. More than 40 Amazon researchers will be attending, including Björn Hoffmeister, the senior manager for machine learning in the Alexa Automatic Speech Recognition group. He took a few minutes to answer three questions about this year’s conference.

Conversational AI
Alexa, do I need to use your wake word? How about now?

Sri Harish Mallidi

August 23, 2018

Here’s a fairly common interaction with Alexa: “Alexa, set volume to five”; “Alexa, play music”. Even though the queries come in quick succession, the customer needs to repeat the wake word “Alexa”. To allow for more natural interactions, the device could immediately re-enter its listening state after the first query, without wake-word repetition; but that would require it to detect whether a follow-up speech input is indeed a query intended for the device (“device-directed”) or just background speech (“non-device-directed”).

Conversational AI
Public release of fact-checking dataset quickly begins to pay dividends

Larry Hardesty

August 19, 2018

At the annual meeting of the North American chapter of the Association for Computational Linguistics in June, researchers at Amazon and the University of Sheffield released a new dataset that can be used to train machine-learning systems to determine the veracity of factual assertions online. The dataset is called FEVER, for fact extraction and verification.

Search and information retrieval
Shrinking machine learning models for offline use

Grant Strimel

August 18, 2018

"Perfect hashing" is among the techniques that reduce the memory footprints of machine learning models by 94%.

Conversational AI
Automatic transliteration can help Alexa find data across language barriers

Yuval Merhav, Steve Ash

August 8, 2018

New machine-learned multilingual named-entity transliteration system.

Conversational AI
Contextual Clues Can Help Improve Alexa’s Speech Recognizers

Anirudh Raju

July 23, 2018

Automatic speech recognition systems, which convert spoken words into text, are an important component of conversational agents such as Alexa. These systems generally comprise an acoustic model, a pronunciation model, and a statistical language model. The role of the statistical language model is to assign a probability to the next word in a sentence, given the previous ones. For instance, the phrases “Pulitzer Prize” and “pullet surprise” may have very similar acoustic profiles, but statistically, one is far more likely to conclude a question that begins “Alexa, what playwright just won a … ?”

Conversational AI

Conversational AI

Publications

Related content

Work with us