Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

DiPCo - Dinner Party Corpus

Maarten Van Segbroeck, Zaid Ahmed, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas

Interspeech 2020

2019

We present a speech data corpus that simulates a “dinner party” scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English around a dining table. The participants were recorded by a single-channel close-talk microphone and by five far-field 7-microphone array devices positioned at

Related: Amazon Releases New Public Data Set to Help Address “Cocktail Party” Problem

Conversational AI
Learning to segment inputs for NMT shows preference for character-level processing

Julia Kreutzer, Artem Sokolov

AAAI 2019

2019

Most modern neural machine translation (NMT) systems rely on presegmented inputs. Segmentation granularity importantly determines the input and output sequence lengths, hence the modeling depth, and source and target vocabularies, which in turn determine model size, computational costs of softmax normalization, and handling of out-of-vocabulary words...

Conversational AI
ProductQnA: Answering user questions on e-commerce product pages

Ashish Kulkarni, Kartik Mehta, Shweta Garg, Vidit Bansal, Nikhil Rasiwasia, Srinivasan Sengamedu, "SHS"

WWW 2019 Workshop on ECNLP

2019

Product pages on e-commerce websites often overwhelm their customers with a wealth of data, making discovery of relevant information a challenge. Motivated by this, here, we present a novel framework to answer both factoid and non-factoid user questions on product pages. We propose several question-answer matching models leveraging both deep learned distributional semantics and semantics imposed by a structured

Conversational AI
Synonym expansion for large shopping taxonomies

Adrian Boteanu, Adam Kiezun, Shay Artzi

AKCB 2019

2019

We present an approach for expanding taxonomies with synonyms, or aliases. We target large shopping taxonomies, with thousands of nodes. A comprehensive set of entity aliases is an important component of identifying entities in unstructured text such as product reviews or search queries. Our method consists of two stages: we generate synonym candidates from WordNet and shopping search queries, then use

Conversational AI
Simple Question Answering with Subgraph Ranking and Joint-Scoring

Wenbo Zhao, Tagyoung Chung, Anuj Goyal, Angeliki Metallinou

NAACL 2019

2019

Knowledge graph based simple question answering (KBSQA) is a major area of research within question answering. Although only dealing with simple questions, i.e., questions that can be answered through a single knowledge base (KB) fact, this task is neither simple nor close to being solved. Targeting on the two main steps, subgraph selection and fact selection, the research community has developed sophisticated

Conversational AI

The FEVER data set: What doesn’t kill it will make it stronger

Christos Christodoulopoulos, Arpit Mittal

October 17, 2019

This year at EMNLP, we will cohost the Second Workshop on Fact Extraction and Verification — or FEVER — which will explore techniques for automatically assessing the veracity of factual assertions online.

Conversational AI
Tools for generating synthetic data helped bootstrap Alexa’s new-language releases

Janet Slifka

October 11, 2019

In the past few weeks, Amazon announced versions of Alexa in three new languages: Hindi, U.S. Spanish, and Brazilian Portuguese. Like all new-language launches, these addressed the problem of how to bootstrap the machine learning models that interpret customer requests, without the ability to learn from customer interactions.

Conversational AI
Amazon Releases New Public Data Set to Help Address “Cocktail Party” Problem

Zaid Ahmed, Maarten Van Segbroeck

October 01, 2019

Amazon today announced the public release of a new data set that will help speech scientists address the difficult problem of separating speech signals in reverberant rooms with multiple speakers. In the field of automatic speech recognition, this problem is known as the “cocktail party” or “dinner party” problem; accordingly, we call our data set the Dinner Party Corpus, or DiPCo.

Conversational AI
Amazon releases data set of annotated conversations to aid development of socialbots

Dilek Hakkani-Tür

September 17, 2019

Today I am happy to announce the public release of the Topical Chat Dataset, a text-based collection of more than 235,000 utterances (over 4,700,000 words) that will help support high-quality, repeatable research in the field of dialogue systems.

Conversational AI
Turning Dialogue Tracking into a Reading Comprehension Problem

Shuyang Gao

September 16, 2019

During a conversation between a customer and a dialogue system like Alexa’s, the system must not only understand what the customer is saying currently but also remember the conversation history. Only by combining the history with the current utterance can the system truly understand the customer’s requirements.

Conversational AI
Photo courtesy of Getty Images

The 16 Alexa-related papers at this year’s Interspeech

Larry Hardesty

September 10, 2019

At next week’s Interspeech, the largest conference on the science and technology of spoken-language processing, Alexa researchers have 16 papers, which span the five core areas of Alexa functionality.

Conversational AI

Conversational AI

Publications

Related content

Work with us