Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Towards open-world product attribute mining: A lightly-supervised approach

Liyan Xu, Chenwei Zhang, Xian Li, Jingbo Shang, Jinho Choi

ACL 2023

2023

We present a new task setting for attribute mining on e-commerce products, serving as a practical solution to extract open-world attributes without extensive human intervention. Our supervision comes from a high-quality seed attribute set bootstrapped from existing resources, and we aim to expand the attribute vocabulary of existing seed types, and also to discover any new attribute types automatically.

Conversational AI
MASSIVE: A 1M-example multilingual natural language understanding dataset with 51 typologically-diverse languages

Jack G. M. FitzGerald, Chris Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gokhan Tur, Prem Natarajan

ACL 2023

2023

We present the MASSIVE dataset— Multilingual Amazon Slu resource package (SLURP) for Slot-filling, Intent classification, and Virtual assistant Evaluation. MASSIVE contains 1M realistic, parallel, labeled virtual assistant utterances spanning 51 languages, 18 domains, 60 intents, and 55 slots. MASSIVE was created by tasking professional translators to localize the English-only SLURP dataset into 50 typologically

Related: Amazon releases 51-language dataset for language understanding

Conversational AI
Efficient data selection employing Semantic Similarity-based Graph Structures for model training

Roxana-Maria Petcu, Subhadeep Maji

ICML 2023 Workshop on Sampling and Optimization in Discrete Spaces

2023

Recent developments in natural language processing (NLP) have highlighted the need for substantial amounts of data for models to capture textual information accurately. This raises concerns regarding the computational resources and time required for training such models. This paper introduces SEmantics for data SAliency in Model performance Estimation (SeSaME). It is an efficient data sampling mechanism

Conversational AI
Automatic table union search with tabular representation learning

Xuming Hu, Shen Wang, Xiao Qin, Chuan Lei, Zhengyuan Shen, Asterios Katsifodimos, Christos Faloutsos, George Karypis, Lijie Wen, Philip S. Yu

ACL Findings 2023

2023

Given a data lake of tabular data as well as a query table, how can we retrieve all the tables in the data lake that can be unioned with the query table? Table union search constitutes an essential task in data discovery and preparation as it enables data scientists to navigate massive open data repositories. Existing methods identify uniability based on column representations (word surface forms or token

Conversational AI
Exploring continual learning for code generation models

Prateek Yadav, Qing Sun, Hantian Ding, Xiaopeng LI, Dejiao Zhang, Ming Tan, Xiaofei Ma, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Mohit Bansal, Bing Xiang

ACL 2023

2023

Large-scale code generation models such as Codex and CodeT5 have achieved impressive performance. However, libraries are upgraded or deprecated very frequently and re-training large-scale language models is computationally expensive. Therefore, Continual Learning (CL) is an important aspect that remains under-explored in the code domain. In this paper, we introduce a benchmark called CODETASK-CL that covers

Conversational AI

_{Projection image adapted from Michael Horvath under the CC BY-SA 4.0 license}

New method for compressing neural networks better preserves accuracy

Anish Acharya, Rahul Goel

January 15, 2019

Neural networks have been responsible for most of the top-performing AI systems of the past decade, but they tend to be big, which means they tend to be slow. That’s a problem for systems like Alexa, which depend on neural networks to process spoken requests in real time.

Conversational AI
How Alexa may learn to retrieve stored "memories"

Rasool Fakoor

December 21, 2018

In May 2018, Amazon launched Alexa’s Remember This feature, which enables customers to store “memories” (“Alexa, remember that I took Ben’s watch to the repair store”) and recall them later by asking open-ended questions (“Alexa, where is Ben’s watch?”).

Search and information retrieval
How Alexa knows “peanut butter” is one shopping-list item, not two

Sanchit Agarwal

December 18, 2018

At a recent press event on Alexa's latest features, Alexa’s head scientist, Rohit Prasad, mentioned multistep requests in one shot, a capability that allows you to ask Alexa to do multiple things at once. For example, you might say, “Alexa, add bananas, peanut butter, and paper towels to my shopping list.” Alexa should intelligently figure out that “peanut butter” and “paper towels” name two items, not four, and that bananas are a separate item.

Conversational AI
With New Data Representation Scheme, Alexa Can Better Match Skills to Customer Requests

Young-Bum Kim

December 17, 2018

In recent years, data representation has emerged as an important research topic within machine learning.

Conversational AI
New Approach to Language Modeling Reduces Speech Recognition Errors by Up to 15%

Ankur Gandhe

December 13, 2018

Language models are a key component of automatic speech recognition systems, which convert speech into text. A language model captures the statistical likelihood of any particular string of words, so it can help decide between different interpretations of the same sequence of sounds.

Conversational AI
Distributed “Re-Ranker” ensures that Alexa improvements reach customers ASAP

Chengwei Su

December 11, 2018

Suppose that you say to Alexa, “Alexa, play Mary Poppins.” Alexa must decide whether you mean the book, the video, or the soundtrack. How should she do it?

Conversational AI

Conversational AI

Publications

Related content

Work with us