- 2024: Intent detection is a critical component of task-oriented dialogue systems (TODS), enabling the identification of suitable actions to address user utterances at each dialogue turn. Traditional approaches have relied on computationally efficient supervised sentence-transformer encoder models, which require substantial training data and struggle with out-of-scope (OOS) detection. The emergence of generative
- Instruction following is a key capability for LLMs. However, recent studies have shown that LLMs often struggle with instructions containing multiple constraints (e.g., a request to create a social media post "in a funny tone" with "no hashtag"). Despite this, most evaluations focus solely on synthetic data. To address this, we introduce RealInstruct, the first benchmark designed to evaluate LLMs' ability
- 2024: Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching models often do not generalize well to new data, and collecting exhaustive labeled training data is often cost prohibitive. Further, recent efforts have adopted LLMs for
- 2024: Training with mixed data distributions is a common and important part of creating multi-task and instruction-following models. The diversity of the data distributions and the cost of joint training make the optimization procedure extremely challenging. Data-mixing methods partially address this problem, albeit with suboptimal performance across data sources, and they require multiple expensive training runs.
- 2024: We propose a constraint-learning schema for fine-tuning large language models (LLMs) with attribute control. Given a training corpus and control criteria formulated as a sequence-level constraint on model outputs, our method fine-tunes the LLM on the training corpus while enhancing constraint satisfaction with minimal impact on its utility and generation quality. Specifically, our approach regularizes the
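The constraint-regularized fine-tuning idea in the last abstract can be sketched, under assumptions, as a standard language-modeling loss plus a weighted penalty on the sequence-level constraint-violation rate. The weight `lam` and the `violation_rate` measure below are illustrative stand-ins, not details from the paper:

```python
def regularized_loss(lm_loss: float, violation_rate: float, lam: float = 0.5) -> float:
    """Combine the usual language-modeling loss with a sequence-level
    constraint penalty; lam trades off utility against constraint satisfaction.
    """
    return lm_loss + lam * violation_rate

# Two hypothetical batches with equal LM loss: the one whose generated
# sequences satisfy the constraint more often receives the lower (better) loss.
loss_satisfying = regularized_loss(lm_loss=2.0, violation_rate=0.1)
loss_violating = regularized_loss(lm_loss=2.0, violation_rate=0.8)
assert loss_satisfying < loss_violating
```

Setting `lam = 0` recovers plain fine-tuning, so the penalty weight directly controls how much generation quality may be traded for constraint satisfaction.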