- 2024: Large Language Models (LLMs) are powerful models for generation tasks, but they may not generate good-quality outputs on their first attempt. Apart from model fine-tuning, existing approaches to improving prediction accuracy and quality typically involve LLM self-improvement / self-reflection, which incorporates feedback from the models themselves. Despite their effectiveness, these methods are hindered by their…
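Such a self-reflection loop can be sketched in a few lines. The `llm` callable and the prompts below are illustrative stand-ins for any real model API, not the method described in the paper:

```python
def self_refine(llm, task_prompt, max_rounds=3):
    """Generate, critique, and revise until the critic is satisfied.

    `llm` is any callable mapping a prompt string to a response string;
    it stands in for a real model API (hypothetical interface).
    """
    draft = llm(task_prompt)
    for _ in range(max_rounds):
        critique = llm(f"Critique this answer:\n{draft}")
        if "no issues" in critique.lower():
            break  # the model judges its own output acceptable
        draft = llm(f"Task: {task_prompt}\nAnswer: {draft}\n"
                    f"Feedback: {critique}\nRevise the answer.")
    return draft

# Toy stand-in model: flags the first draft, then approves the revision.
calls = {"n": 0}
def toy_llm(prompt):
    calls["n"] += 1
    if prompt.startswith("Critique"):
        return "Too short." if calls["n"] <= 2 else "No issues."
    return f"answer v{calls['n']}"
```

The loop terminates either when the self-critique passes or after a fixed round budget, which is the usual guard against the model critiquing forever.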
- ACL Findings 2024: Visual Question Answering (VQA) often involves diverse reasoning scenarios across Vision and Language (V&L). Most prior VQA studies, however, have merely focused on assessing the model’s overall accuracy without evaluating it on different reasoning cases. Furthermore, some recent works observe that conventional Chain-of-Thought (CoT) prompting fails to generate effective reasoning for VQA, especially for…
- 2024: Machine translation is used in e-commerce to translate second-language queries into the primary language of the store, to be matched by the search system against the product catalog. However, many queries contain spelling mistakes. We first present an analysis of the spelling-robustness of a population of MT systems, quantifying how spelling variations affect MT output, the list of returned products, and…
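One way to quantify how a spelling variation perturbs the returned product list is to run both spellings through the pipeline and measure set overlap. The `translate` and `search` functions and the toy catalog below are hypothetical placeholders, not the systems analyzed in the paper:

```python
def jaccard(a, b):
    """Overlap between two product-ID collections (1.0 = identical results)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 1.0

def spelling_robustness(translate, search, query, misspelled_query):
    """Score how much one spelling variation changes the search results."""
    clean = search(translate(query))
    noisy = search(translate(misspelled_query))
    return jaccard(clean, noisy)

# Toy pipeline: identity "translation" and a keyword-match "catalog".
catalog = {"p1": "red shoes", "p2": "red shirt", "p3": "blue shoes"}
toy_translate = lambda q: q
toy_search = lambda q: [pid for pid, name in catalog.items()
                        if any(w in name for w in q.split())]
```

Averaging this score over a sample of (query, misspelling) pairs gives a single robustness number per MT system, which makes populations of systems comparable.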
- The issue of popularity bias—where popular items are disproportionately recommended, overshadowing less popular but potentially relevant items—remains a significant challenge in recommender systems. Recent advancements have seen the integration of general-purpose Large Language Models (LLMs) into the architecture of such systems. This integration raises concerns that it might exacerbate popularity bias…
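Popularity bias is commonly measured by comparing the average popularity of recommended items against the catalog average. A minimal sketch, with illustrative item names and interaction counts (not data from the paper):

```python
from statistics import mean

def avg_rec_popularity(recommendations, interaction_counts):
    """Mean popularity (interaction count) of recommended items,
    pooled across all users' recommendation lists."""
    return mean(interaction_counts[item]
                for recs in recommendations.values()
                for item in recs)

# Toy data: two users and a catalog dominated by one very popular item.
counts = {"hit_song": 1000, "niche_album": 10, "indie_track": 5}
recs = {"user_a": ["hit_song", "niche_album"],
        "user_b": ["hit_song", "indie_track"]}
```

If this figure sits well above the catalog-wide mean popularity (here about 338), the recommender is over-serving popular items, which is the effect the snippet above worries LLM integration may amplify.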
- 2024: We introduce a new, extensive multidimensional quality metrics (MQM)-annotated dataset covering 11 language pairs in the biomedical domain. We use this dataset to investigate whether machine translation (MT) metrics which are fine-tuned on human-generated MT quality judgements are robust to domain shifts between training and inference. We find that fine-tuned metrics exhibit a substantial performance drop…