Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Leveraging large language models for multimodal search

Oriol Barbany Mayor, Michael Huang, Xinliang Zhu, Arnab Dhua

CVPR 2024 Workshop on Fine-Grained Visual Categorization

2024

Multimodal search has become increasingly important in providing users with a natural and effective way to ex-press their search intentions. Images offer fine-grained details of the desired products, while text allows for easily incorporating search modifications. However, some existing multimodal search systems are unreliable and fail to address simple queries. The problem becomes harder with the large

Computer vision
Less is more for improving automatic evaluation of factual consistency

Tong Wang, Ninad Kulkarni, Yanjun (Jane) Qi

NAACL 2024

2024

Assessing the factual consistency of automatically generated texts in relation to source context is crucial for developing reliable natural language generation applications. Recent literature proposes AlignScore which uses a unified alignment model to evaluate factual consistency and substantially outperforms previous methods across many benchmark tasks. In this paper, we take a closer look of datasets

Conversational AI
How lexical is bilingual lexicon induction?

Harsh Kohli, Helian Feng, Nicholas Dronen, Calvin McCarter, Sina Moeini, Ali Kebarighotbi

NAACL 2024

2024

In contemporary machine learning approaches to bilingual lexicon induction (BLI), a model learns a mapping between the embedding spaces of a language pair. Recently, the retrieve-and-rank approach to BLI has achieved state-of-the-art results on the task. However, the problem remains challenging in low-resource settings, due to the paucity of data. The task is complicated by factors such as lexical variation

Conversational AI
Enhancing contextual understanding in large language models through contrastive decoding

Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

NAACL 2024

2024

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or contextually unfaithful content. LLMs utilize two primary knowledge sources: 1) prior (parametric) knowledge from pretraining, and 2) contextual (non-parametric) knowledge

Conversational AI
Low-cost generation and evaluation of dictionary example sentences

Bill Cai, Clarence Ng, Daniel Tan, Shelvia Hotama

NAACL 2024

2024

Dictionary example sentences play an important role in illustrating word definitions and usage, but manually creating quality sentences is challenging. Prior works have demonstrated that language models can be trained to generate example sentences. However, they relied on costly customized models and word sense datasets for generation and evaluation of their work. Rapid advancements in foundational models

Conversational AI

Context-aware deep-learning method boosts Alexa dialogue system’s ability to recognize conversation topics by 35%

Behnam Hedayatnia

December 4, 2018

Method factors in the utterances that immediately preceded the target utterance and its classification as a “dialogue act”

Conversational AI
Varying speaking styles with neural text-to-speech

Trevor Wood, Tom Merritt

November 19, 2018

Amazon scientists have shown that our latest text-to-speech (TTS) system, which uses a generative neural network, can learn to employ a newscaster style from just a few hours of training data.

Conversational AI
Reducing Customer Friction through Skill Selection

Young-Bum Kim

October 31, 2018

This year, we’ve started to explore ways to make it easier for customers to find and engage with Alexa skills.

Conversational AI
Photo credit: Sharaf Maksumov / Shutterstock.com

Amazon helps launch workshop on automatic fact verification

Larry Hardesty

October 25, 2018

At the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Amazon researchers and their colleagues at the University of Sheffield and Imperial College London will host the first Workshop on Fact Extraction and Verification, which will explore how computer systems can learn to recognize false assertions online.

Search and information retrieval
How an Echo device could locate snaps, claps, and taps

Jun Yang

October 4, 2018

Parallel processing of microphone inputs and separate detectors for periodicity and dynamics improve performance.

Conversational AI
Identifying sounds in audio streams

Chieh-Chi Kao, Weiran Wang

October 2, 2018

On September 20, Amazon unveiled a host of new products and features, including Alexa Guard, a smart-home feature available on select Echo devices later this year. When activated, Alexa Guard can send a customer alerts if it detects the sound of glass breaking or of smoke or carbon monoxide alarms in the home.

Conversational AI

Conversational AI

Publications

Related content

Work with us