- Journal of Imaging, 2023: Several sign language datasets are available in the literature. Most of them are designed for sign language recognition and translation. This paper presents a new sign language dataset for automatic motion generation. This dataset includes phonemes for each sign (specified in HamNoSys, a transcription system developed at the University of Hamburg, Hamburg, Germany) and the corresponding motion information…
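To make the pairing of phonemes and motion concrete, here is a minimal sketch of what one record of such a dataset might look like. The field names, the placeholder transcription, the 137-feature pose vector, and the frame count are illustrative assumptions, not the paper's actual schema.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class SignEntry:
    """Hypothetical record pairing a sign's HamNoSys phonemes with its motion data."""
    gloss: str                 # sign identifier, e.g. "HOUSE"
    hamnosys: str              # HamNoSys transcription of the sign's phonemes (placeholder below)
    motion: List[List[float]]  # per-frame skeleton features (frames x pose_dim)

# Illustrative entry: 120 frames of a 137-dimensional pose vector.
entry = SignEntry(
    gloss="HOUSE",
    hamnosys="hamfist hamextfingeru ...",  # placeholder, not a real transcription
    motion=[[0.0] * 137 for _ in range(120)],
)
print(entry.gloss, len(entry.motion), len(entry.motion[0]))
```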
- MDPI Sensors Journal, 2023: This paper proposes, analyzes, and evaluates a deep learning architecture based on transformers for generating sign language motion from sign phonemes (represented using HamNoSys, a notation system developed at the University of Hamburg). The sign phonemes provide information about sign characteristics like hand configuration, localization, or movements. The use of sign phonemes is crucial for generating…
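A minimal sketch of the kind of phoneme-to-motion transformer described in the entry above, assuming PyTorch, a toy phoneme vocabulary, and an illustrative 137-dimensional pose vector per frame; none of these values or layer sizes come from the paper.

```python
import torch
import torch.nn as nn

class Phoneme2Motion(nn.Module):
    """Minimal encoder-decoder mapping HamNoSys phoneme tokens to skeleton frames."""
    def __init__(self, vocab_size=200, d_model=256, pose_dim=137):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)    # phoneme token embeddings
        self.in_pose = nn.Linear(pose_dim, d_model)       # project previous frames into the model
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=4,
            num_encoder_layers=3, num_decoder_layers=3,
            batch_first=True,
        )
        self.out_pose = nn.Linear(d_model, pose_dim)      # predict the next pose frame

    def forward(self, phonemes, prev_frames):
        # phonemes: (batch, src_len) token ids; prev_frames: (batch, tgt_len, pose_dim)
        src = self.embed(phonemes)
        tgt = self.in_pose(prev_frames)
        mask = self.transformer.generate_square_subsequent_mask(tgt.size(1))
        out = self.transformer(src, tgt, tgt_mask=mask)   # causal mask on the motion side
        return self.out_pose(out)

model = Phoneme2Motion()
frames = model(torch.randint(0, 200, (2, 12)), torch.zeros(2, 30, 137))
print(frames.shape)  # torch.Size([2, 30, 137])
```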
- NeurIPS 2023 Workshop on Robustness of Zero/Few-shot Learning in Foundation Models (R0-FoMo): With the recent surge of language models in different applications, attention to safety and robustness of these models has gained significant importance. Here we introduce a joint framework in which we simultaneously probe and improve the robustness of a black-box target model via adversarial prompting and belief augmentation using iterative feedback loops. This framework utilizes an automated red teaming…
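The excerpt above does not spell out the loop itself, but a schematic of an iterative probe-and-harden cycle might look like the sketch below; `target`, `attacker`, and `judge` are hypothetical interfaces, and "belief augmentation" is simplified here to growing a safety preamble.

```python
from typing import Callable, List

def red_team_loop(
    target: Callable[[str], str],         # black-box target model (hypothetical interface)
    attacker: Callable[[List[str]], str], # proposes a new adversarial prompt from past failures
    judge: Callable[[str, str], float],   # scores how unsafe a (prompt, response) pair is
    rounds: int = 5,
    threshold: float = 0.5,
) -> str:
    """Iteratively probe the target and grow a safety preamble (simplified belief augmentation)."""
    preamble, failures = "", []
    for _ in range(rounds):
        prompt = attacker(failures)              # adversarial probing
        response = target(preamble + prompt)     # query the augmented target
        if judge(prompt, response) > threshold:  # unsafe: record it and strengthen the preamble
            failures.append(prompt)
            preamble += f"Refuse requests similar to: {prompt}\n"
    return preamble                              # augmented "belief" prefix for the target
```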
- NeurIPS 2023 Workshop on Efficient Natural Language and Speech Processing (ENLSP-III): While data selection methods have been studied extensively in active learning, data pruning, and data augmentation settings, there is little evidence for the efficacy of these methods in industry-scale settings, particularly in low-resource languages. Our work presents ways of assessing prospective training examples in those settings for their "usefulness" or "difficulty". We also demonstrate how these…
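One common way to operationalize "difficulty" is per-example loss under a reference model; the sketch below assumes that proxy, since the paper's actual scoring criteria are not shown in this excerpt.

```python
import math
from typing import Callable, List, Tuple

def rank_by_difficulty(
    examples: List[str],
    nll: Callable[[str], float],  # per-example negative log-likelihood under a reference model
) -> List[Tuple[str, float]]:
    """Rank candidate training examples from most to least 'difficult'."""
    scored = [(ex, nll(ex)) for ex in examples]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Usage with a toy scorer: longer sentences count as harder.
toy_nll = lambda s: math.log(1 + len(s.split()))
print(rank_by_difficulty(["short", "a much longer and rarer sentence"], toy_nll))
```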
- NeurIPS 2023 Workshop on I Can't Believe It's Not Better (ICBINB): Failure Modes in the Age of Foundation Models: Numerous Natural Language Processing (NLP) tasks require precisely labeled data to ensure effective model training and achieve optimal performance. However, data annotation is marked by substantial costs and time requirements, especially when requiring specialized domain expertise or annotating a large number of samples. In this study, we investigate the feasibility of employing large language models (LLMs)…
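A minimal sketch of LLM-based annotation with a constrained label set; the `llm` callable, the prompt wording, and the label set are placeholders rather than the study's setup.

```python
from typing import Callable, List

LABELS = ["positive", "negative", "neutral"]  # illustrative label set

def annotate(texts: List[str], llm: Callable[[str], str]) -> List[str]:
    """Ask a (hypothetical) LLM interface to label each sample, with a fallback label."""
    labels = []
    for text in texts:
        prompt = (
            "Classify the sentiment of the text as one of "
            f"{', '.join(LABELS)}.\nText: {text}\nLabel:"
        )
        answer = llm(prompt).strip().lower()
        labels.append(answer if answer in LABELS else "neutral")  # guard against free-form output
    return labels
```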
Related content
- July 03, 2023: With little training data and no mapping of speech to phonemes, Amazon researchers used voice conversion to generate Irish-accented training data in Alexa's own voice.
- June 26, 2023: How phonetically blended results (PBR) help ensure customers find the content they were actually asking for.
- June 09, 2023: In a top-3% paper at ICASSP, Amazon researchers adapt graph-based label propagation to improve speech recognition on underrepresented pronunciations.
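As a rough illustration of graph-based label propagation, the toy below uses scikit-learn's LabelPropagation as a stand-in; the features, graph construction, and labels are invented and do not reflect the paper's method.

```python
import numpy as np
from sklearn.semi_supervised import LabelPropagation

# Toy stand-in for pronunciation embeddings: a few labeled points, many unlabeled (-1).
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 8))   # e.g. acoustic/phonetic embeddings
y = np.full(20, -1)            # -1 marks unlabeled examples
y[:4] = [0, 0, 1, 1]           # a handful of seed pronunciation labels

model = LabelPropagation(kernel="rbf", gamma=0.5)
model.fit(X, y)
print(model.transduction_)     # labels propagated to the unlabeled points
```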
- June 07, 2023: Team earned $500,000 for its performance in a challenge focused on advancing next-generation virtual assistants that help humans complete real-world tasks by continuously learning.
- June 07, 2023: Combining semi-supervised learning, data augmentation, and reinforcement learning, using rewards based on implicit customer feedback and natural-language-understanding semantics, reduces word error rate by more than 10%.
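The blurb does not give the training objective, but one simple way to fold feedback-based rewards into semi-supervised training is to weight each hypothesis's loss by its reward; the sketch below shows only that weighting idea, with made-up tensors standing in for a real speech recognition model.

```python
import torch

# Toy per-utterance cross-entropy losses from pseudo-labeled (semi-supervised) data.
losses = torch.tensor([2.3, 1.1, 0.7, 3.0])
# Rewards in [0, 1] derived from implicit customer feedback / NLU semantics (illustrative values).
rewards = torch.tensor([0.9, 0.2, 1.0, 0.5])

# Reward-weighted objective: hypotheses with higher rewards contribute more to the update.
weighted_loss = (rewards * losses).mean()
print(weighted_loss)
```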
- June 02, 2023: Topics such as code generation, commonsense reasoning, and self-learning complement the usual focus on speech recognition and acoustic-event classification.