- The transformer is a powerful data-modeling framework responsible for remarkable performance on a wide range of tasks. However, transformers are limited in terms of scalability, as processing long-sequence data is suboptimal and inefficient. To this end, we introduce BLRP (Bidirectional Long-Range Parser), a novel and versatile attention mechanism designed to increase performance and efficiency on
- EACL 2024: Users of AI-based virtual assistants and search systems encounter challenges in articulating their intents while seeking information on unfamiliar topics, possibly due to the complexity of the user's intent or the lack of meta-information on the topic. We posit that an iterative suggested question-answering (SQA) conversation can improve the trade-off between the satisfaction of the user's intent while keeping
- 2024: Vision-Language (VL) models have gained significant research focus, enabling remarkable advances in multimodal reasoning. These architectures typically comprise a vision encoder, a Large Language Model (LLM), and a projection module that aligns visual features with the LLM's representation space (a minimal sketch of this composition appears after this list). Despite their success, a critical limitation persists: the vision encoding process remains decoupled from user
- 2024: Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities, where an LLM makes predictions for a given test input together with a few input-output pairs (demonstrations). Nevertheless, the inclusion of demonstrations leads to a quadratic increase in the computational overhead of the self-attention mechanism (a back-of-the-envelope cost sketch appears after this list). Existing solutions attempt to distill lengthy demonstrations
- Information retrieval (IR) is a pivotal component in various applications. Recent advances in machine learning (ML) have enabled the integration of ML algorithms into IR, particularly in ranking systems. While there is a plethora of research on the robustness of ML-based ranking systems, these studies largely neglect commercial e-commerce systems and fail to establish a connection between real-world and
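To make the vision encoder / projection module / LLM composition mentioned in the Vision-Language item above concrete, here is a minimal, illustrative PyTorch sketch. The module sizes, the single transformer layer standing in for the LLM, and all names (ToyVLModel, projector, and so on) are assumptions chosen for illustration; they do not reflect the architecture of any specific work listed here.

```python
import torch
import torch.nn as nn

# Minimal sketch of a typical VL composition: vision encoder -> projection
# into the LLM's representation space -> LLM over visual + text tokens.
# All sizes and module choices below are illustrative assumptions.
class ToyVLModel(nn.Module):
    def __init__(self, vision_dim=768, llm_dim=1024, vocab_size=32000):
        super().__init__()
        # Stand-in vision encoder: maps flattened image patches to visual features.
        self.vision_encoder = nn.Sequential(
            nn.Linear(3 * 16 * 16, vision_dim), nn.GELU()
        )
        # Projection module: aligns visual features with the LLM's hidden size.
        self.projector = nn.Linear(vision_dim, llm_dim)
        # Stand-in "LLM": one transformer layer over concatenated visual and text tokens.
        self.text_embed = nn.Embedding(vocab_size, llm_dim)
        self.llm_block = nn.TransformerEncoderLayer(
            d_model=llm_dim, nhead=8, batch_first=True
        )
        self.lm_head = nn.Linear(llm_dim, vocab_size)

    def forward(self, patches, token_ids):
        visual = self.projector(self.vision_encoder(patches))  # (B, P, llm_dim)
        text = self.text_embed(token_ids)                      # (B, T, llm_dim)
        fused = torch.cat([visual, text], dim=1)                # prepend visual tokens
        return self.lm_head(self.llm_block(fused))

# Usage: 4 images as 196 flattened 16x16 RGB patches plus a 12-token prompt.
model = ToyVLModel()
logits = model(torch.randn(4, 196, 3 * 16 * 16), torch.randint(0, 32000, (4, 12)))
print(logits.shape)  # torch.Size([4, 208, 32000])
```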
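The in-context-learning item above notes that prepending demonstrations makes the self-attention cost grow quadratically with prompt length. The short Python sketch below illustrates that scaling; the token counts, hidden width, and the attention_cost helper are illustrative assumptions, not measurements from any listed work.

```python
# Back-of-the-envelope sketch of how self-attention cost grows as in-context
# demonstrations are prepended to the prompt. Token counts and model width
# are illustrative assumptions.
def attention_cost(seq_len: int, hidden_dim: int = 1024) -> int:
    """Approximate FLOPs of one self-attention layer: computing the n x n
    score matrix plus the weighted sum over values, each ~ n^2 * d."""
    return 2 * seq_len * seq_len * hidden_dim

query_tokens = 50   # the test input alone
demo_tokens = 200   # tokens per input-output demonstration (assumed)

for num_demos in (0, 2, 4, 8):
    n = query_tokens + num_demos * demo_tokens
    ratio = attention_cost(n) / attention_cost(query_tokens)
    print(f"{num_demos} demos -> {n:5d} tokens -> {ratio:7.1f}x the attention cost")
```

Because the cost is quadratic in sequence length, even a handful of 200-token demonstrations multiplies the attention overhead by orders of magnitude relative to the bare test input, which is the motivation for distilling or compressing demonstrations.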
Related content
- July 03, 2023: With little training data and no mapping of speech to phonemes, Amazon researchers used voice conversion to generate Irish-accented training data in Alexa's own voice.
- June 26, 2023: How phonetically blended results (PBR) help ensure customers find the content they were actually asking for.
- June 09, 2023: In a top-3% paper at ICASSP, Amazon researchers adapt graph-based label propagation to improve speech recognition on underrepresented pronunciations.
- June 07, 2023: Team earned $500,000 for its performance in a challenge focused on advancing next-generation virtual assistants that help humans complete real-world tasks by continuously learning.
- June 07, 2023: Combining semi-supervised learning, data augmentation, and reinforcement learning using rewards based on implicit customer feedback and natural-language-understanding semantics reduces word error rate by more than 10%.
- June 02, 2023: Topics such as code generation, commonsense reasoning, and self-learning complement the usual focus on speech recognition and acoustic-event classification.