Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Generating factually consistent sport highlights narrations

Noah Sarfati, Ido Yerushalmy, Michael Chertok, Joseph Keller

ACM MMSports 2023

2023

Sports highlights are an important form of media for fans worldwide, as they provide short videos that capture key moments from games, often accompanied by the original commentaries of the game’s announcers. However, traditional forms of presenting sports highlights have limitations in conveying the complexity and nuance of the game. In recent years, the use of Large Language Models (LLMs) for natural language

Conversational AI
Alexa Arena: A user-centric interactive platform for embodied AI

Qiaozi (QZ) Gao, Govind Thattai, Suhaila Shakiah, Xiaofeng Gao, Shreyas Pansare, Vasu Sharma, Gaurav Sukhatme, Hangjie Shi, Bofei Yang, Desheng Zhang, Lucy Hu, Karthika Arumugam, Shui Hu, Matthew Wen, Dinakar Guthy, Cadence Chung, Rohan Khanna, Osman Ipek, Leslie Ball, Kate Bland, Heather Rocker, Michael Johnston, Reza Ghanadan, Dilek Hakkani-Tür, Prem Natarajan

NeurIPS 2023

2023

We introduce Alexa Arena, a user-centric simulation platform to facilitate research in building assistive conversational embodied agents. Alexa Arena features multi-room layouts and an abundance of interactable objects. With user-friendly graphics and control mechanisms, the platform supports the development of gamified robotic tasks readily accessible to general human users, allowing high-efficiency data

Related: Amazon releases code, datasets for developing embodied AI agents

Conversational AI
CL-QR: Cross-lingual enhanced query reformulation for multi-lingual conversational AI agents

Zhongkai Sun, Zhengyang Zhao, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Xing Fan, Wei (Sawyer) Shen, Chenlei (Edward) Guo

EMNLP 2023

2023

The growing popularity of conversational AI agents such as Alexa, Google Assistant, and Siri relies on accurate spoken-language comprehension. The query reformulation (QR) method, which reformulates defective user queries, has been broadly adopted to mitigate the challenges posed by understanding the user’s intent from an imperfect spoken recognition result. However, due to the scarcity of non- English

Conversational AI
Improving contextual query rewrite for conversational AI agents through user-preference feedback learning

Zhongkai Sun, Yingxue Zhou, Jie Hao, Xing Fan, Yanbin Lu, Chengyuan Ma, Wei (Sawyer) Shen, Chenlei (Edward) Guo

EMNLP 2023

2023

Contextual query rewriting (CQR) is a crucial component in Conversational AI agents, leveraging the contextual information from previous user-agent conversations to improve the comprehension of current user intent. However, traditional CQR methods often concentrate on supervised fine-tuning only, neglecting the opportunities to learn from user feedback to align with user preferences. Inspired by recent

Conversational AI
Multimodal embodied plan prediction augmented with synthetic embodied dialogue

Aishwarya Padmakumar, Mert Inan, Spandana Gella, Patrick Lange, Dilek Hakkani-Tür

EMNLP 2023

2023

Embodied task completion is a challenge where an agent in a simulated environment must predict environment actions to complete tasks based on natural language instructions and egocentric visual observations. We propose a variant of this problem where the agent predicts actions at a higher level of abstraction called a plan which more directly tests language understanding and reasoning. We show that multimodal

Conversational AI

Pronunciation detection for Alexa’s new English-learning experience

Daniel Zhang, Animish Sivaramakrishnan

July 12, 2023

Data augmentation, novel loss functions, and weakly supervised training enable a state-of-the art model for recognizing mispronunciations.

Conversational AI
A quick guide to Amazon's 65-plus papers at this year's ACL

Staff writer

July 10, 2023

Familiar topics such as question answering and natural-language understanding remain well represented, but a new concentration on language modeling and multimodal models reflect the spread of generative AI.

Conversational AI
Do large language models really need all those layers?

Karthik Gopalakrishnan

July 09, 2023

Finding that 70% of attention heads and 20% of feed-forward networks can be excised with minimal effect on in-context learning suggests that large language models are undertrained.

Conversational AI
ACL 2023: Computational linguistics in the age of large language models

Larry Hardesty

July 07, 2023

Amazon’s Yang Liu, general chair of this year’s meeting of the Association for Computational Linguistics, on the road ahead for LLMs.

Conversational AI
Alexa Skills Inventor boosts AI education

Staff writer

July 06, 2023

The program exposes students to computer science as they create their own Alexa skills.

Conversational AI
USC

“Who we are shapes what we say and how we say it”

Staff writer

July 05, 2023

Amazon Research Award recipient Shrikanth Narayanan is on a mission to make inclusive human-AI conversational experiences.

Conversational AI

Conversational AI

Publications

Related content

Work with us