- 2024: With the rapid development of large language models (LLMs), aligning LLMs with human values and societal norms to ensure their reliability and safety has become crucial. Reinforcement learning from human feedback (RLHF) and Constitutional AI (CAI) have been proposed for LLM alignment. However, these methods require either heavy human annotations or explicitly pre-defined constitutions, which are labor-intensive …
- 2024: Developing a unified model that can effectively harness heterogeneous resources and respond to a wide range of personalized needs has been a longstanding community aspiration. Our daily choices, especially in domains like fashion and retail, are substantially shaped by multimodal data, such as pictures and textual descriptions. The vision and language modalities not only offer intuitive guidance but also …
- ConEC: Earnings call dataset with real-world contexts for benchmarking contextual speech recognition (2024): Knowing the particular context associated with a conversation can help improve the performance of an automatic speech recognition (ASR) system. For example, if we are provided with a list of in-context words or phrases, such as the speaker's contacts or recent song playlists, during inference, we can bias the recognition process towards this list (see the biasing sketch after this list). There are many works addressing contextual ASR; however …
- 2024: We present BYOKG, a universal question-answering (QA) system that can operate on any knowledge graph (KG), requires no human-annotated training data, and can be ready to use within a day, attributes that are out of scope for current KGQA systems. BYOKG draws inspiration from the remarkable ability of humans to comprehend information present in an unseen KG through exploration (see the random-walk sketch after this list): starting at random nodes, inspecting …
- 2024: Most multimodal large language models (MLLMs) learn language-to-object grounding through causal language modeling, where grounded objects are captured by bounding boxes encoded as sequences of location tokens (see the location-token sketch after this list). This paradigm lacks the pixel-level representations that are important for fine-grained visual understanding and diagnosis. In this work, we introduce GROUNDHOG, an MLLM developed by grounding Large Language …
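As a rough illustration of the list-based biasing idea the ConEC abstract describes, here is a minimal Python sketch that rescores an ASR decoder's n-best hypotheses toward a list of context phrases. The function name, bonus weight, and rescoring scheme are illustrative assumptions, not the paper's method.

```python
# A minimal sketch of list-based contextual biasing for ASR, assuming a
# generic n-best rescoring setup. The bonus weight is a hypothetical value.

def bias_nbest(nbest, context_phrases, bonus=2.0):
    """Rescore (text, log_score) pairs, boosting matches to a context list."""
    rescored = []
    for text, score in nbest:
        lowered = text.lower()
        # Count how many context phrases appear in this hypothesis.
        matches = sum(1 for p in context_phrases if p.lower() in lowered)
        rescored.append((text, score + bonus * matches))
    # Return hypotheses sorted by the biased score, best first.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)

nbest = [("play songs by the beetles", -4.1),
         ("play songs by the beatles", -4.3)]
print(bias_nbest(nbest, ["The Beatles"]))  # the biased hypothesis now ranks first
```

The same idea is usually applied inside beam search rather than as a post-hoc rerank, but the rescoring view keeps the sketch self-contained.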
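To make the BYOKG entry's exploration idea concrete, the following toy sketch samples relation paths by random walks from random nodes over a small adjacency-list graph. The graph format, walk policy, and all names here are assumptions for illustration, not BYOKG's implementation.

```python
# A toy sketch of exploration-based KG familiarization: start at random
# nodes and follow outgoing edges to collect relation paths.

import random

def explore(kg, num_walks=3, max_hops=2, seed=0):
    """Sample relation paths by random walks over a {head: [(rel, tail)]} graph."""
    rng = random.Random(seed)
    paths = []
    nodes = list(kg)
    for _ in range(num_walks):
        node, path = rng.choice(nodes), []
        for _ in range(max_hops):
            edges = kg.get(node)
            if not edges:  # dead end: node has no outgoing edges
                break
            rel, tail = rng.choice(edges)
            path.append((node, rel, tail))
            node = tail
        if path:
            paths.append(path)
    return paths

toy_kg = {"Ada_Lovelace": [("field", "Mathematics"),
                           ("collaborator", "Charles_Babbage")],
          "Charles_Babbage": [("invented", "Analytical_Engine")]}
for p in explore(toy_kg):
    print(p)
```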
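The GROUNDHOG entry contrasts pixel-level grounding with the common location-token paradigm. As a sketch of that baseline paradigm (the bin count and token format are assumptions, not GROUNDHOG's scheme), the snippet below quantizes normalized box coordinates into discrete tokens of the form <loc_k>.

```python
# A minimal sketch of encoding a bounding box as discrete location tokens,
# the paradigm the abstract contrasts with pixel-level grounding.

def box_to_tokens(box, image_w, image_h, bins=1000):
    """Map an (x0, y0, x1, y1) pixel box to four discrete location tokens."""
    x0, y0, x1, y1 = box
    coords = (x0 / image_w, y0 / image_h, x1 / image_w, y1 / image_h)
    # Clamp each coordinate to [0, 1] and quantize into one of `bins` bins.
    return [f"<loc_{min(bins - 1, int(max(0.0, min(1.0, c)) * bins))}>"
            for c in coords]

print(box_to_tokens((64, 32, 512, 480), image_w=640, image_h=480))
# ['<loc_100>', '<loc_66>', '<loc_800>', '<loc_999>']
```

A causal LM trained on such sequences predicts boxes token by token, which is why the abstract notes the paradigm cannot express pixel-level masks.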
Related content
- October 03, 2023: Team TWIZ from NOVA School of Science and Technology awarded $500,000 prize for first-place overall performance.
- September 20, 2023: Leveraging large language models will make interactions with Alexa more natural and engaging.
- September 12, 2023: GauchoChat wins $250,000 first-place prize in the overall competition; Chirpy Cardinal earns $250,000 for first place in the scientific-innovation category.
- August 28, 2023: AWS service enables machine learning innovation on a robust foundation.
- August 23, 2023: Senior principal scientist Jasha Droppo on the shared architectures of large language models and spectrum quantization text-to-speech models, and other convergences between the two fields.
- August 18, 2023: Speech recognition predominates, but Amazon's research takes in data representation, dialogue management, question answering, and more.