- 2025: Large language models (LLMs) have achieved remarkable performance on various natural language tasks. However, they are trained on static corpora, and their knowledge can quickly become outdated in a fast-changing world. This motivates the development of knowledge editing methods designed to update specific knowledge in LLMs without affecting unrelated knowledge. To make selective edits, previous efforts often
- NAACL Findings 2025: The next token prediction loss is the dominant self-supervised training objective for large language models and has achieved promising results in a variety of downstream tasks. However, upon closer investigation of this objective, we find that it lacks an understanding of sequence-level signals, leading to a mismatch between the training and inference processes. To bridge this gap, we introduce a contrastive
- ICSE 2025: In this study, we address the issue of API hallucinations in various software engineering contexts. We introduce CloudAPIBench, a new benchmark designed to measure API hallucination occurrences. CloudAPIBench also provides annotations for the frequencies of API occurrences in the public domain, allowing us to study API hallucinations at various frequency levels (see the sketch after this list). Our findings reveal that Code LLMs struggle with
- 2025: Reasoning and linguistic skills form the cornerstone of human intelligence, facilitating problem-solving and decision-making. Recent advances in Large Language Models (LLMs) have led to impressive linguistic capabilities and emergent reasoning behaviors, fueling widespread adoption across application domains. However, LLMs still struggle with complex reasoning tasks, highlighting their systemic limitations
- 2025: Large language models (LLMs) have demonstrated remarkable capabilities in handling complex dialogue tasks without requiring use-case-specific fine-tuning. However, analyzing live dialogues in real time necessitates low-latency processing systems, making it impractical to deploy models with billions of parameters due to latency constraints. As a result, practitioners often prefer smaller models with millions
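Below is a minimal sketch of how API-hallucination rates might be measured at different API-frequency levels, in the spirit of the CloudAPIBench description above. The task structure, API index, frequency counts, and bucket thresholds are illustrative assumptions, not the benchmark's actual data or interface.

```python
# Hypothetical sketch: measure how often a code model invokes non-existent
# ("hallucinated") APIs, bucketed by how frequently the *target* API appears
# in public code. All names and thresholds are assumptions for illustration.
import re
from collections import defaultdict
from dataclasses import dataclass

# Assumed reference index of valid, fully qualified API names.
VALID_APIS = {"boto3.client", "boto3.session.Session", "some_sdk.rare_call"}

@dataclass
class Task:
    prompt: str            # code context given to the model
    target_api: str        # API the completion is expected to use
    target_frequency: int  # assumed public-domain occurrence count of that API

def frequency_bucket(count: int) -> str:
    """Map a raw occurrence count to a coarse frequency level."""
    return "high" if count >= 10_000 else "medium" if count >= 1_000 else "low"

def extract_api_calls(completion: str) -> list[str]:
    """Very rough extraction of dotted call targets from generated code."""
    return re.findall(r"\b[\w.]+(?=\()", completion)

def hallucination_rates(tasks: list[Task], completions: list[str]) -> dict[str, float]:
    """Per frequency bucket, the fraction of tasks whose completion calls an invalid API."""
    totals, hallucinated = defaultdict(int), defaultdict(int)
    for task, code in zip(tasks, completions):
        bucket = frequency_bucket(task.target_frequency)
        totals[bucket] += 1
        if any(api not in VALID_APIS for api in extract_api_calls(code)):
            hallucinated[bucket] += 1
    return {b: hallucinated[b] / totals[b] for b in totals}

if __name__ == "__main__":
    tasks = [Task("s3 = ", "boto3.client", 50_000),
             Task("x = ", "some_sdk.rare_call", 12)]
    completions = ['s3 = boto3.client("s3")', "x = some_sdk.made_up_call()"]
    print(hallucination_rates(tasks, completions))  # e.g. {'high': 0.0, 'low': 1.0}
```

The key design choice in this sketch is bucketing by the target API's public-domain frequency, which lets low-frequency (long-tail) APIs be scored separately from common ones.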
Related content
- March 27, 2025: Training separate models on different datasets and then merging them reduces computational costs by as much as 91%.
- March 10, 2025: Inaugural global university competition focused on advancing secure, trusted AI-assisted software development.
- February 20, 2025: Using large language models to generate training data and updating models through both fine-tuning and reinforcement learning improves the success rate of code generation by 39%.
- February 06, 2025: Novel training procedure and decoding mechanism enable model to outperform much larger foundation model prompted to perform the same task.
- December 11, 2024: LLM-augmented clustering enables QualIT to outperform other topic-modeling methods in both topic coherence and topic diversity.
- December 09, 2024: The Amazon AGI SF Lab will focus on developing new foundational capabilities for enabling useful AI agents.