Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

KGQA without retraining

Nick McKenna, Priyanka Sen

ACL 2023 Workshop on SustaiNLP

2023

Popular models for Knowledge Graph Question Answering (KGQA), including semantic parsing and End-to-End (E2E) models, decode into a constrained space of KG relations. Al-though E2E models accommodate novel entities at test-time, this constraint means they cannot access novel relations, requiring expensive and time-consuming retraining whenever a new relation is added to the KG. We propose KG-Flex, a new

Conversational AI
Knowledge-augmented language model prompting for zero-shot knowledge graph question answering

Jinheon Baek, Alham Fikri Aji, Amir Saffari

ACL 2023 Workshop on Matching Entities

2023

Large Language Models (LLMs) are capable of performing zero-shot closed-book question answering tasks, based on their internal knowl-edge stored in parameters during pre-training. However, such internalized knowledge might be insufficient and incorrect, which could lead LLMs to generate factually wrong answers. Furthermore, fine-tuning LLMs to update their knowledge is expensive. To this end, we pro-pose

Conversational AI
Towards building a robust toxicity predictor

Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Yanjun (Jane) Qi

ACL 2023

2023

Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts. This paper presents a novel adversarial attack, ToxicTrap, introducing small word-level perturbations to fool SOTA text classifiers to predict toxic text samples as benign. ToxicTrap exploits greedy based search strategies to enable fast and

Conversational AI
Recipes for sequential pre-training of multilingual encoder and seq2seq models

Saleh Soltan, Andy Rosenbaum, Tobias Falke, Qin Lu, Anna Rumshisky, Wael Hamza

ACL Findings 2023, ACL 2023 Workshop on SustaiNLP

2023

Pre-trained encoder-only and sequence-to-sequence (seq2seq) models each have advantages; however, training both model types from scratch is computationally expensive. We explore recipes to improve pre-training efficiency by initializing one model from the other. (1) Extracting the encoder from a seq2seq model, we show it underperforms a Masked Language Modeling (MLM) encoder, particularly on sequence labeling

Conversational AI
Improving low resource speech translation with data augmentation and ensemble strategies

Akshaya Vishnu Kudlu Shanbhogue, Ran Xue, Soumya Saha, Daniel Zhang, Ashwin Ganesan

IWSLT 2023

2023

This paper describes the speech translation system submitted as part of the IWSLT 2023 shared task on low resource speech translation. The low resource task aids in building models for language pairs where the training corpus is limited. In this paper, we focus on two language pairs, namely, Tamasheq-French (Tmh→Fra) and Marathi-Hindi (Mr→Hi) and implement a speech translation system that is unconstrained

Conversational AI

Credit: Glynis Condon

Automatically generating text from structured data

Isabel Groves

April 07, 2021

Technique that lets devices convey information in natural language improves on state of the art.

Conversational AI
Alexa: The science must go on

Rohit Prasad

March 31, 2021

Throughout the pandemic, the Alexa team has continued to invent on behalf of our customers.

Conversational AI
ECIR 2021: Where information retrieval becomes conversation

Larry Hardesty

March 26, 2021

In the future, says Amazon Scholar Emine Yilmaz, users will interact with computers to identify just the information they need, rather than scrolling through long lists of results.

Search and information retrieval
Credit: Glynis Condon

New dataset, metrics enable evaluation of bias in language models

Jwala Dhamala

March 24, 2021

Human-evaluation studies validate metrics, and experiments show evidence of bias in popular language models.

Conversational AI
Establishing a new standard in answer selection precision

Alessandro Moschitti

March 19, 2021

A model that uses both local and global context improves on the state of the art by 6% and 11% on two benchmark datasets.

Search and information retrieval
Credit: Emma Waldron Trammell

How one intern’s research had real-world impact for Twitch moderators

Staff writer

March 16, 2021

Amanda Cullen, a PhD candidate in informatics at the University of California, Irvine, wanted to do work that had an impact outside of academia — she found an ideal opportunity at Twitch.

Machine learning

Conversational AI

Publications

Related content

Work with us