Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

MEND: Meta demonstration distillation for efficient and effective in-context learning

Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei (Edward) Guo

ICLR 2024

2024

Large Language models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities, where a LLM makes predictions for a given test input together with a few input-output pairs (demonstrations). Nevertheless, the inclusion of demonstrations leads to a quadratic increase in the computational overhead of the self-attention mechanism. Existing solutions attempt to distill lengthy demonstrations

Conversational AI
Towards robustness analysis of e-commerce ranking system

Ningfei Wang, Yupin Huang, Han Cheng, Jiri Gesi, Xiaojie Wang, Vivek Mittal

The Web Conference 2024

2024

Information retrieval (IR) is a pivotal component in various applications. Recent advances in machine learning (ML) have enabled the integration of ML algorithms into IR, particularly in ranking systems. While there is a plethora of research on the robustness of ML-based ranking systems, these studies largely neglect commercial e-commerce systems and fail to establish a connection between real-world and

Conversational AI
DeepMMATE: Deep learning based multimodal architecture for audit taxability classification with XAI

Harish Y V S

WSDM 2024

2024

Review of non-taxable products is an important internal audit which is carried out by majority of e-commerce stakeholders. This process usually cross checks the initial taxability assignments to avoid any unnecessary penalties incurred to the companies during the actual audits by the respective state compliance teams/tax departments. In order to handle millions of products sold online on e-commerce websites

Conversational AI
Harnessing the power of LLMs in practice: A survey on ChatGPT and beyond

Jingfeng Yang, Haongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Shaochen Zhong, Bing Yin, Xia Hu

TKDD 2024

2024

This paper presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream natural language processing (NLP) tasks. We provide discussions and insights into the usage of LLMs from the perspectives of models, data, and downstream tasks. Firstly, we offer an introduction and brief summary of current language models. Then, we discuss

Conversational AI
Leveraging confidence models for identifying challenging data subgroups in speech models

Alkis Koudounas, Eliana Pastor, Vittorio Mazzia, Manuel Giollo, Thomas Gueudre, Elisa Reale, Giuseppe Attanasio, Luca Cagliero, Sandro Cumani, Luca de Alfaro, Elena Baralis, Daniele Amberti

ICASSP 2024

2024

State-of-the-art speech models may exhibit suboptimal performance in specific population subgroups. Detecting these challenging subgroups is crucial to enhance model robustness and fairness. Traditional methods for subgroup identification typically rely on demographic information such as age, gender, and origin. However, collecting such sensitive data at deployment time can be impractical or unfeasible

Conversational AI

Teaching computers to answer complex questions

Abdalghani Abujabal

July 31, 2019

Computerized question-answering systems usually take one of two approaches. Either they do a text search and try to infer the semantic relationships between entities named in the text, or they explore a hand-curated knowledge graph, a data structure that directly encodes relationships among entities.

Search and information retrieval
Image: Getty Images

Bringing the Power of Neural Networks to the Problem of Search

Kai Hui

July 22, 2019

Using machine learning to train information retrieval models — such as Internet search engines — is difficult because it requires so much manually annotated data. Of course, training most machine learning systems requires manually annotated data, but because information retrieval models must handle such a wide variety of queries, they require a lot of data. Consequently, most information retrieval systems rely primarily on mechanisms other than machine learning.

Search and information retrieval
Amazon Mentors Help UMass Graduate Students Make Concrete Advances on Vital Machine Learning Problems

Larry Hardesty

June 27, 2019

Earlier this month, Varun Sharma and Akshit Tyagi, two master’s students from the University of Massachusetts Amherst, began summer internships at Amazon, where, like many other scientists in training, they will be working on Alexa’s spoken-language-understanding systems.

Conversational AI
Active learning: Algorithmically selecting training data to improve Alexa’s natural-language understanding

Stanislav Peshterliev

June 13, 2019

Alexa’s ability to respond to customer requests is largely the result of machine learning models trained on annotated data. The models are fed sample texts such as “Play the Prince song 1999” or “Play River by Joni Mitchell”. In each text, labels are attached to particular words — SongName for “1999” and “River”, for instance, and ArtistName for Prince and Joni Mitchell. By analyzing annotated data, the system learns to classify unannotated data on its own.

Conversational AI
Adapting Alexa to regional language variations

Young-Bum Kim

June 11, 2019

As Alexa expands into new countries, she usually has to be trained on new languages. But sometimes, she has to be re-trained on languages she’s already learned. British English, American English, and Indian English, for instance, are different enough that for each of them, we trained a new machine learning model from scratch.

Conversational AI
Animation by O’Reilly Science Art

Teaching Alexa to follow conversations

Arpit Gupta

June 06, 2019

New approach to reference resolution rewrites queries to clarify ambiguous references.

Conversational AI

Conversational AI

Publications

Related content

Work with us