Machine learning

Developing algorithms and statistical models that computer systems use to perform tasks without explicit instructions, relying on patterns and inference instead.

Precise model benchmarking with only a few observations

Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort

EMNLP 2024

2024

How can we precisely estimate a large language model’s (LLM) accuracy on questions belonging to a specific topic within a larger question-answering dataset? The standard direct estimator, which averages the model’s accuracy on the questions in each subgroup, may exhibit high variance for subgroups (topics) with small sample sizes. Synthetic regression modeling, which leverages the model’s accuracy on questions

Machine learning
Dancing in chains: Reconciling instruction following and faithfulness in language models

Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang

EMNLP 2024

2024

Modern language models (LMs) need to follow human instructions while being faithful; yet, they often fail to achieve both. Here, we provide concrete evidence of a trade-off between instruction following (i.e., follow open-ended instructions) and faithfulness (i.e., ground responses in given context) when training LMs with these objectives. For instance, fine-tuning LLaMA-7B on instruction following datasets

Conversational AI
Optimal design for human preference elicitation

Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton

NeurIPS 2024

2024

Learning of preference models from human feedback has been central to recent advances in artificial intelligence. Motivated by the cost of obtaining high-quality human annotations, we study efficient human preference elicitation for learning preference models. The key idea in our work is to generalize optimal designs, a methodology for computing optimal information-gathering policies, to questions with

Conversational AI
MARCO: Multi-agent real-time chat orchestration

Anubhav Shrimal, Stanley Kanagaraj, Kriti Biswas, Swarnalatha Raghuraman, Anish Nediyanchath, Yi Zhang, Promod Yenigalla

EMNLP 2024

2024

Large language model advancements have enabled the development of multi-agent frameworks to tackle complex, real-world problems, such as automating tasks that require interactions with diverse tools, reasoning, and human collaboration. We present MARCO, a Multi-Agent Real-time Chat Orchestration framework for automating tasks using LLMs. MARCO addresses key challenges in utilizing LLMs for complex, multi-step

Conversational AI
Unraveling the gradient descent dynamics of transformers

Bingqing Song, Boran Han, Shuai Zhang, Jie Ding, Mingyi Hong

NeurIPS 2024

2024

While the Transformer architecture has achieved remarkable success across various domains, a thorough theoretical foundation explaining its optimization dynamics is yet to be fully developed. In this study, we aim to bridge this understanding gap by answering the following two core questions: (1) Which types of Transformer architectures allow Gradient Descent (GD) to achieve guaranteed convergence? and

Related: Understanding the training dynamics of transformers

Machine learning

Popular deep-learning book from Amazon authors gets update

Staff writer

December 22, 2022

Google JAX Python library implementation and new topics added; volume 1 of book to be published by Cambridge University Press.

Machine learning
Machine Learning University debuts responsible AI course

Staff writer

December 15, 2022

New, free offering provides students of any level practical skills and code examples for every stage, from the machine learning problem all the way to deployment.

Machine learning
Cognixion gives voice to a user’s thoughts

Staff writer

December 14, 2022

Alexa Fund company’s assisted reality tech could unlock speech for hundreds of millions of people who struggle to communicate.

Machine learning
AWS VP Bratin Saha: ML is becoming a 'mainstream endeavor'

Staff writer

December 12, 2022

Vice president of ML and AI Services says more than 100,000 customers are doing machine learning on AWS.

Machine learning
Amazon SageMaker's fifth birthday: Looking back, looking forward

Larry Hardesty

December 12, 2022

Vice president Bratin Saha reflects on the past and future of Amazon Web Services’ machine learning tools and AI services.

Machine learning
Self-learning Alexa: ML model updates with no human in the loop

Staff writer

December 07, 2022

Learn about a real-time continual, lifelong learning system that trains machine learning models using production data at scale.
