Bayesian prompt ensembles: Model uncertainty estimation for black-box large language models

Francesco Tonolini; Jordan Massiah; Nikolaos Aletras; Gabriella Kazai

Publication

Bayesian prompt ensembles: Model uncertainty estimation for black-box large language models

By Francesco Tonolini, Jordan Massiah, Nikolaos Aletras, Gabriella Kazai

2024

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

An important requirement for the reliable deployment of pre-trained large language models (LLMs) is the well-calibrated quantification of the uncertainty in their outputs. While the likelihood of predicting the next token is a practical surrogate of the data uncertainty learned during training, model uncertainty is challenging to estimate, i.e., due to lack of knowledge acquired during training. Prior efforts to quantify uncertainty of neural networks require specific architectures or (re-)training strategies, which are impractical to apply to LLMs with several billion parameters, or for black-box models where the architecture and parameters are not available. In this paper, we propose Bayesian Prompts Ensembles (BayesPE), a novel approach to effectively obtain well-calibrated uncertainty for the output of pre-trained LLMs. BayesPE computes output probabilities through a weighted ensemble of different, but semantically equivalent, task instruction prompts. The relative weights of the different prompts in the ensemble are estimated through approximate Bayesian variational inference over a small labeled validation set. We demonstrate that BayesPE approximates a Bayesian input layer for the LLM, providing a lower bound on the expected model error. In our extensive experiments, we show that BayesPE achieves significantly superior uncertainty calibration compared to several baselines over a range of natural language classification tasks, both in zero- and few-shot settings.

Bayesian prompt ensembles: Model uncertainty estimation for black-box large language models

Latest news

Work with us