Calibration of Natural Language Understanding Models with Venn–ABERS Predictors

Patrizio Giovannotti

Calibration of Natural Language Understanding Models with Venn–ABERS Predictors

Department of Computer Science

Research output: Contribution to conference › Paper › peer-review

Abstract

Transformers, currently the state-of-the-art in natural language understanding (NLU) tasks, are prone to generate uncalibrated predictions or extreme probabilities, making the process of taking different decisions based on their output relatively difficult. In this paper we propose to build several inductive Venn–ABERS predictors (IVAP), which are guaranteed to be well calibrated under minimal assumptions, based on a selection of pre-trained transformers. We test their performance over a set of diverse NLU tasks and show that they are capable of producing well-calibrated probabilistic predictions that are uniformly spread over the [0,1] interval – all while retaining the original model's predictive accuracy.

Original language	English
Publication status	Published - 2022
Event	The 11th Symposium on Conformal and Probabilistic Prediction with Applications: COPA 2022 - Brighton, United Kingdom Duration: 24 Aug 2022 → 26 Aug 2022 https://copa-conference.com

Conference

Conference	The 11th Symposium on Conformal and Probabilistic Prediction with Applications: COPA 2022
Country/Territory	United Kingdom
City	Brighton
Period	24/08/22 → 26/08/22
Internet address	https://copa-conference.com

Access to Document

https://proceedings.mlr.press/v179/giovannotti22a.htmlLicence: Unspecified

Cite this

@conference{64fea8021a1646aeabea86fdca210043,

title = "Calibration of Natural Language Understanding Models with Venn–ABERS Predictors",

abstract = "Transformers, currently the state-of-the-art in natural language understanding (NLU) tasks, are prone to generate uncalibrated predictions or extreme probabilities, making the process of taking different decisions based on their output relatively difficult. In this paper we propose to build several inductive Venn–ABERS predictors (IVAP), which are guaranteed to be well calibrated under minimal assumptions, based on a selection of pre-trained transformers. We test their performance over a set of diverse NLU tasks and show that they are capable of producing well-calibrated probabilistic predictions that are uniformly spread over the [0,1] interval – all while retaining the original model's predictive accuracy.",

author = "Patrizio Giovannotti",

year = "2022",

language = "English",

note = "The 11th Symposium on Conformal and Probabilistic Prediction with Applications: COPA 2022 ; Conference date: 24-08-2022 Through 26-08-2022",

url = "https://copa-conference.com",

}

TY - CONF

T1 - Calibration of Natural Language Understanding Models with Venn–ABERS Predictors

AU - Giovannotti, Patrizio

PY - 2022

Y1 - 2022

N2 - Transformers, currently the state-of-the-art in natural language understanding (NLU) tasks, are prone to generate uncalibrated predictions or extreme probabilities, making the process of taking different decisions based on their output relatively difficult. In this paper we propose to build several inductive Venn–ABERS predictors (IVAP), which are guaranteed to be well calibrated under minimal assumptions, based on a selection of pre-trained transformers. We test their performance over a set of diverse NLU tasks and show that they are capable of producing well-calibrated probabilistic predictions that are uniformly spread over the [0,1] interval – all while retaining the original model's predictive accuracy.

AB - Transformers, currently the state-of-the-art in natural language understanding (NLU) tasks, are prone to generate uncalibrated predictions or extreme probabilities, making the process of taking different decisions based on their output relatively difficult. In this paper we propose to build several inductive Venn–ABERS predictors (IVAP), which are guaranteed to be well calibrated under minimal assumptions, based on a selection of pre-trained transformers. We test their performance over a set of diverse NLU tasks and show that they are capable of producing well-calibrated probabilistic predictions that are uniformly spread over the [0,1] interval – all while retaining the original model's predictive accuracy.

UR - https://arxiv.org/abs/2205.10586

M3 - Paper

T2 - The 11th Symposium on Conformal and Probabilistic Prediction with Applications: COPA 2022

Y2 - 24 August 2022 through 26 August 2022

ER -

Calibration of Natural Language Understanding Models with Venn–ABERS Predictors

Abstract

Conference

Access to Document

Other files and links

Cite this