Distance-Based Learning from Errors for Confidence Calibration

Chen Xing; Sercan Arik; Zizhao Zhang; Tomas Pfister

Keywords: calibration, uncertainty, uncertainty estimation

Tues Session 3 (12:00-14:00 GMT)

Tues Session 4 (17:00-19:00 GMT)

Abstract: Deep neural networks (DNNs) are poorly calibrated when trained in conventional ways. To improve the confidence calibration of DNNs, we propose a novel training method, distance-based learning from errors (DBLE). DBLE bases its confidence estimation on distances in the representation space. In DBLE, we first adapt prototypical learning to train classification models. This yields a representation space in which the distance between a test sample and its ground-truth class center can calibrate the model's classification performance. At inference, however, these distances are not available because ground-truth labels are missing. To circumvent this, we propose to train a confidence model jointly with the classification model so that the distance can be inferred for every test sample. We integrate this into training by learning only from misclassified training samples, which we show to be highly beneficial for effective learning. On multiple datasets and DNN architectures, we demonstrate that DBLE outperforms alternative single-model confidence calibration approaches. DBLE also achieves performance comparable to computationally expensive ensemble approaches, with lower computational cost and fewer parameters.
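
To make the distance-based idea in the abstract concrete, the following is a minimal sketch of classification by distance to class centers, in the style of prototypical learning. It is an illustration under stated assumptions, not the authors' implementation: the squared Euclidean distance, the softmax over negative distances, the toy 2-D embeddings, and the helper names (`class_centers`, `squared_distance`, `predict_proba`) are all hypothetical choices that the abstract does not specify. The jointly trained confidence model and the learning-from-errors step are not shown.

```python
import math


def class_centers(embeddings, labels):
    """Return the mean embedding of each class (its center, or prototype)."""
    sums, counts = {}, {}
    for vec, label in zip(embeddings, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def squared_distance(a, b):
    """Squared Euclidean distance (an assumed choice of metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict_proba(query, centers):
    """Softmax over negative squared distances to every class center."""
    logits = {c: -squared_distance(query, center) for c, center in centers.items()}
    top = max(logits.values())  # subtract the max for numerical stability
    exps = {c: math.exp(v - top) for c, v in logits.items()}
    total = sum(exps.values())
    return {c: e / total for c, e in exps.items()}


# Toy 2-D "embeddings" standing in for the output of a trained encoder.
train_embeddings = [[0.0, 0.0], [0.2, 0.0], [2.0, 2.0], [2.0, 1.8]]
train_labels = ["a", "a", "b", "b"]
centers = class_centers(train_embeddings, train_labels)

near_a = predict_proba([0.1, 0.1], centers)    # close to the center of class "a"
between = predict_proba([1.0, 1.0], centers)   # equidistant from both centers
```

In this sketch a query close to one class center receives a probability near 1 for that class, while a query equidistant from two centers receives roughly equal probabilities. This mirrors the abstract's point that distance in the representation space carries information about how trustworthy a prediction is.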