Failure Prediction Is a Better Performance Proxy for Early-Exit Networks Than Calibration
Abstract
Early-exit models accelerate inference by attaching internal classifiers to intermediate layers of a network, allowing computation to halt once a prediction satisfies a predefined exit criterion. Most early-exit methods rely on confidence-based exit strategies, a reliance that has motivated prior work to calibrate the intermediate classifiers in pursuit of better performance-efficiency trade-offs. In this paper, we argue that calibration metrics can be misleading indicators of multi-exit model performance. Specifically, we present empirical evidence that miscalibrated networks can outperform calibrated ones. As an alternative, we propose failure prediction as a more informative proxy for early-exit model performance. Unlike calibration, failure prediction captures changes in sample rankings and correlates strongly with efficiency gains, offering a more reliable basis for designing and evaluating early-exit models.
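The contrast the abstract draws can be made concrete with a small sketch. Assuming per-exit softmax outputs are available as an array (the shapes, threshold value, and function names below are illustrative, not the paper's implementation), the following Python shows a confidence-thresholded exit rule, a standard expected-calibration-error (ECE) estimate, and a ranking-based failure-prediction score: the AUROC of confidence as a separator of correct from incorrect predictions.

```python
import numpy as np

def early_exit(exit_probs, threshold=0.9):
    """Confidence-based early exit (illustrative sketch).

    exit_probs: (n_exits, n_samples, n_classes) softmax outputs of the
    intermediate classifiers, ordered from shallowest to deepest. Each
    sample leaves at the first exit whose top-class probability reaches
    `threshold`; the final exit always answers.
    """
    n_exits, n_samples, _ = exit_probs.shape
    preds = np.empty(n_samples, dtype=int)
    exits = np.empty(n_samples, dtype=int)
    for i in range(n_samples):
        for e in range(n_exits):
            if exit_probs[e, i].max() >= threshold or e == n_exits - 1:
                preds[i], exits[i] = exit_probs[e, i].argmax(), e
                break
    return preds, exits

def ece(conf, correct, n_bins=15):
    """Expected calibration error: bin samples by confidence and sum the
    |accuracy - confidence| gap per bin, weighted by bin size."""
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    total = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        m = (conf > lo) & (conf <= hi)
        if m.any():
            total += m.mean() * abs(correct[m].mean() - conf[m].mean())
    return total

def failure_auroc(conf, correct):
    """Failure prediction as ranking: the probability that a correctly
    classified sample receives higher confidence than a misclassified
    one (ties count half)."""
    pos, neg = conf[correct], conf[~correct]
    return ((pos[:, None] > neg[None, :]).mean()
            + 0.5 * (pos[:, None] == neg[None, :]).mean())
```

The sketch also makes the disagreement between the two metrics visible: any strictly increasing transform of the confidence scores can freely move `ece` while leaving `failure_auroc` unchanged, and at a matched exit rate it leaves the set of early-exiting samples unchanged too, which is why calibration alone can mislead about the performance-efficiency trade-off.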
Publication
In the Structured Probabilistic Inference and Generative Modeling Workshop at NeurIPS 2025