Reliability degradation
Practitioners of deep learning often assume that test data and training data share the same distribution. Unfortunately, this assumption does not always hold in practice. The world evolves, and data generated in the future is often out-of-distribution (OOD). Consequently, as the context changes, the in-distribution assumption becomes less realistic, and the reliability of our predictions and uncertainty estimates degrades with it (Fort, Hu, and Lakshminarayanan 2019; Nalisnick et al. 2019; Ovadia et al. 2019). In fact, predictive performance can decrease while measures of confidence increase, resulting in silent failures.
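This silent-failure mode can be made concrete with a minimal sketch. The toy setup below is hypothetical (the distributions, learning rate, and shift are all illustrative choices, not from the source): a 1-D logistic classifier is fit on in-distribution data, and then evaluated on a shifted test set whose inputs land far from the training data. Accuracy collapses while the model's reported confidence stays near 1.

```python
import numpy as np

rng = np.random.default_rng(0)

# In-distribution data: class 0 ~ N(-2, 1), class 1 ~ N(+2, 1).
def sample(n, mu0, mu1, rng):
    x = np.concatenate([rng.normal(mu0, 1.0, n), rng.normal(mu1, 1.0, n)])
    y = np.concatenate([np.zeros(n), np.ones(n)])
    return x, y

x_tr, y_tr = sample(500, -2.0, 2.0, rng)

# Fit logistic regression p(y=1|x) = sigmoid(w*x + b) by gradient descent.
w, b = 0.0, 0.0
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(w * x_tr + b)))
    g = p - y_tr                        # gradient of log-loss wrt logits
    w -= 0.1 * np.mean(g * x_tr)
    b -= 0.1 * np.mean(g)

def evaluate(x, y):
    p = 1.0 / (1.0 + np.exp(-(w * x + b)))
    pred = (p > 0.5).astype(float)
    acc = np.mean(pred == y)
    conf = np.mean(np.maximum(p, 1.0 - p))  # mean predicted confidence
    return acc, conf

# In-distribution test set: high accuracy, high confidence.
acc_id, conf_id = evaluate(*sample(500, -2.0, 2.0, rng))

# OOD test set: the world has changed, and class 0 now appears at
# x ~ N(+4, 1), deep inside what the model learned as the class-1 region.
x_ood = rng.normal(4.0, 1.0, 500)
y_ood = np.zeros(500)
acc_ood, conf_ood = evaluate(x_ood, y_ood)

print(f"in-dist: acc={acc_id:.2f}  mean confidence={conf_id:.2f}")
print(f"OOD:     acc={acc_ood:.2f}  mean confidence={conf_ood:.2f}")
```

On the shifted test set the classifier is almost always wrong, yet its mean confidence is higher than on in-distribution data, since the shifted inputs lie far from the decision boundary: exactly the combination of falling performance and rising confidence described above.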