Wiki/Topics/AI / ML/Foundations/Bias-Variance Tradeoff

Bias-Variance Tradeoff

biasvarianceoverfittingunderfittingfundamentals2026-04-08

I Use This When...

Training and validation results disagree, or the model is obviously too simple or too unstable. Bias-variance is the diagnosis layer that explains whether I need a richer model, more data, stronger regularization, or a different learning setup.

History

Geman, Bienenstock, Doursat (1992) formalized it. The fundamental tension in all of machine learning.

Why It Exists

Every model makes two types of errors. Bias: the model is too simple to capture the pattern (underfitting). Variance: the model memorizes noise (overfitting). You can't minimize both — you must balance.

How It Works

Visual Intuition

Imagine fitting curves to noisy points:

  • a straight line misses the real pattern and underfits
  • a wildly wiggly curve memorizes noise and overfits
  • a moderate curve captures the structure without chasing every fluctuation

That middle region is the bias-variance compromise we want.

Step by Step

  1. Train a model and evaluate on validation data
  2. If both train and validation errors are high, bias is likely high
  3. If train error is low but validation error is high, variance is likely high
  4. Adjust model capacity, regularization, data, or ensembling accordingly

Code

train_error = 0.08
val_error = 0.21

if val_error - train_error > 0.1:
    print("likely high variance")

The Math Inside

A common decomposition view is:

expected error = bias^2 + variance + irreducible noise

Interpretation:

  • bias: systematic error from a too-simple model class
  • variance: sensitivity to the particular training sample
  • irreducible noise: uncertainty in the data that no model can eliminate

Different methods shift the balance differently:

  • bagging and random forests mainly reduce variance
  • boosting often reduces bias
  • regularization trades a little more bias for less variance

Math Prerequisites

Linked from