Ergodicity and mixing: why your backtest can quietly lie
When you backtest a strategy on 20 years of data and assume the result generalises, you are implicitly assuming the market is ergodic: one long historical path gives the same distribution as averaging across infinitely many parallel universes. When ergodicity breaks — regime changes, structural shifts, absorbing states — your backtest is a sample of one non-representative trajectory, and ‘on average’ becomes a meaningless phrase. Birkhoff’s ergodic theorem is the formal guarantee that the swap is legal, and mixing speed says how long the horizon needs to be before the guarantee kicks in.
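One way to see what ergodicity buys you: simulate a sticky two-state chain and compare the time average along one long path with the ensemble average over many independent short paths. This is a minimal sketch, not part of the tutorial's own code; the chain, seed, and horizons are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Sticky two-state chain: stay with probability 0.95, switch with 0.05.
P = np.array([[0.95, 0.05],
              [0.05, 0.95]])

def simulate(n_steps, start=0):
    """One sample path of the chain, as an array of states in {0, 1}."""
    x = start
    path = np.empty(n_steps, dtype=int)
    for t in range(n_steps):
        path[t] = x
        x = rng.choice(2, p=P[x])
    return path

# The stationary distribution is (0.5, 0.5), so both averages should
# approach 0.5. The time average only earns that convergence once the
# horizon is many mixing times long; here the half-life is roughly 7 steps,
# so 100,000 steps is plenty.
long_path = simulate(100_000)
print(long_path.mean())   # time average along one path, near 0.5

short_paths = np.array([simulate(50)[-1] for _ in range(2_000)])
print(short_paths.mean())  # ensemble average at a fixed time, also near 0.5
```

If you shorten the single path to a few dozen steps, its time average swings wildly depending on which state it started in: that is the one-non-representative-trajectory failure mode in miniature.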
Try first (productive failure)
Before the worked example: spend 60 seconds taking your best shot at this. A guess is fine; being briefly wrong about a problem makes the explanation land harder when you read it. This appears once per tutorial; skip it if you already know the trick.
Worked example
A sticky 2-state Markov chain has transition matrix $P = \begin{pmatrix} 0.95 & 0.05 \\ 0.05 & 0.95 \end{pmatrix}$. (a) Confirm the chain is ergodic. (b) Compute the second eigenvalue $\lambda_2$. (c) Report the half-life of the total-variation distance from stationarity (how many steps to halve the distance).
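Parts (a) through (c) can be checked numerically. A minimal NumPy sketch, using the standard bound that the total-variation distance from stationarity shrinks by a factor of $|\lambda_2|$ per step:

```python
import numpy as np

# Transition matrix of the sticky two-state chain from the worked example.
P = np.array([[0.95, 0.05],
              [0.05, 0.95]])

# (a) Ergodicity: every entry of P is strictly positive, so the chain is
# irreducible (each state reaches the other in one step) and aperiodic
# (self-loops exist). Hence it is ergodic.
assert (P > 0).all()

# (b) Eigenvalues. For this symmetric 2x2 stochastic matrix the second
# eigenvalue is trace(P) - 1 = 0.95 + 0.95 - 1 = 0.9.
eigvals = np.sort(np.linalg.eigvals(P).real)[::-1]
lam2 = eigvals[1]
print(lam2)       # ~0.9

# (c) TV distance decays like lam2**t, so the half-life solves
# lam2**t = 1/2, i.e. t = ln(1/2) / ln(lam2).
half_life = np.log(0.5) / np.log(lam2)
print(half_life)  # ~6.6 steps, so about 7 steps to halve the distance
```

Note how close $\lambda_2 = 0.9$ is to 1: that is what "sticky" means spectrally, and it is the single number that controls everything downstream.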
Practice 1 of 3
Type a fraction, decimal, or expression; mathjs parses it.
Reflection
Slow mixing breaks the OLS independence assumption that justifies its standard errors. Why is this the formal motivation for Newey-West / HAC standard errors, and which Markov-chain quantity ($\lambda_2$, mixing time, autocorrelation length) is the right knob to scale them?
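One way into the question: if an observable's lag-$k$ autocorrelation decays like $\lambda_2^k$, it behaves like an AR(1) process with $\rho = \lambda_2$, and the integrated autocorrelation time $\tau = (1+\rho)/(1-\rho)$ is exactly the variance-inflation factor that i.i.d. standard errors miss. A sketch under that AR(1) assumption (the sample size is illustrative):

```python
import numpy as np

# Assumed model: autocorrelation at lag k is lam2**k, i.e. AR(1) with
# rho = lam2 = 0.9, matching the sticky chain's second eigenvalue.
lam2 = 0.9

# Integrated autocorrelation time: tau = (1 + rho) / (1 - rho).
# Each observation carries roughly 1/tau of an independent sample.
tau = (1 + lam2) / (1 - lam2)
print(tau)  # 19.0

# Naive i.i.d. standard errors scale like 1/sqrt(n); the correct scale
# under this autocorrelation is sqrt(tau)/sqrt(n). Newey-West / HAC
# estimators recover the extra factor by summing autocovariances out to
# a bandwidth of a few tau.
n = 5_000
naive_se_scale = 1 / np.sqrt(n)
hac_se_scale = np.sqrt(tau) / np.sqrt(n)
print(hac_se_scale / naive_se_scale)  # sqrt(19), about 4.4x wider
```

So the right knob is the autocorrelation length (equivalently $\tau$, which $\lambda_2$ determines via $\rho = \lambda_2$): with $\lambda_2 = 0.9$, naive standard errors are understated by a factor of about $\sqrt{19} \approx 4.4$.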