ML//Inference//distributional shift//basin of attraction

Regions of latent space where certain types of continuations are strongly preferred — the model "falls into" them based on context.


Regions of latent space where certain types of continuations are strongly preferred — the model "falls into" them based on context.

Pretraining created these basins: "after reasoning → coherent conclusion" is one basin, "after question → direct answer" is another, "after nonsense → unpredictable continuation" is another.

Random text as context doesn't help because it positions the model in a basin with no semantic structure — the model has no signal about what type of continuation to produce.

Structured reasoning activates a specific basin because during pretraining, that pattern co-occurred systematically with correct conclusions. It's intentional distributional shift toward a useful attractor.

The technical effect: reasoning tokens reduce entropy of the output distribution — fewer plausible tokens as continuations, and the ones that remain tend to be more correct.

Overthinking happens when excessive reasoning tokens push the model past the useful basin into a "complex reasoning" attractor that's counterproductive for simple problems.

Exposure bias is falling into a bad attractor — once errors accumulate, the context vector enters a basin the model never navigated during training.