ML//Inference//extended thinking//overthinking

2026-03-05

Documented empirically: forcing long extended thinking on simple problems makes the model **worse**, not better.

Documented empirically: forcing long extended thinking on simple problems makes the model worse, not better.

On simple questions where the direct answer has high probability from the start, thinking introduces:

More opportunities for self-consistency bias: the model prefers branches consistent with its own pretraining biases, not necessarily the most correct ones.

The model can "convince itself" to change a correct answer: reasoning generates context that activates irrelevant complexity.

Drift toward regions of latent space associated with unnecessary complexity.

Papers show o1 performs worse than GPT-4 on simple common-sense questions precisely because of this: the thinking generates context that triggers a distributional shift to the wrong region.

Token budgets (like Anthropic's) are partially a solution: not just cost control, but preventing the model from thinking too much on problems that don't need it.

The deeper problem: the model has no reliable metacognition. It can't assess "do I need to think more?" It just generates tokens until told to stop.