Systems Theory//emergence

2026-03-08

Macro-level behavior that arises from micro-level interactions without being explicitly designed or programmed: the whole does something the parts do not.
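
A toy illustration of the definition, not tied to any ML system: in Conway's Game of Life every cell follows the same local birth/survival rule, yet a "glider" pattern travels diagonally across the grid. Nothing in the rule mentions movement. The function name `step` and the coordinate convention (x to the right, y downward) are choices made for this sketch.

```python
from collections import Counter

def step(live):
    """Advance one generation. `live` is a set of (x, y) live cells on an unbounded grid."""
    # Count, for every cell adjacent to a live cell, how many live neighbors it has.
    counts = Counter(
        (x + dx, y + dy)
        for (x, y) in live
        for dx in (-1, 0, 1)
        for dy in (-1, 0, 1)
        if (dx, dy) != (0, 0)
    )
    # Micro-level rule: birth on exactly 3 neighbors, survival on 2 or 3.
    return {cell for cell, n in counts.items() if n == 3 or (n == 2 and cell in live)}

# The standard five-cell glider.
glider = {(1, 0), (2, 1), (0, 2), (1, 2), (2, 2)}

state = glider
for _ in range(4):
    state = step(state)

# Macro-level behavior: after 4 generations the same shape reappears, shifted by (+1, +1).
shifted = {(x + 1, y + 1) for (x, y) in glider}
print(state == shifted)
```

The glider is also an example of weak emergence in the sense used later in this note: its motion is fully deducible by running the rule, but is not stated anywhere in it.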

First-order emergence: a single surprising behavior appears. Emergent behaviors in RL-trained models (chain-of-thought reasoning appearing in DeepSeek-R1-Zero, tool use appearing in Cursor's Composer) are first-order: each is one capability the training signal did not specify.

Second-order emergence: individually emergent behaviors compose into workflows or strategies no one designed. Cursor's model learned to search, then edit, then self-test, then fix. Each step emerged independently via RL, but together they form a coherent strategy. In extended thinking, individual reasoning steps are first-order, but the model's ability to sequence them into multi-step proofs is second-order.

Strong vs weak emergence: weak emergence is in principle deducible from the parts (given enough compute). Strong emergence is not. Most ML emergence is weak: surprising to us but implicit in the optimization landscape.

Connection to scaling laws: many emergent capabilities appear abruptly at certain model scales. Below a threshold scale the capability is absent; above it, the capability is present, with little gradual ramp in between. This is phase-transition-like behavior.
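
One way a threshold-like curve can arise, sketched under assumptions that are not claims of this note: suppose per-step accuracy improves smoothly with log model scale (a logistic curve here), and a task only counts as solved if all of its steps are correct. The functions `per_step_accuracy` and `task_success`, the logistic form, and the 20-step task length are all hypothetical choices for illustration.

```python
import math

def per_step_accuracy(log_scale, midpoint=0.0, slope=1.0):
    """Assumed smooth micro-level quantity: logistic in log model scale."""
    return 1.0 / (1.0 + math.exp(-slope * (log_scale - midpoint)))

def task_success(log_scale, steps=20):
    """Macro-level metric: the task succeeds only if every step is correct."""
    return per_step_accuracy(log_scale) ** steps

# Per-step accuracy climbs gradually; whole-task success stays near zero, then jumps.
for log_scale in range(0, 7):
    p = per_step_accuracy(log_scale)
    print(f"log_scale={log_scale}  per_step={p:.3f}  task={task_success(log_scale):.3f}")
```

In this toy, the sharp rise comes from compounding a smooth quantity over many steps. Whether real capability thresholds work this way is an open question that the sketch does not settle.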

Model collapse is emergence in reverse: a macro-level failure (distribution narrowing) that emerges from micro-level dynamics (training on synthetic data), with no single training step being the cause.
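
A minimal toy of the narrowing dynamic, assuming the "model" is just a Gaussian fitted by maximum likelihood and each generation is trained only on samples drawn from the previous generation's fit. The function name `simulate_collapse` and its parameters (200 generations, 10 samples per generation) are hypothetical, and a real language model is far more complex than this.

```python
import random
import statistics

def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Refit a Gaussian on its own samples each generation; return the std history."""
    rng = random.Random(seed)
    mean, std = 0.0, 1.0          # generation 0: the "real" data distribution
    history = [std]
    for _ in range(generations):
        # Synthetic data: samples from the current model only.
        samples = [rng.gauss(mean, std) for _ in range(sample_size)]
        # Next model: maximum-likelihood fit to that synthetic data.
        mean = statistics.fmean(samples)
        std = statistics.pstdev(samples)
        history.append(std)
    return history

history = simulate_collapse()
print(f"initial std: {history[0]:.3f}, final std: {history[-1]:.3e}")
```

No single refit is responsible for the collapse: each one loses only a little spread on average, but the losses compound across generations, which matches the point that no single training step is the cause.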