ML//Training//dataset//tail distribution

The rare events at the edges of a probability distribution — the 0.1% cases that standard training underweights.


The rare events at the edges of a probability distribution — the 0.1% cases that standard training underweights.

In language: unusual phrasings, rare factual combinations, edge-case reasoning chains. In safety: jailbreaks, adversarial inputs, novel failure modes.

Why they matter: robustness lives in the tails. A model that handles the average case perfectly but fails on tails is fragile — and failures in tails are often the most consequential.

Model collapse erodes tails first because they have the weakest signal — the model "forgets" what it barely learned.

Reward hacking exploits tail blindness: if the RM hasn't seen a particular adversarial pattern, the model can optimize toward it without penalty.

Data augmentation and targeted collection of rare cases are the main defenses — but you can't augment what you don't know exists.