ML//scaling laws

Kaplan et al. (2020): performance follows power laws in compute, data, and parameters.


Kaplan et al. (2020): performance follows power laws in compute, data, and parameters.

Predictable — you can forecast capability from training budget before running the experiment.

Changed how labs plan: don't architect smarter, just scale bigger. Then Chinchilla refined the recipe.