Abstract

Empirical study of how language model performance scales with model size, dataset size, and compute budget, revealing predictable power-law relationships.

Research Topics
Read Full Paper