Abstract
This empirical study examines how language model performance scales with model size, dataset size, and compute budget, and finds that these relationships follow predictable power laws.
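As a rough illustration of what a power-law relationship means in practice, the sketch below fits a curve of the form loss ≈ a · N^(−alpha) by linear regression in log-log space. The function name `fit_power_law`, the functional form, and the data points are assumptions made for illustration only; the data are synthetic and are not results from the study.

```python
import numpy as np


def fit_power_law(x, y):
    """Fit y ~ a * x**(-alpha) by least squares in log-log space.

    Hypothetical helper for illustration; returns (a, alpha).
    """
    # A power law is a straight line in log-log coordinates:
    # log(y) = log(a) - alpha * log(x)
    slope, intercept = np.polyfit(np.log(x), np.log(y), deg=1)
    return float(np.exp(intercept)), float(-slope)


# Synthetic data generated from an assumed law with a = 10, alpha = 0.1.
model_sizes = np.array([1e6, 1e7, 1e8, 1e9])
losses = 10.0 * model_sizes ** -0.1

a, alpha = fit_power_law(model_sizes, losses)
print(f"a = {a:.3f}, alpha = {alpha:.3f}")
```

The same fit could be applied with dataset size or compute budget on the x-axis. On real measurements the points would scatter around the line, so the fitted exponent would only be an estimate.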