Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Open-source suite of models (70M–12B) with checkpoints and training code for studying how LLM capabilities evolve during training.
Scaling & Training
Author

Imad Dabbura

Published

December 9, 2023

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

#nlp #llm

Back to top