LLM Training Puzzles Save

What would you do with 1000 H100s...

Project README

LLM Training Puzzles

image

This is a collection of 8 challenging puzzles about training large language models (or really any NN) on many, many GPUs. Very few people actually get a chance to train on thousands of computers, but it is an interesting challenge and one that is critically important for modern AI. The goal of these puzzles is to get hands-on experience with the key primitives and to understand the goals of memory efficiency and compute pipelining.

I recommend running in Colab. Click here and copy the notebook to get start.

Open In Colab

image

If you are into this kind of thing, this is 6th in a series of these puzzles.

Open Source Agenda is not affiliated with "LLM Training Puzzles" Project. README Source: srush/LLM-Training-Puzzles
Stars
729
Open Issues
2
Last Commit
3 months ago
License
MIT
Tags

Open Source Agenda Badge

Open Source Agenda Rating