Forrest Bicker

LLM Resources

A lot of people haved asked how they can learn more about modern ML and LLMs, so I thought I'd compile an (incomplete) selection of papers for getting started. 

1. Antebellum

2. Early Transformers

3. Scaling

3.0 Scaling Theory 3.1 Getting Big 3.2 Approximate Attention Graveyard 3.3 Mixture of Experts

4. Alignment, RL, Reasoning

4.1 Traditional RL 4.2 Modern RL 4.3 Efficient RL 4.4 Neo RL

5. Agentic, Inference-time compute

Thinking Agents

6. Research Revival

ML Systems

Quantization  KV-Cache Deployment

Vision / Multimodal

Optimizers

Footnotes

Mechanistic Interpretability