LLM Wiki

A personal knowledge base for machine learning, large language models, and adjacent topics — inspired by Karpathy's LLM Wiki, but broader in scope.

Concepts explained clearly, with code, math, and references.

Browse by Category

Recent Articles

Applications 1 Feb 2024

Retrieval-Augmented Generation

Combining retrieval systems with generative models to produce accurate, grounded responses that go beyond the training data.
Training 25 Jan 2024

Fine-tuning LLMs

Adapting pre-trained LLMs to specific tasks: instruction tuning, RLHF, LoRA, and QLoRA.
Architectures 20 Jan 2024

Attention Mechanisms

A deep dive into attention mechanisms: scaled dot-product attention, cross-attention, and flash attention.
Architectures 15 Jan 2024

Transformer Architecture

The "Attention Is All You Need" paper revolutionized NLP. Understanding the encoder-decoder structure, multi-head attention, and positional encodings.