LLM Wiki
  • Home
  • Articles
  • Search

Articles

6 articles across all categories

  • All
  • Fundamentals
  • Architectures
  • Training
  • Applications
  • Applications 1 Feb 2024

    Retrieval-Augmented Generation

    Combining retrieval systems with generative models to produce accurate, grounded responses that go beyond the training data.

    Read article →
  • Training 25 Jan 2024

    Fine-tuning LLMs

    Adapting pre-trained LLMs to specific tasks: instruction tuning, RLHF, LoRA, and QLoRA.

    Read article →
  • Architectures 20 Jan 2024

    Attention Mechanisms

    A deep dive into attention mechanisms: scaled dot-product attention, cross-attention, and flash attention.

    Read article →
  • Architectures 15 Jan 2024

    Transformer Architecture

    The "Attention Is All You Need" paper revolutionized NLP. Understanding the encoder-decoder structure, multi-head attention, and positional encodings.

    Read article →
  • Fundamentals 10 Jan 2024

    Backpropagation

    The algorithm behind training neural networks — from computing gradients via the chain rule to updating weights with gradient descent.

    Read article →
  • Fundamentals 8 Jan 2024

    Tokenization

    How text is split into tokens for LLM input: BPE, WordPiece, SentencePiece, and the vocabulary trade-offs.

    Read article →

LLM Wiki — Personal knowledge base

  • Home
  • Articles
  • Search