Book detail

Cuantum trackFull access

Under the Hood of Large Language Models

8 chapters and 47 canonical sections synced from the Cuantum content database.

Author

Cuantum Tech.

Chapters

8

Reading time

~ 9h

Level

Professional

Language

English

Edition

2025

Your progress0%

Chapters & sections

8 chapters - 47 sections

Chapter 01

Chapter 1: What Are LLMs? From Transformers to Titans

0/5

11.1 From GPT to LLaMA, Claude, Gemini, Mistral, DeepSeek12m 21.2 Decoder-Only vs Encoder-Decoder vs Mixture-of-Experts (MoE)12m 31.3 Scaling Laws: Kaplan, Chinchilla, and Data–Model Trade-Offs12m 5Chapter 1 Summary – From Transformers to Titans12m 4Practical Exercises – Chapter 112m

Chapter 02

Chapter 2: Tokenization and Embeddings

0/5

12.1 Byte Pair Encoding (BPE), WordPiece, SentencePiece12m 22.2 Training Custom Tokenizers for Domain-Specific Tasks12m 32.3 Subword, Character-Level, and Multimodal Embeddings12m 5Chapter 2 Summary12m 4Practical Exercises – Chapter 212m

Chapter 03

Chapter 3: Anatomy of an LLM

0/5

13.1 Multi-Head Attention, Rotary Embeddings, and Normalization Strategies12m 23.2 Transformer Depth vs Width, Position Encoding Tricks (ALiBi, RoPE)12m 33.3 Advanced Architectures: SwiGLU, GQA, Attention Sparsity12m 5Chapter 3 Summary – Anatomy of an LLM12m 4Practical Exercises – Chapter 312m

Chapter 04

Chapter 4: Training LLMs from Scratch

0/6

14.1 Data Collection, Cleaning, Deduplication, and Filtering12m 24.2 Curriculum Learning, Mixture Datasets, and Synthetic Data12m 34.3 Infrastructure: Distributed Training, GPUs vs TPUs vs Accelerators12m 44.4 Cost Optimization & Sustainability in Large-Scale Training12m 5Chapter 4 Summary – Training LLMs from Scratch12m 6Practical Exercises – Chapter 412m

Chapter 05

Chapter 5: Beyond Text: Multimodal LLMs

0/5

15.1 Text+Image Models (LLaVA, Flamingo, GPT-4o, DeepSeek-VL)12m 25.2 Audio & Speech Integration (Whisper, SpeechLM)12m 35.3 Video and Cross-Modal Research Directions12m 4Chapter 5 Summary – Beyond Text: Multimodal LLMs12m 5Practical Exercises – Chapter 512m

Chapter 06

Quiz

0/2

2Answers12m 1Questions12m

Chapter 07

Project 1: Build a Toy Transformer from Scratch in PyTorch

0/8

10. Setup12m 21. Tiny Dataset & Character Tokenizer12m 32. Model Components12m 43. The Tiny GPT-Style Model12m 54. Training Loop (Causal LM)12m 65. Text Generation12m 76. Where to go next (your experiments)12m 8Learning outcomes12m

Chapter 08

Project 2: Train a Custom Domain-Specific Tokenizer (e.g., for legal or medical texts)

0/11

10. Setup12m 21. Gather a Representative Mini-Corpus12m 32. Train a BPE Tokenizer (🤗 tokenizers)12m 43. Train a SentencePiece Tokenizer (Unigram or BPE)12m 54. Wrap Your Tokenizer for Transformers12m 65. Evaluate Tokenizer Quality12m 76. Add a User Vocabulary (optional but powerful)12m 87. Save, Load, and Version12m 98. Plug Into a Small Model (sanity run)12m 11Learning outcomes12m 10Pitfalls & Tips12m