Transformers
Master the architecture that powers modern AI, from attention mechanisms to GPT, BERT, and beyond
Part 1
Prerequisites
Essential background for understanding transformers
Part 2
Foundations
The building blocks of transformer architecture
Part 3
The Attention Mechanism
Understanding the core innovation
Part 4
The Full Architecture
Assembling the complete transformer
Part 5
Transformer Variants
Different architectures for different tasks
Part 6
Real-World Systems
From theory to production
18
Training Transformers
From random to intelligent
19
Scaling Laws and Emergence
Why bigger is different
20
How LLMs Work
Inside a modern language model
21
The Inference Pipeline
Making generation fast
22
Fine-Tuning and Adaptation
Customizing models
23
Alignment and Safety
Making models helpful and harmless
24
RAG and Vector Search
Grounding in external knowledge
25
Prompt Engineering
Techniques to get the best outputs
26
Tool Calling and Agents
Interacting with the world
27
What Comes Next
The frontier of research