Attention: the series11 interactive diagramsProf. Tom YehMay 25, 2026ShareAttention Series:QKV ProjectionAttention ComputationSelf AttentionCross AttentionSelf Attention vs Cross AttentionSelf Attention (Shared KV)Multi-Head AttentionFused QKV (Multi-Head)Single vs Multi-Head AttentionMulti-Query AttentionGrouped-Query Attention1. QKV ProjectionRead QKV Projection →2. Attention ComputationRead Attention Computation →3. Self AttentionRead Self Attention →4. Cross AttentionRead Cross Attention →5. Self Attention vs Cross AttentionRead Self Attention vs Cross Attention →6. Self Attention (Shared KV)Read Self Attention (Shared KV) →7. Multi-Head AttentionRead Multi-Head Attention →8. Fused QKV (Multi-Head)Read Fused QKV (Multi-Head) →9. Single vs Multi-Head AttentionRead Single vs Multi-Head Attention →10. Multi-Query AttentionRead Multi-Query Attention →11. Grouped-Query AttentionRead Grouped-Query Attention →