GPU by Hand ✍️ Together

Keynote Lecture Sponsored by Together AI

Oct 20, 2025

GPU by Hand ✍️ - blank

42.6KB ∙ XLSX file

“Dad, why didn’t you buy a ton of NVIDIA stock and become a millionaire? Didn’t you work on AI?”

My son hit me with that one recently. Ouch! He asked me the same question about Bitcoins last year. 🤣

No, I didn’t retire by owning $NVDA. But I do own something that I chose: The knowledge of how GPUs actually work — threads, warps, blocks, CUDA.

Next week I am going to share that knowledge in my keynote lecture.

Title: GPU by hand ✍️ Together

Date: 10/30

Time: 10am (Pacific Time)

Register: https://luma.com/khuz5upe

The leap from Hopper to Blackwell isn’t just a story of “faster GPUs” — it’s a fundamental rethinking of how every layer of the stack is organized to unlock the next frontier of AI scale. And as typical in an AI by Hand lecture, instead of focusing on benchmarks from the outside, we will dive inside the black box — unpacking the math behind these advances and making the underlying mechanisms accessible to you.

Outline

Primer: Grid -> Block -> Warp -> Thread
Quantization: FP8 -> MXFP4
Scaling: Layer -> Micro-Tensor
Packaging: Single-Die -> Dual-Die
Communication: In-GPU -> In-switch reduction

In the end…

do I want to be remembered as a lucky dad,

or a devoted dad who helped him understand the world he’s growing into?

RSVP

This keynote lecture is sponsored by Together AI.

AI by Hand ✍️

Discussion about this post

Ready for more?