<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI by Hand ✍️]]></title><description><![CDATA[Math, Algorithms, Architectures, by hand]]></description><link>https://www.byhand.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!fOmN!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F618d59fa-9b8f-4814-8d16-f29c2f23ae53_371x371.png</url><title>AI by Hand ✍️</title><link>https://www.byhand.ai</link></image><generator>Substack</generator><lastBuildDate>Tue, 28 Apr 2026 20:10:13 GMT</lastBuildDate><atom:link href="https://www.byhand.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Tom Yeh]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[aibyhand@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aibyhand@substack.com]]></itunes:email><itunes:name><![CDATA[Prof. Tom Yeh]]></itunes:name></itunes:owner><itunes:author><![CDATA[Prof. Tom Yeh]]></itunes:author><googleplay:owner><![CDATA[aibyhand@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aibyhand@substack.com]]></googleplay:email><googleplay:author><![CDATA[Prof. Tom Yeh]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Seminar next week ~ Google's Gemma 4]]></title><description><![CDATA[Frontier Model Seminar Series #1]]></description><link>https://www.byhand.ai/p/seminar-next-week-googles-gemma-4</link><guid isPermaLink="false">https://www.byhand.ai/p/seminar-next-week-googles-gemma-4</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Mon, 27 Apr 2026 18:18:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mP7n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>AI by Hand &#9997;&#65039; Seminars are back! &#128588;</strong></p><p>&#8203;This time, I&#8217;ll cover four frontier open-source models, two from the US and two from China. I&#8217;ll calculate the math and sketch the architectures by hand &#9997;&#65039;.</p><p>&#8203;We start with <strong>Gemma 4</strong>, released by Google DeepMind just a few weeks ago (April 2). This model is the perfect baseline: it runs on a phone, shares its recipe with Gemini 3, and its design choices show exactly which transformer techniques have finally <em>matured</em> into the modern stack. From there, we head to the frontier.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mP7n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mP7n!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 424w, https://substackcdn.com/image/fetch/$s_!mP7n!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 848w, 
https://substackcdn.com/image/fetch/$s_!mP7n!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 1272w, https://substackcdn.com/image/fetch/$s_!mP7n!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mP7n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png" width="1456" height="1112" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/afdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1112,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:918813,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/195658389?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!mP7n!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 424w, 
https://substackcdn.com/image/fetch/$s_!mP7n!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 848w, https://substackcdn.com/image/fetch/$s_!mP7n!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 1272w, https://substackcdn.com/image/fetch/$s_!mP7n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafdd8bbb-faf6-405a-8e77-ebe73556d566_1980x1512.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p><strong>Date:</strong> Thursday, May 7</p><p><strong>Time:</strong> 10am (Pacific Time)</p><div class="callout-block" data-callout="true"><p><a href="https://luma.com/f0annk18">Register here</a> &#8599;</p></div><h2>Upcoming Seminars</h2><p><strong>Week 2 &#8212; Qwen 3.5 (Alibaba, China):</strong> long-context scaling techniques and attention optimizations that improve practical context length.</p><p><strong>Week 3 &#8212; Nemotron 3 Super (NVIDIA, US):</strong> large-scale transformer systems, with emerging directions toward state-space alternatives like Mamba.</p><p><strong>Week 4 &#8212; DeepSeek-V4 (DeepSeek, China):</strong> a 1.6T-parameter MoE with Compressed Sparse Attention for 1M context and three test-time reasoning modes, advancing chain-of-thought at the open-source frontier, released just in time for this special seminar series. &#128521;</p><h2>Previous Seminars</h2><ul><li><p>&#128293; <a href="https://www.byhand.ai/p/recording-ppo-dpo-grpo-rubrics">PPO&#8594;DPO&#8594;GRPO&#8594;Rubrics</a> (2/26/2026)</p></li><li><p>&#128293; <a href="https://www.byhand.ai/p/openclaw-seminar">OpenClaw - 12 Stages of Evolution from the Transformer</a> (2/19/2026)</p></li><li><p>&#128293; <a href="https://www.byhand.ai/p/transformer-six-levels-of-understanding">Transformer - Six Levels of Understanding</a> (2/12/2026)</p></li><li><p><a href="https://www.byhand.ai/p/meta-superintelligence-labs-vs-facebook">Meta Superintelligence Labs vs Facebook AI Research</a> (2/5/2026)</p></li><li><p><a href="https://www.byhand.ai/p/ai-eval-equations-you-must-know">9 AI Eval Formulas</a> (1/29/2026)</p></li><li><p><a href="https://www.byhand.ai/p/google-ironwood-tpu-from-bits-to">Google Ironwood TPU: From Bits to HBM</a> (1/22/2026)</p></li><li><p><a href="https://www.byhand.ai/p/how-small-models-learn-tool-use">How AWS Uses Small Models Learn Tool 
Use</a> (1/15/2026)</p></li><li><p><a href="https://www.byhand.ai/p/seminar-foundation-attention-to-frontier">Attention</a> (1/15/2026)</p></li><li><p><a href="https://www.byhand.ai/p/manifold-constrained-hyper-connections">DeepSeek&#8217;s Manifold-Constrained Hyper Connection (mHC)</a> (1/8/2026)</p></li><li><p><a href="https://www.byhand.ai/p/introduction-to-gen-ai">Introduction to Generative AI</a> (1/8/2026)</p></li><li><p><a href="https://www.byhand.ai/p/gated-attention-neurips-2025-best">Gated Attention (NeurIPS 2025 Best Paper)</a> (12/16/2025)</p></li></ul>]]></content:encoded></item><item><title><![CDATA[Single vs Multi-Head Attention]]></title><description><![CDATA[Try the interactive version &#8599;]]></description><link>https://www.byhand.ai/p/library-models-attention-single-vs-multi-head</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-attention-single-vs-multi-head</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Mon, 27 Apr 2026 14:22:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!nZUr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nZUr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nZUr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!nZUr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nZUr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nZUr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nZUr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nZUr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!nZUr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nZUr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nZUr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5febdba8-34c3-45b2-acc1-172537199a20_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><div class="callout-block" data-callout="true"><p><strong><a href="https://library.byhand.ai/models/attention/single-vs-multi-head?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-attention-single-vs-multi-head">Try the interactive version &#8599;</a></strong></p></div><p><strong>One important fact about multi-head attention:</strong> it has the <em>same parameter count</em> as single-head attention. The difference is purely structural &#8212; same total Wqkv weights, partitioned into smaller q&#8211;k&#8211;v triples.</p><p>Look at the two diagrams. Both Wqkv matrices have the <em>same height</em> &#8212; same number of weight rows, same number of parameters. What changes is how that single tall block is sliced.</p><ul><li><p><strong>Left.</strong> One head. The full Wqkv produces one big QKV: a tall Q (36 rows), a tall K, a tall V. One scoring computation runs over those full-width tensors.</p></li><li><p><strong>Right.</strong> Three heads. The same-height Wqkv is sliced into 3 smaller q&#8211;k&#8211;v triples &#8212; each 12 rows tall. Three scoring computations run in parallel, each a thinner version of the left.</p></li></ul><p><strong>The compute trade-off &#8212; kind of.</strong> Same Wqkv weights. Multi-head runs the attention scoring S = K&#7488; &#215; Q once <em>per head</em>, so the dot-product count multiplies by H.</p><ul><li><p>Single-head: seq &#215; seq = 80&#178; = <strong>6400</strong> dot products</p></li><li><p>Multi-head: seq &#215; seq &#215; H = 80&#178; &#215; 3 = <strong>19200</strong> dot products (3&#215;)</p></li></ul><p>But each multi-head dot product is <em>narrower</em> &#8212; its inner dimension is head_dim instead of H &#215; head_dim. 
So when you count actual scalar multiplications, the totals are equal:</p><ul><li><p>Single-head: seq&#178; &#215; (H &#215; head_dim) = 80&#178; &#215; 36 = <strong>230400</strong></p></li><li><p>Multi-head:  seq&#178; &#215; H &#215; head_dim   = 80&#178; &#215; 3 &#215; 12 = <strong>230400</strong></p></li></ul><p>Same FLOPs. Multi-head buys you H independent attention patterns at no extra weight cost <em>and</em> no extra arithmetic cost &#8212; it's the same total compute, sliced into H finer-grained heads.</p>]]></content:encoded></item><item><title><![CDATA[Self Attention vs Cross Attention]]></title><description><![CDATA[Try the interactive version &#8599;]]></description><link>https://www.byhand.ai/p/library-models-attention-self-vs-cross</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-attention-self-vs-cross</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Sat, 25 Apr 2026 13:58:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!nE-O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nE-O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nE-O!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!nE-O!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nE-O!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nE-O!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nE-O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nE-O!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!nE-O!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nE-O!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nE-O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff65c1030-1b38-456b-860a-f02835c7e121_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><div class="callout-block" data-callout="true"><p><strong><a href="https://library.byhand.ai/models/attention/self-vs-cross?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-attention-self-vs-cross">Try the interactive version &#8599;</a></strong></p></div><p>Two attention mechanisms, side by side. Both project X into queries; both compute attention via S = K&#7488; &#215; Q and F = V &#215; A. The only difference is the source of K and V.</p><p><strong>Self attention</strong> uses X for everything. Q, K, and V all come from projecting X. Each X token attends to every X token, itself included. The score matrix S is square &#8212; 128 &#215; 128.</p><p><strong>Cross attention</strong> uses X for queries and a <em>second</em> sequence E for keys and values. Each X token attends to every E token instead. The score matrix S is rectangular &#8212; 64 &#215; 128.</p><p>Notice what's shared and what's not:</p><ul><li><p><strong>X is the same</strong> in both &#8212; same 36 &#215; 128 input.</p></li><li><p><strong>Q and K share the 16 dimension</strong> &#8212; that's what makes the dot product K&#7488; &#215; Q valid in either case.</p></li><li><p><strong>The V dimension is chosen independently</strong>: self-attention uses 12, and cross-attention also uses 12 here. The choice doesn't depend on which mechanism you're using; it depends on what output dimension your downstream layer expects.</p></li></ul>]]></content:encoded></item><item><title><![CDATA[Fine-Tuning: the series]]></title><description><![CDATA[8 interactive lessons &#183; click any card below to open the post]]></description><link>https://www.byhand.ai/p/library-series-models-fine-tuning</link><guid isPermaLink="false">https://www.byhand.ai/p/library-series-models-fine-tuning</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 16:05:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SAEx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>The Fine-Tuning series</h2><p>Fine-tuning is how you turn a general-purpose pretrained model into something that actually does your task &#8212; and getting it right means knowing which weights to update.</p><p>Fittingly, this series is itself a fine-tune: you bring what you already know about basic MLP neural networks, and each lesson specializes that foundation into one fine-tuning technique.</p><p>I teach this the way I teach my master's students &#8212; through higher-education metaphors. The pretrained network is someone who's finished their bachelor's.</p><p>Each of the eight lessons below shows one way to specialize further &#8212; retake every subject, refresh just the advanced course, add a certificate, pursue a PhD, invite a private tutor.</p><p>Pick any lesson. They form a sequence, but each stands on its own.</p><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-update">1. 
Weight Update</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SAEx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SAEx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SAEx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>When a neural network <em>learns</em>, what actually changes? Not the architecture &#8212; the shape of the network stays fixed. Not the inputs &#8212; those come from outside. What moves is the <strong>weights</strong>.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-update">https://www.byhand.ai/p/library-models-fine-tuning-weight-update</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune">2. 
Pretrain vs Fine-Tune</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xcvg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The previous lesson showed W + &#916;W = W1 as a single abstract step. That same step shows up in two very different settings &#8212; and the <em>setting</em> is what separates <strong>pretraining</strong> from <strong>fine-tuning</strong>.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune">https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning">3. 
Full Fine-Tuning</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cUah!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cUah!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cUah!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cUah!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cUah!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the previous lesson, fine-tuning meant updating one weight matrix. A real network has many &#8212; three layers in this example, billions of parameters in a production model. What does fine-tuning look like when you update <em>all</em> of them?</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning">https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">4. 
Freezing Layers</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DxsH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DxsH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DxsH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the previous lesson, full fine-tuning reviewed <em>every</em> prerequisite &#8212; Linear Algebra, Probability, Advanced ML &#8212; to refresh each subject with the latest topics. Effective, but exhausting.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">5. 
Linear Probe</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MjhH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MjhH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MjhH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">Freezing Layers</a>, we kept the foundational prerequisites fixed and refreshed only the advanced course on top &#8212; W3. But even refreshing <em>one</em> course is still a whole course. W3 is a full 40 &#215; 40 matrix &#8212; 1600 weights to update &#8212; weeks of lectures, assignments, and exams to work through. What if we don't re-take any existing course at all, and instead pick up a single new <strong>one-credit certificate</strong> &#8212; the kind you can finish in a month?</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">https://www.byhand.ai/p/library-models-fine-tuning-linear-probe</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-feature-head">6. 
Feature Extraction + Head</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d3O3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d3O3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d3O3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A <strong>feature head</strong> is a small trainable MLP bolted onto a frozen pretrained backbone. Think of it as pursuing a <strong>PhD</strong> on top of a master's degree. The master's &#8212; your pretrained backbone &#8212; stays exactly as it was, with no review. You aren't re-taking Linear Algebra or Probability; you're building something specialized on top of it: the PhD adds its own coursework, its own nonlinearity, and its own thesis layer.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-feature-head">https://www.byhand.ai/p/library-models-fine-tuning-feature-head</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">7. 
Adapter Layers</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x-T6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x-T6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x-T6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://arxiv.org/abs/1902.00751">Houlsby et al., 2019</a> proposed a different strategy: instead of choosing which layers to freeze, <strong>insert small trainable modules</strong> &#8212; adapters &#8212; between the frozen layers.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers</a></p><div><hr></div><h3><a href="https://www.byhand.ai/p/library-models-fine-tuning-lora-intro">8. 
LoRA</a></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jdzg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jdzg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg" width="728" height="409.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jdzg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">Adapters</a> showed that a low-rank bottleneck can specialize a frozen layer with far fewer parameters than a full &#916;W.</p><p>Read &#8594; <a href="https://www.byhand.ai/p/library-models-fine-tuning-lora-intro">https://www.byhand.ai/p/library-models-fine-tuning-lora-intro</a></p><div><hr></div>]]></content:encoded></item><item><title><![CDATA[LoRA]]></title><description><![CDATA[Fine-Tuning series &#183; 8 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-lora-intro</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-lora-intro</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:05:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!jdzg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  LoRA</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jdzg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jdzg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jdzg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jdzg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa067e4a3-5e48-4be9-97b3-d77fec3326eb_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">Adapters</a> showed that a low-rank bottleneck can specialize a frozen layer with far fewer parameters than a full &#916;W.</p><p><a href="https://arxiv.org/abs/2106.09685">LoRA (Hu et al., 2021)</a> takes the same low-rank idea and makes two changes:</p><p>1. <strong>Drop the nonlinearity.</strong></p><p>"Down-project + ReLU + up-project" collapses into a single low-rank matrix product: &#916; W = B A.</p><p>2. 
<strong>Add to the weight matrix, not between layers.</strong></p><p>The effective weight is W' = W + &#916;W = W + B A.</p><p>At inference, &#916;W <em>disappears</em>: it gets folded directly into W.</p><p>Below, the left column shows an adapter (a new module in the forward path). The right column shows LoRA: same low-rank idea, but expressed as a correction to the weight matrix.</p><p>The benefit of dropping the nonlinearity is that the two projections compose into one linear map: B(Ax) = (B A)x. That is why B and A can be merged into a single low-rank &#916;W matrix at inference time. Until then, B and A live alongside W as extra matrices.</p><p>During training, the forward pass keeps two separate paths, the frozen Wx and the low-rank B(Ax), and sums them. W receives no updates; gradients flow only into B and A.</p><p>At inference you merge them once and for all: compute the combined weight W' = W + B A, ship it, and discard the individual matrices. The resulting model has the <strong>exact same architecture and inference cost</strong> as the pretrained one. Same shape, same number of matrix multiplications, no extra latency, no extra memory.</p><p>An adapter can't do this. Its down-project&#8211;ReLU&#8211;up-project module stays in the forward path forever, and every inference call pays for those extra operations.</p><p>This is a big reason LoRA has become a default choice for fine-tuning large models. You can keep thousands of tiny (B, A) pairs, one per fine-tuning task and a few megabytes each, and swap them in and out of a shared frozen backbone without changing its runtime shape.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">Adapter Layers</a></p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p>
      <p>
          <a href="https://www.byhand.ai/p/library-models-fine-tuning-lora-intro">
              Read more
          </a>
      </p>
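<p>The merge described above can be checked numerically. What follows is a minimal sketch in plain Python, not code from the LoRA paper: the sizes d = 6 and r = 2, the random values, and the helper names (matmul, matadd, rand_matrix) are all assumptions made for illustration, and the &#945;/r scaling used in the paper is left out. It compares the two-path forward pass with the single merged weight.</p>

```python
import random


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*Y))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in X]


def matadd(X, Y):
    """Add two matrices of the same shape element by element."""
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(X, Y)]


def rand_matrix(rows, cols, rng):
    """Random matrix with entries in [-1, 1] (illustrative values only)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d, r = 6, 2  # toy hidden size and rank, assumed for this sketch

W = rand_matrix(d, d, rng)  # frozen pretrained weight
B = rand_matrix(d, r, rng)  # trainable up-projection (d x r)
A = rand_matrix(r, d, rng)  # trainable down-projection (r x d)
x = rand_matrix(d, 1, rng)  # one input column vector

# Training-time view: two paths, the frozen W x plus the low-rank B (A x).
y_two_paths = matadd(matmul(W, x), matmul(B, matmul(A, x)))

# Inference-time view: fold the update into the weight once, then use it alone.
W_merged = matadd(W, matmul(B, A))
y_merged = matmul(W_merged, x)

# Largest difference between the two outputs (floating-point rounding only).
max_gap = max(abs(a[0] - b[0]) for a, b in zip(y_two_paths, y_merged))

full_params = d * d      # weights in a full update of W
lora_params = 2 * d * r  # weights in B and A together
print(max_gap, full_params, lora_params)
```

<p>The merged matrix has the same d &#215; d shape as W, so nothing about the forward pass changes after the merge. At these toy sizes the parameter saving is small; it grows as d gets large relative to r.</p>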
   ]]></content:encoded></item><item><title><![CDATA[Adapter Layers]]></title><description><![CDATA[Fine-Tuning series &#183; 7 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:05:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!x-T6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Adapter Layers</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x-T6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x-T6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x-T6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!x-T6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!x-T6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac9a3d9-5e2d-430a-a514-d15466ba9330_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://arxiv.org/abs/1902.00751">Houlsby et al., 2019</a> proposed a different strategy: instead of choosing which layers to freeze, <strong>insert small trainable modules</strong> &#8212; adapters &#8212; between every frozen layer.</p><p>Each adapter has a down-projection (d &#8594; r), a nonlinearity, and an up-projection (r &#8594; d). 
The bottleneck dimension r is much smaller than d, so each adapter adds very few parameters. The pretrained weights stay completely frozen.</p><p>With bottleneck size r = 8 and hidden size d = 64, <strong>each adapter</strong> has only 2 &#215; r &#215; d = 1024 trainable parameters.</p><p><strong>Across the two layers</strong> of this network:</p><ul><li><p>The first layer projects from input to hidden, so W&#8321; has d &#215; input = 64 &#215; 32 = 2048 weights.</p></li><li><p>The second layer has d &#215; d = 64 &#215; 64 = 4096 weights.</p></li><li><p>Full fine-tuning would update all 2048 + 4096 = 6144 of them.</p></li><li><p>Adapter fine-tuning adds only 2 &#215; r &#215; d = 1024 new trainable weights, about a 6&#215; reduction (6144 / 1024).</p></li></ul><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-feature-head">Feature Extraction + Head</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-lora-intro">LoRA</a> &#8594;</p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p><div class="paywall-jump" data-component-name="PaywallToDOM"></div><p><strong><a href="https://library.byhand.ai/models/fine-tuning/adapter-layers?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-fine-tuning-adapter-layers">Try the interactive version &#8599;</a></strong></p>]]></content:encoded></item><item><title><![CDATA[Feature Extraction + Head]]></title><description><![CDATA[Fine-Tuning series &#183; 6 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-feature-head</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-feature-head</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:04:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!d3O3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Feature Extraction + Head</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d3O3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d3O3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d3O3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!d3O3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5be87eb3-855e-48b4-9601-70dd51f42619_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A <strong>feature head</strong> is a small trainable MLP bolted onto a frozen pretrained backbone. Think of it as pursuing a <strong>PhD</strong> on top of a master's degree. The master's &#8212; your pretrained backbone &#8212; stays exactly as it was, with no review. You aren't re-taking Linear Algebra or Probability; you're building something specialized on top of it: the PhD adds its own coursework, its own nonlinearity, and its own thesis layer.</p><p>This is one step richer than a <a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">linear probe</a>, which bolts on a single linear projection &#8212; like earning one certificate after the master's. 
Certificates are quick and cheap, but they can only form linear combinations of subjects you already know. A feature head, with multiple trainable layers, can form <em>nonlinear</em> connections and capture task-specific structure the probe can't.</p><p>And it's a tighter budget than <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">Freezing Layers</a>, where we still refreshed a top layer's weights directly. Here, every weight in the backbone is permanently frozen &#8212; no &#916;W anywhere. All trainable parameters live in the head.</p><p>In the diagram, the backbone (gray, dashed) extracts features without changing. The head (red border) is the trainable MLP: a nonlinear mapping from those frozen features to task predictions. This is the standard recipe in computer vision &#8212; take a pretrained ResNet or ViT, freeze it, and train a task-specific head on top.</p><p><strong>How much did we save?</strong></p><p>Full fine-tuning would update every weight in both the backbone and the head &#8212; W1 through W5:</p><p>64 &#215; 32 + 2 &#215; 64 &#215; 64 + 20 &#215; 64 + 10 &#215; 20 = 11720</p><p>parameters.</p><p>Freezing the backbone leaves only the head trainable &#8212; W4 and W5:</p><p>20 &#215; 64 + 10 &#215; 20 = 1480</p><p>parameters. 
That's about 7.9&#215; fewer weights to train (11720 / 1480). And because the backbone is shared, the same frozen model can support dozens of downstream tasks, each with its own tiny PhD head.</p><p>The next lesson takes the idea in a different direction: instead of bolting a head onto the <em>end</em>, we'll sprinkle small trainable modules, called adapter layers, <em>throughout</em> the network.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">Linear Probe</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-adapter-layers">Adapter Layers</a> &#8594;</p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p>
      <p>
          <a href="https://www.byhand.ai/p/library-models-fine-tuning-feature-head">
              Read more
          </a>
      </p>
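<p>The savings arithmetic above can be reproduced in a few lines of Python. This is only a sketch: the sizes come from the sums in the post, but which shape belongs to which of W1 through W3 is an assumption (the post gives only the totals), biases are ignored, and count_weights is a hypothetical helper name.</p>

```python
# Layer shapes as (rows, cols). The sizes come from the post's arithmetic;
# the split of the backbone into W1, W2, W3 is an assumption of this sketch.
backbone = {"W1": (64, 32), "W2": (64, 64), "W3": (64, 64)}
head = {"W4": (20, 64), "W5": (10, 20)}


def count_weights(layers):
    """Sum rows * cols over every weight matrix (biases are ignored)."""
    return sum(rows * cols for rows, cols in layers.values())


full = count_weights(backbone) + count_weights(head)  # full fine-tuning
head_only = count_weights(head)                       # frozen backbone
ratio = round(full / head_only, 2)

print(full, head_only, ratio)  # prints: 11720 1480 7.92
```

<p>Changing the entries in the head dictionary is a quick way to see how the trainable count moves when the head gets wider or deeper while the backbone stays fixed.</p>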
   ]]></content:encoded></item><item><title><![CDATA[Linear Probe]]></title><description><![CDATA[Fine-Tuning series &#183; 5 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-linear-probe</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-linear-probe</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:04:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!MjhH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Linear Probe</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MjhH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MjhH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MjhH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MjhH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!MjhH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92735086-d849-4373-b33b-66103769adb9_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">Freezing Layers</a>, we kept the foundational prerequisites fixed and refreshed only the advanced course on top &#8212; W3. But even refreshing <em>one</em> course is still a whole course. 
W3 is a full 40 &#215; 40 matrix &#8212; 1600 weights to update &#8212; weeks of lectures, assignments, and exams to work through. What if we don't re-take any existing course at all, and instead pick up a single new <strong>one-credit certificate</strong> &#8212; the kind you can finish in a month?</p><p>That's a <strong>linear probe</strong>. The master's degree &#8212; your pretrained network &#8212; stays exactly as it was, untouched. On top of it, you add one small linear layer: no homework for the old material, just a single combination of subjects you already know, tuned for the new task.</p><p>That new layer is Wn &#8212; a thin 10 &#215; 40 matrix, just 400 weights. That's 4&#215; fewer trainable parameters than refreshing W3 alone. Everything in W1, W2, and W3 is permanently frozen.</p><p>Why bother with something so tiny? Because it's also a <em>probe</em>: if one thin linear layer can already do the downstream task well, that tells you the pretrained features are rich enough on their own. If it can't, you know you need something bigger &#8212; and the pretrained features still got you most of the way.</p><p>A one-credit certificate is quick and cheap, but it can only form <em>linear</em> combinations of what you already know. The next lesson adds a whole PhD on top &#8212; a multi-layer head that can form nonlinear connections too.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">Freezing Layers</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-feature-head">Feature Extraction + Head</a> &#8594;</p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p>
      <p>
          <a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Freezing Layers]]></title><description><![CDATA[Fine-Tuning series &#183; 4 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:03:47 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!DxsH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Freezing Layers</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DxsH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DxsH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DxsH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DxsH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!DxsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58f5269f-35e7-4e04-bb7d-64a71494e968_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the previous lesson, full fine-tuning reviewed <em>every</em> prerequisite &#8212; Linear Algebra, Probability, Advanced ML &#8212; to refresh each subject with the latest topics. Effective, but exhausting.</p><p>Then you realize something. The prerequisites haven't actually changed that much. 
Linear Algebra is still Linear Algebra; the matrix decompositions you learned still hold. Probability is still Probability; the distributions and Bayes' rule haven't moved. Almost all the new material &#8212; the new ideas, the recent discoveries &#8212; lives in the <em>advanced</em> layer at the top.</p><p>That's <strong>freezing layers</strong>: keep the prerequisite layers fixed at their pretrained state, and only update the advanced one. In the diagram below, W1 and W2 &#8212; the foundational prerequisites &#8212; stay frozen. Only W3 &#8212; the layer closest to your task-specific output &#8212; gets a &#916;W.</p><p>Its update flows through the equation:</p><p>W'3 = W3 + &#916; W3</p><p>The right column shows the network after the update. W'1 and W'2 are identical to W1 and W2 &#8212; nothing changed there. Only W'3 is new.</p><p><strong>How much did we save?</strong></p><p>Full fine-tuning would train all three layers:</p><p>40 &#215; 30 + 40 &#215; 40 + 40 &#215; 40 = 4400</p><p>parameters.</p><p>Freezing layers 1 and 2 leaves only:</p><p>40 &#215; 40 = 1600</p><p>parameters trainable. The other 2800 parameters are skipped: no &#916;W to compute, no gradient, and no update to store.</p><p>For a real-world model with billions of parameters, freezing the first 80% of the network leaves only 20% of the weights trainable, so there are roughly 5&#215; fewer gradients and optimizer states to compute and keep in memory.</p><p>There's a second, subtler benefit. The frozen prerequisites can't drift, so the model can't <em>forget</em> its foundations &#8212; Linear Algebra stays Linear Algebra. The failure this guards against has a name: <strong>catastrophic forgetting</strong>, where training on the new task overwrites old knowledge. 
Freezing W1 and W2 rules it out for those layers simply by not letting their weights move; only W3 can still drift.</p><p>The next lesson takes the idea even further: freeze <em>all</em> the prerequisites and bolt a brand-new advanced course on at the end.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning">Full Fine-Tuning</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-linear-probe">Linear Probe</a> &#8594;</p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p>
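<p>The savings arithmetic from this lesson can be reproduced with a short Python sketch. The shapes come from the three-layer example; the names <code>shapes</code>, <code>frozen</code>, <code>count</code>, and <code>apply_update</code> are hypothetical helpers invented for this illustration, not part of any library.</p>

```python
# Layer shapes from the lesson's example, stored as (rows, cols).
shapes = {"W1": (40, 30), "W2": (40, 40), "W3": (40, 40)}
frozen = {"W1", "W2"}  # the prerequisites stay at their pretrained values

def count(names):
    """Total number of weights in the named layers."""
    return sum(shapes[n][0] * shapes[n][1] for n in names)

total = count(shapes)
trainable = count(n for n in shapes if n not in frozen)
skipped = total - trainable

def apply_update(weights, deltas):
    """Return W' = W + delta for trainable layers; frozen layers pass through as-is."""
    updated = {}
    for name, W in weights.items():
        if name in frozen:
            updated[name] = W  # W'1 and W'2 are identical to W1 and W2
        else:
            D = deltas[name]
            updated[name] = [[w + d for w, d in zip(rw, rd)] for rw, rd in zip(W, D)]
    return updated

print(total, trainable, skipped)  # 4400 1600 2800
```

<p>Note that <code>deltas</code> only needs an entry for W3. The frozen layers never ask for one, which is the saving in miniature.</p>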
      <p>
          <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Full Fine-Tuning]]></title><description><![CDATA[Fine-Tuning series &#183; 3 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:02:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!cUah!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Full Fine-Tuning</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cUah!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cUah!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cUah!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cUah!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cUah!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cUah!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!cUah!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf22c61-9447-4c4e-90ac-6a5c8a9731d0_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the previous lesson, fine-tuning meant updating one weight matrix. A real network has many &#8212; three layers in this example, billions of parameters in a production model. 
What does fine-tuning look like when you update <em>all</em> of them?</p><p>That's <strong>full fine-tuning</strong>: continue training every weight in the pretrained network on your new task. Every layer's W gets its own &#916;W. Nothing is frozen &#8212; every parameter is in play.</p><p>Think of an MLP as a chain of prerequisites leading to an advanced course. Layer 1 might be <em>Linear Algebra</em>, layer 2 <em>Probability</em>, layer 3 <em>Advanced Machine Learning</em> &#8212; each one building on what came before.</p><p>Fine-tuning is what happens during graduate study: the foundations are already there from undergrad, so you're not re-learning. <strong>Full fine-tuning</strong> is reviewing <em>every</em> prerequisite to see what new topics have appeared and what discoveries the field has made since the last time you sat through them. Effective &#8212; but exhausting.</p><p>This diagram shows the same three-layer MLP twice, side by side.</p><p>On the <strong>left</strong>, the pretrained network runs on input X: three weight matrices W&#8321;, W&#8322;, W&#8323;, each followed by a ReLU activation.</p><p>In the <strong>middle</strong>, one update equation per layer:</p><p>W'i = Wi + &#916; Wi</p><p>On the <strong>right</strong>, the same network after the update &#8212; same shape, same flow, but every weight is now its updated version W'i. The arrows trace each Wi flowing down into its update, and each W'i flowing back up into its layer on the right.</p><p>Full fine-tuning gives the model the most freedom to specialize. Every parameter can move &#8212; and every parameter that can move must be stored.</p><p>In this small example, the trainable count is 40 &#215; 30 + 40 &#215; 40 + 40 &#215; 40 = 4400 values. For a production model with billions of parameters, fine-tuning for one task means saving a billion-parameter &#916;W set. Ten downstream tasks means ten complete model copies.</p><p>But not every prerequisite needs revisiting. 
The further you go back in the chain, the less the material has changed since pretraining &#8212; the linear-algebra basics under your computer-vision course are largely the same as they ever were. The next lesson does exactly that: freeze the prerequisites that haven't moved, and only refresh the advanced one closest to your specialization.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune">Pretrain vs Fine-Tune</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-frozen-layers">Freezing Layers</a> &#8594;</p><p><strong>Paid subscribers:</strong> open the interactive diagram below &#8595;</p><p><strong><a href="https://library.byhand.ai/models/fine-tuning/full-fine-tuning?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-fine-tuning-full-fine-tuning">Try the interactive version &#8599;</a></strong></p>]]></content:encoded></item><item><title><![CDATA[Pretrain vs Fine-Tune]]></title><description><![CDATA[Fine-Tuning series &#183; 2 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:02:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Xcvg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Pretrain vs Fine-Tune</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xcvg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Xcvg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xcvg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced595a9-ecd3-450a-a720-3de05148a2ce_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The previous lesson showed W + &#916; W = W1 as a single abstract step. That same step shows up in two very different settings &#8212; and the <em>setting</em> is what separates <strong>pretraining</strong> from <strong>fine-tuning</strong>.</p><p>In <strong>pretraining</strong>, the dataset D is massive and general-purpose &#8212; say 200 examples per batch, multiplied across billions of steps. Each &#916;W is tiny, but the cumulative effect is a weight matrix that has absorbed broad patterns of the data: W is now the model's foundational knowledge.</p><p>In <strong>fine-tuning</strong>, the dataset is much smaller &#8212; maybe just 12 task-specific examples. 
You don't want to throw out everything W has learned, and 12 examples is nowhere near enough to relearn it from scratch anyway. You want to add a small specialization on top.</p><p>I once told my master's students, jokingly, <em>"You are all fine-tuning here."</em> You don't go back to re-learn calculus &#8212; you build on the undergraduate foundation and specialize. Fine-tuning does the same for a model: keep the foundational W, and add a small correction on top.</p><p>Concretely, freeze W and run task data X through it:</p><p>F = W &#215; X</p><p>The output F is reasonable but generic &#8212; W has never seen your task, so the prediction is whatever a general-purpose model would say. To specialize, we add an adjustment &#8212; a new matrix <strong>&#916;W</strong>, same shape as W (40 &#215; 40), trained <em>only</em> on the task data while W stays put:</p><p>W1 = W + &#916; W</p><p>Now the forward pass uses the fine-tuned weight:</p><p>F1 = W1 &#215; X = (W + &#916; W) &#215; X</p><p>&#916;W is the entire artifact of fine-tuning &#8212; the per-task adjustment learned from 12 examples, layered on top of months or years of pretraining.</p><div><hr></div><p>&#8592; Previous: <a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-update">Weight Update</a>  |  <a href="https://www.byhand.ai/p/library-models-fine-tuning-full-fine-tuning">Full Fine-Tuning</a> &#8594;</p><p><strong><a href="https://library.byhand.ai/models/fine-tuning/weight-fine-tune?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-fine-tuning-weight-fine-tune">Try the interactive version &#8599;</a></strong></p>]]></content:encoded></item><item><title><![CDATA[Weight Update]]></title><description><![CDATA[Fine-Tuning series &#183; 1 of 8]]></description><link>https://www.byhand.ai/p/library-models-fine-tuning-weight-update</link><guid isPermaLink="false">https://www.byhand.ai/p/library-models-fine-tuning-weight-update</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Fri, 24 Apr 2026 15:01:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SAEx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>Library  &#8250;  Models  &#8250;  Fine-Tuning  &#8250;  Weight Update</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SAEx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SAEx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg" width="728" height="409.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;captionedImage&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SAEx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SAEx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10f4be58-fad4-4694-9048-915bcfce5820_2400x1800.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When a neural network <em>learns</em>, what actually changes? Not the architecture &#8212; the shape of the network stays fixed. Not the inputs &#8212; those come from outside. What moves is the <strong>weights</strong>.</p><p>A training step takes the current weight matrix W0, computes a small nudge &#916; W from a batch of examples, and adds the two together to produce the new weights:</p><p>W1 = W0 + &#916; W</p><p>This single equation &#8212; <em>add a correction to the existing weights</em> &#8212; underlies everything in this chapter. 
Pretraining, full fine-tuning, frozen layers, adapters, LoRA: they're all variations on who gets a &#916;W and how big that &#916;W is allowed to be.</p><p>The diagram below shows it for one layer. Before the step, the forward pass uses the old weight:</p><p>F0 = W0 &#215; X</p><p>After the step, the <em>same</em> input flows through the <em>new</em> weight:</p><p>F1 = W1 &#215; X</p><p>&#916;W always has the same shape as W0 &#8212; for a 40 &#215; 30 weight matrix, &#916;W also has 1200 values, and every one of them is a free parameter learned by gradient descent.</p><p>Repeat this step billions of times across a massive general-purpose corpus, and you get a <strong>pretrained</strong> model: weights that encode broad patterns of language, code, and reasoning.</p><p>The same W + &#916; W step can serve very different goals. When the dataset is huge and the goal is to absorb general patterns, we call it <strong>pretraining</strong>. When the dataset is small and the goal is to specialize an already-pretrained model, we call it <strong>fine-tuning</strong>. The next lesson puts those two side by side.</p><div><hr></div><p><a href="https://www.byhand.ai/p/library-models-fine-tuning-weight-fine-tune">Pretrain vs Fine-Tune</a> &#8594;</p><p></p><p><strong><a href="https://library.byhand.ai/models/fine-tuning/weight-update?embed=1&amp;back=https%3A%2F%2Fwww.byhand.ai%2Fp%2Flibrary-models-fine-tuning-weight-update">Try the interactive version &#8599;</a></strong></p>]]></content:encoded></item><item><title><![CDATA[The Library is Live! ~ check your access]]></title><description><![CDATA[The AI by Hand &#9997;&#65039; Library is live!]]></description><link>https://www.byhand.ai/p/the-library-is-live-check-your-access</link><guid isPermaLink="false">https://www.byhand.ai/p/the-library-is-live-check-your-access</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Mon, 13 Apr 2026 19:07:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!03P9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>The AI by Hand &#9997;&#65039; Library is live!</strong></p><p>As a paid member of the AI by Hand Academy, you now have full access to the Library&#8217;s interactive models, exercises, and calculations that bring AI concepts to life.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!03P9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!03P9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 424w, https://substackcdn.com/image/fetch/$s_!03P9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 848w, https://substackcdn.com/image/fetch/$s_!03P9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 1272w, https://substackcdn.com/image/fetch/$s_!03P9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!03P9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png" width="1456" height="1198" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1198,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:450537,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/194107491?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!03P9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 424w, https://substackcdn.com/image/fetch/$s_!03P9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 848w, https://substackcdn.com/image/fetch/$s_!03P9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 1272w, 
https://substackcdn.com/image/fetch/$s_!03P9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2673f5d3-3e7d-4722-9959-61dc9948e1a5_2056x1692.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8230;</p>
      <p>
          <a href="https://www.byhand.ai/p/the-library-is-live-check-your-access">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Two New Interactive Workbooks ~ Matmul & Linear Layer]]></title><description><![CDATA[This week, I continued building the interactive version of my popular Deep Learning Math Workbook.]]></description><link>https://www.byhand.ai/p/two-new-interactive-workbooks-matmul</link><guid isPermaLink="false">https://www.byhand.ai/p/two-new-interactive-workbooks-matmul</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Fri, 10 Apr 2026 23:25:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!cMfl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>This week, I continued building the interactive version of my popular <a href="https://www.amazon.com/Deep-Learning-Math-Workbook-mathematical/dp/1806674777">Deep Learning Math Workbook</a>. </p><p>I just completed two new workbooks, Matrix Multiplication and Linear Layer, each with 25 exercises.
</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cMfl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cMfl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 424w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 848w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 1272w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cMfl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png" width="1456" height="1057" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1057,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:133933,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/193844429?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cMfl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 424w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 848w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 1272w, https://substackcdn.com/image/fetch/$s_!cMfl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff87aa899-a2aa-4d5d-9f88-f854337b4461_1890x1372.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Linear Layer</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yYqK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yYqK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 424w, 
https://substackcdn.com/image/fetch/$s_!yYqK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 848w, https://substackcdn.com/image/fetch/$s_!yYqK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 1272w, https://substackcdn.com/image/fetch/$s_!yYqK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yYqK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png" width="1456" height="1059" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1059,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:200883,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/193844429?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!yYqK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 424w, https://substackcdn.com/image/fetch/$s_!yYqK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 848w, https://substackcdn.com/image/fetch/$s_!yYqK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 1272w, https://substackcdn.com/image/fetch/$s_!yYqK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0849866e-2b74-4e9a-af65-bb4bc23882f3_1872x1362.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#128073; <a href="https://aibyhand-library-beta.pages.dev/workbooks">Check out these two workbooks</a></p><p>(Available as a limited-time free preview for newsletter subscribers.)</p><p>~ Prof. Tom Yeh</p>]]></content:encoded></item><item><title><![CDATA[My Deep Learning Math Workbook — Now Interactive]]></title><description><![CDATA[Happy to share something new: an interactive edition of my popular Deep Learning Math Workbook.]]></description><link>https://www.byhand.ai/p/my-deep-learning-math-workbook-now</link><guid isPermaLink="false">https://www.byhand.ai/p/my-deep-learning-math-workbook-now</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Thu, 02 Apr 2026 19:14:26 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SIUD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Happy to share something new: an interactive edition of my popular <a href="https://www.amazon.com/Deep-Learning-Math-Workbook-mathematical/dp/1806674777">Deep Learning Math Workbook</a>.</p><p>This is a proof-of-concept prototype with the first 15 exercises of Chapter 1: Dot Product. 
You fill in the blanks, pick from multiple choices, and check your answers &#8212; all in your browser.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SIUD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SIUD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 424w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 848w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 1272w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SIUD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png" width="1456" height="1066" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1066,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:110106,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192984949?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SIUD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 424w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 848w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 1272w, https://substackcdn.com/image/fetch/$s_!SIUD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86d14166-7b3b-495d-bae3-bc051e298448_1880x1376.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#128073; <a href="https://aibyhand-library-beta.pages.dev/workbooks/dot-product/1">Give it a try</a></p><p>Let me know what you think.</p><p>I still believe pen and paper is the best way to learn math. There&#8217;s something about the focus and concentration that a blank page demands. But many of you have asked for an interactive option, and I think there&#8217;s real value in being able to test yourself and get immediate feedback.</p><p>~ Prof. 
Tom Yeh</p>]]></content:encoded></item><item><title><![CDATA[AI by Hand Library ~ Attention, MHA, MQA, GQA]]></title><description><![CDATA[I&#8217;m building a library of interactive flow diagrams for the paid members of the AI by Hand Academy.]]></description><link>https://www.byhand.ai/p/ai-by-hand-library-attention-mha</link><guid isPermaLink="false">https://www.byhand.ai/p/ai-by-hand-library-attention-mha</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Tue, 31 Mar 2026 15:23:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!z3Tj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I&#8217;m building a library of interactive flow diagrams for the paid members of the AI by Hand Academy. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!z3Tj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z3Tj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 424w, https://substackcdn.com/image/fetch/$s_!z3Tj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 848w, 
https://substackcdn.com/image/fetch/$s_!z3Tj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 1272w, https://substackcdn.com/image/fetch/$s_!z3Tj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z3Tj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png" width="1456" height="804" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:804,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:430912,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192739256?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z3Tj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 424w, 
https://substackcdn.com/image/fetch/$s_!z3Tj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 848w, https://substackcdn.com/image/fetch/$s_!z3Tj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 1272w, https://substackcdn.com/image/fetch/$s_!z3Tj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F712c20d3-8f75-413c-8304-e2aa1173e620_3034x1676.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The first collection covers <strong>attention</strong> with seven diagrams in one learning path:</p><ol><li><p><strong>QKV Projection</strong>: wher&#8230;</p></li></ol>
      <p>
          <a href="https://www.byhand.ai/p/ai-by-hand-library-attention-mha">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Self Attention Flow ~ New Release!]]></title><description><![CDATA[Building upon Matmul Flow, which I released two weeks ago, I&#8217;d like to share with you Self-Attention Flow &#8212; my attempt to turn the self-attention diagram I&#8217;ve drawn by hand &#9997;&#65039; over a hundred times in class into an interactive tool you can explore yourself.]]></description><link>https://www.byhand.ai/p/self-attention-flow-new-release</link><guid isPermaLink="false">https://www.byhand.ai/p/self-attention-flow-new-release</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Sat, 28 Mar 2026 00:08:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!GNIU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Building upon <a href="https://matmul-flow.pages.dev/">Matmul Flow</a>, which I released two weeks ago, I&#8217;d like to share with you <a href="https://flow.byhand.ai/self-attention">Self-Attention Flow</a> &#8212; my attempt to turn the self-attention diagram I&#8217;ve drawn by hand &#9997;&#65039; over a hundred times in class into an interactive tool you can explore yourself.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ESVC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ESVC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 424w, 
https://substackcdn.com/image/fetch/$s_!ESVC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 848w, https://substackcdn.com/image/fetch/$s_!ESVC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 1272w, https://substackcdn.com/image/fetch/$s_!ESVC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ESVC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png" width="1456" height="1133" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1133,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:150833,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192359428?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" 
srcset="https://substackcdn.com/image/fetch/$s_!ESVC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 424w, https://substackcdn.com/image/fetch/$s_!ESVC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 848w, https://substackcdn.com/image/fetch/$s_!ESVC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 1272w, https://substackcdn.com/image/fetch/$s_!ESVC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d872719-d8d1-4ca9-bcd6-23bc5a40be75_1686x1312.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Self-attention is the core mechanism behind Transformers, yet it&#8217;s almost always explained through equations and code. What gets lost is the spatial relationship between matrices &#8212; how dimensions propagate, why certain shapes must match. You can&#8217;t see that Q and K must be the same size but V doesn&#8217;t have to be, or that the score matrix S and attention matrix A are always square, just by reading A = softmax(K&#7488;Q / &#8730;d).</p><p>Think you already know self-attention inside and out and don&#8217;t need this tool? Here are three questions to test your knowledge:</p><ol><li><p>Does increasing the <strong>model size</strong> (X&#8217;s height) change the shape of the score matrix S?</p></li><li><p>Does increasing the <strong>sequence length</strong> (X&#8217;s width) change the size of Wq, Wk, or Wv?</p></li><li><p>Does increasing the <strong>key dimension</strong> (K&#8217;s height) change the size of the score matrix S = K&#7488;Q?</p></li></ol><p>Use the sliders to change these values and observe what happens before you read the answer key below.</p><p></p><p></p><p></p><p>[space intentionally left blank so you don&#8217;t read the answers right away]</p><p></p><p></p><p></p><p></p><p></p><h2>Answers:</h2><h3>Model Size</h3><p>Q: Does increasing the model size (X&#8217;s height) change the shape of the score matrix S?</p><p>A: No. Drag the Model Size slider and watch: X gets taller, the weight matrices Wq, Wk, Wv grow wider &#8212; but S stays exactly the same size. The score matrix S is always sequence length &#215; sequence length, regardless of model size. 
Model size controls how many features each token carries, but the size of the attention pattern, which tokens attend to which, depends only on sequence length. You can see that nothing on the right side changes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tcIB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tcIB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tcIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif" width="1200" height="800" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/be03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:158148,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192359428?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tcIB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!tcIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe03013d-15dd-4068-a4c6-4c2bef55c9d5_1200x800.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Sequence Length</h3><p>Q: Does increasing the sequence length (X&#8217;s width) change the size of Wq, Wk, or Wv?</p><p>A: No. Increasing sequence length does not change any weight matrix. The number of trainable parameters stays the same. That&#8217;s why, when you hear that a model has increased its context length from 5K to 5M, the &#8220;size&#8221; of the model does not change. 
However, the time to compute the attention matrices (S and A) grows quadratically with sequence length.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GNIU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GNIU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GNIU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif" width="1200" height="800" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:162056,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192359428?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GNIU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!GNIU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d026fe9-4fc0-48cb-9979-5bbf66350fff_1200x800.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Key Dimension</h3><p>Q: Does increasing the key dimension (K&#8217;s height) change the size of the score matrix S = K&#7488;Q?</p><p>A: No. As you increase the key dimension, query and key vectors grow longer, but the attention score matrix (S) and attention weight matrix (A) do not change size. The score matrix is always sequence length &#215; sequence length, regardless of key dimension. 
However, each value in it, a dot product, takes more work to compute because the key and query vectors are longer.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!awyx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!awyx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!awyx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!awyx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!awyx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!awyx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif" width="1200" height="800" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:162461,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/192359428?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!awyx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 424w, https://substackcdn.com/image/fetch/$s_!awyx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 848w, https://substackcdn.com/image/fetch/$s_!awyx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 1272w, https://substackcdn.com/image/fetch/$s_!awyx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c94e0b-3049-4ab5-9ac5-a4da13caeacb_1200x800.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p><a href="https://flow.byhand.ai/self-attention">Give Self Attention Flow a try.</a> Resize any matrix in the diagram and watch the dimensions &#8220;flow&#8221; to other matrices in the self-attention mechanism.</p>]]></content:encoded></item><item><title><![CDATA[Matmul Flow ~ New Visualization Tool]]></title><description><![CDATA[AI by Hand &#9997;&#65039; Academy]]></description><link>https://www.byhand.ai/p/matmul-flow-new-visualization-tool</link><guid isPermaLink="false">https://www.byhand.ai/p/matmul-flow-new-visualization-tool</guid><dc:creator><![CDATA[Prof. 
Tom Yeh]]></dc:creator><pubDate>Tue, 17 Mar 2026 14:26:26 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!3FCV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I built a new interactive tool called <a href="https://matmul-flow.pages.dev/">MatmulFlow</a> to visualize the dimensions of matrix multiplications. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3FCV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3FCV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 424w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 848w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 1272w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!3FCV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png" width="1456" height="1250" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1250,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:291482,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3FCV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 424w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 848w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 1272w, https://substackcdn.com/image/fetch/$s_!3FCV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F91e29dbf-f109-439a-9a74-3ba0ed0e1358_1978x1698.png 1456w" 
sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Matmul&#8217;s Dimensions are Confusing</h2><p>If you&#8217;ve worked with matrix multiplication (matmul), you know the rule: if W&#8217;s shape is (<em>m</em>, <em>n</em>) and X&#8217;s shape is (<em>n</em>, <em>p</em>), then the shape of F = WX must be (<em>m</em>, <em>p</em>). The inner dimensions must match, and the outer dimensions give you the shape of the result. It&#8217;s simple enough to state. But in practice, I struggled with it as a student, and I&#8217;ve watched my students stumble over it again and again. Which one is the inner dimension? 
Is it the columns of the first or the rows of the second?</p><p>And when you chain multiple multiplications together, it gets worse.</p><p>Consider W&#8325;(W&#8324;(W&#8323;(W&#8322;(W&#8321;X)))). What happens if we increase W&#8323;&#8217;s width by 40?</p><p>Consider ((((WX&#8321;)X&#8322;)X&#8323;)X&#8324;)X&#8325;. What happens if X&#8322;&#8217;s height increases from 150 to 300?</p><p>This is when my brain starts to hurt. But remember these questions for now, because we will revisit them using <a href="https://matmul-flow.pages.dev/">MatmulFlow</a> later.</p><p>The problem with the standard notation, I think, is that it is too abstract. You&#8217;re staring at letters &#8212; W, X, F, m, n, p &#8212; trying to keep track of which belongs to which. There&#8217;s nothing to see.</p><h2>Just Shift the Second Matrix Up</h2><p>Several years ago, I came up with a visual method that makes the shapes of matrix multiplication obvious at a glance. The method also extends to complex chains of multiplications. No memorization needed. Here&#8217;s how it works.</p><p>Take your two matrices, W and X, and represent them as rectangles whose width and height correspond to the number of columns and rows, respectively. 
Now take the second matrix X and shift it up, placing it above and to the right of W.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wfcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wfcP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 424w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 848w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 1272w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wfcP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png" width="1344" height="906" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:906,&quot;width&quot;:1344,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:52604,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wfcP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 424w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 848w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 1272w, https://substackcdn.com/image/fetch/$s_!wfcP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910fc0de-392e-46e9-a4c0-3df214d3672e_1344x906.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Look at what happens. The right edge of W lines up with the bottom edge of X. That alignment is the inner dimension &#8212; the one that has to match for the multiplication to be valid. And the result F = WX simply fills in the remaining rectangle below X and to the right of W. It takes its height from W and its width from X.</p><p>The constraint that felt abstract &#8212; &#8220;the inner dimensions must match&#8221; &#8212; becomes geometric and obvious. If W&#8217;s right edge doesn&#8217;t line up with X&#8217;s bottom edge, the multiplication can&#8217;t happen. You don&#8217;t need to remember the rule. You can see it.</p><h2>Chain of Matmuls</h2><p>This visualization method extends naturally to chains of matrix multiplications. 
Each layer multiplies the output F of the previous layer by a new weight matrix W.</p><p>In <a href="https://matmul-flow.pages.dev/">MatmulFlow</a>, stacking layers vertically means multiplying on the left. The output F&#8321; of the first layer feeds down to become X&#8322; for the next layer, giving F&#8322; = W&#8322; (W&#8321;X&#8321;). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kA47!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kA47!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 424w, https://substackcdn.com/image/fetch/$s_!kA47!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 848w, https://substackcdn.com/image/fetch/$s_!kA47!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 1272w, https://substackcdn.com/image/fetch/$s_!kA47!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kA47!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png" width="1388" height="1334" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1334,&quot;width&quot;:1388,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:87894,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kA47!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 424w, https://substackcdn.com/image/fetch/$s_!kA47!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 848w, https://substackcdn.com/image/fetch/$s_!kA47!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 1272w, https://substackcdn.com/image/fetch/$s_!kA47!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0755916d-54f6-4b1a-a27e-3bdb49015e51_1388x1334.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The same visual logic applies in the other direction.  Stacking layers horizontally means multiplying on the right. 
The output F&#8321; feeds across to become W&#8322; for the next layer, giving F&#8322; = (W&#8321;X&#8321;)X&#8322;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7wRK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7wRK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 424w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 848w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 1272w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7wRK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png" width="1456" height="522" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:522,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:91041,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7wRK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 424w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 848w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 1272w, https://substackcdn.com/image/fetch/$s_!7wRK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8556054-7546-4066-abe8-7dc6f1e87405_2328x834.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Let&#8217;s revisit W&#8325;(W&#8324;(W&#8323;(W&#8322;(W&#8321;X)))) and ((((W&#8321;X&#8321;)X&#8322;)X&#8323;)X&#8324;)X&#8325;. If we apply the same logic to visualize them, we can immediately see how they differ. 
The first expression stacks vertically, while the second expression stacks horizontally.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HpHD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HpHD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 424w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 848w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 1272w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HpHD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png" width="1456" height="511" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:511,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:276876,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HpHD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 424w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 848w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 1272w, https://substackcdn.com/image/fetch/$s_!HpHD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b8b8127-8077-4439-a78f-900e2d612e25_4960x1740.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h2>Resizing a Matrix</h2><p>Back to our earlier questions: what happens if we increase W&#8323;&#8217;s height by 40? 
</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yFQL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yFQL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 424w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 848w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 1272w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yFQL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png" width="1456" height="1345" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1345,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:155797,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yFQL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 424w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 848w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 1272w, https://substackcdn.com/image/fetch/$s_!yFQL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c3e2af1-1383-46a8-8257-6698d8d1e05d_1860x1718.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>What happens if X&#8322;&#8217;s height increases from 150 to 300?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0mcr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0mcr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 424w, 
https://substackcdn.com/image/fetch/$s_!0mcr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 848w, https://substackcdn.com/image/fetch/$s_!0mcr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 1272w, https://substackcdn.com/image/fetch/$s_!0mcr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0mcr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png" width="1456" height="769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:769,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:217435,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0mcr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 
424w, https://substackcdn.com/image/fetch/$s_!0mcr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 848w, https://substackcdn.com/image/fetch/$s_!0mcr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 1272w, https://substackcdn.com/image/fetch/$s_!0mcr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff836945f-3881-454d-a10d-9e450dffd3c7_1866x986.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://matmul-flow.pages.dev/">Give MatmulFlow a try</a>. Resize any matrix in the chain and watch the dimensions &#8220;flow&#8221; to other matrices in the chain in both directions.</p><h2>Mastering Matrix Multiplication</h2><p>If you want to commit yourself to mastering matrix multiplication, the paper version of my Matrix Multiplication Workbook was recently released on Amazon. <a href="https://www.amazon.com/Matrix-Multiplication-Workbook-Fill-Blank/dp/1807609677/">Click this link to take a look.</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eVZH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eVZH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 424w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 848w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 1272w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!eVZH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png" width="1456" height="982" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:982,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:407830,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/191248822?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eVZH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 424w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 848w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 1272w, https://substackcdn.com/image/fetch/$s_!eVZH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F397dfccb-1067-4ac6-85bc-bda37b5c7f38_1548x1044.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If you are a member of the <a href="https://byhand.ai/academy">AI by Hand Academy</a>, just a reminder that you already have access to the <a href="https://www.byhand.ai/about#&#167;workbook-matrix-multiplication">electronic version of this workbook.</a></p>]]></content:encoded></item><item><title><![CDATA[SwiGLU: The Activation Function Behind Frontier AI]]></title><description><![CDATA[Essential AI Math Excel Blueprints]]></description><link>https://www.byhand.ai/p/swiglu-the-activation-function-behind</link><guid 
isPermaLink="false">https://www.byhand.ai/p/swiglu-the-activation-function-behind</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Tue, 10 Mar 2026 17:54:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mYcl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6wMJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6wMJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 424w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 848w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 1272w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!6wMJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png" width="1456" height="188" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:188,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:94717,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6wMJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 424w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 848w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 1272w, https://substackcdn.com/image/fetch/$s_!6wMJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c036b7d-0384-4aef-9a17-ae92d0f793a1_2062x266.png 1456w" sizes="100vw" 
fetchpriority="high"></picture><div></div></div></a></figure></div><p>SwiGLU has become a standard activation in many frontier large language models because it increases expressive power without adding much structural complexity. It combines GLU-style input-dependent gating with Swish-style amplification: the gate allows features to be selectively passed or suppressed based on the current input, while the multiplicative Swish behavior lets strong signals be amplified rather than merely filtered.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mYcl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mYcl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 424w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 848w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 1272w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!mYcl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png" width="1332" height="1270" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1270,&quot;width&quot;:1332,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:103010,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F273a9805-f57b-4f23-ab59-4573c37c1a46_1332x1436.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mYcl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 424w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 848w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 1272w, https://substackcdn.com/image/fetch/$s_!mYcl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F175cebff-5859-42f1-9d4e-099fe1298181_1332x1270.png 1456w" 
sizes="100vw"></picture></div></a></figure></div><p>In contrast, <a href="https://www.byhand.ai/p/glu-gated-linear-unit">GLU</a> uses the same dynamic gating mechanism to allow identical features to have different activation values, but it does not amplify them.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cMhO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png" data-component-name="Image2ToDOM"><div 
class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cMhO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 424w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 848w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 1272w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cMhO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png" width="1348" height="1250" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1250,&quot;width&quot;:1348,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:117950,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cMhO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 424w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 848w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 1272w, https://substackcdn.com/image/fetch/$s_!cMhO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58f9303-4e35-46c6-ac47-53cd92794ad5_1348x1250.png 1456w" sizes="100vw"></picture></div></a></figure></div>
<p>Furthermore, <a href="https://www.byhand.ai/p/swish-silu">Swish</a> uses a static gating mechanism: the gate is computed from the feature value itself, so identical features always receive identical activation values, no matter which input produced them.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LY0k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LY0k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 424w, 
https://substackcdn.com/image/fetch/$s_!LY0k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 848w, https://substackcdn.com/image/fetch/$s_!LY0k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 1272w, https://substackcdn.com/image/fetch/$s_!LY0k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LY0k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png" width="1322" height="1160" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1160,&quot;width&quot;:1322,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:103289,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!LY0k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 424w, https://substackcdn.com/image/fetch/$s_!LY0k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 848w, https://substackcdn.com/image/fetch/$s_!LY0k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 1272w, https://substackcdn.com/image/fetch/$s_!LY0k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79a839cb-6eba-46c2-bf9c-13e5bcdc50bf_1322x1160.png 1456w" sizes="100vw"></picture></div></a></figure></div>
<p>Going even further back, <a href="https://www.byhand.ai/p/relu">ReLU</a> represents the extreme case of static activation: a hard cutoff at zero with no gating signal at all. Every neuron follows the same fixed rule, which highlights why later designs moved toward input-dependent, learnable gating.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sVfx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sVfx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 424w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 848w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 1272w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!sVfx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png" width="1352" height="1148" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1148,&quot;width&quot;:1352,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:84924,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sVfx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 424w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 848w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 1272w, https://substackcdn.com/image/fetch/$s_!sVfx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd224f129-2b1e-4425-a03e-024a07f7a41f_1352x1148.png 1456w" 
sizes="100vw" loading="lazy"></picture></div></a></figure></div><h2>Calculation</h2><p>SwiGLU begins with an input vector x that is sent through two learned linear transformations in parallel. The first transformation, W1, produces the feature vector W1x. The second transformation, W2, produces a gate logit W2x. This gate logit is passed through a <a href="https://www.byhand.ai/p/sigmoid">sigmoid</a> function to obtain a gate value between 0 and 1, which can be interpreted as the &#8220;open percentage&#8221; of the gate. 
Up to this point, the mechanism matches GLU: the feature vector is modulated by a learned gate derived from the same input.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F1FZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F1FZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 424w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 848w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 1272w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F1FZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png" width="1456" height="764" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:764,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:67318,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F1FZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 424w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 848w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 1272w, https://substackcdn.com/image/fetch/$s_!F1FZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7693d4b1-d57c-4335-8164-27d35b837403_1528x802.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<p>The distinctive step in SwiGLU is that the gate does not stop at &#963;(W2x). Instead, it multiplies the sigmoid output by the original gate logit, forming (W2x) &#183; &#963;(W2x). This creates an amplified, Swish-style gate that can smoothly scale features rather than only attenuate them. Finally, the output is computed as W1x &#8857; ((W2x) &#183; &#963;(W2x)), meaning the feature vector is element-wise multiplied by this amplified gate. 
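</p><p>To make the arithmetic concrete, here is a minimal NumPy sketch of the steps just described, with a GLU function alongside for contrast. It is an illustration, not code from the workbook: the function names <code>sigmoid</code>, <code>glu</code>, and <code>swiglu</code> and the small example values of x, W1, and W2 are assumptions made up here; only the formulas come from the text.</p><pre><code class="language-python">import numpy as np


def sigmoid(z):
    # Logistic function: maps each gate logit to a value between 0 and 1.
    return 1.0 / (1.0 + np.exp(-z))


def glu(x, W1, W2):
    # GLU: feature vector W1 x, scaled element-wise by the sigmoid gate.
    return (W1 @ x) * sigmoid(W2 @ x)


def swiglu(x, W1, W2):
    features = W1 @ x                        # feature vector W1 x
    gate_logit = W2 @ x                      # gate logit W2 x
    gate = gate_logit * sigmoid(gate_logit)  # Swish-style gate (W2 x) * sigmoid(W2 x)
    return features * gate                   # element-wise product


# Hypothetical values, chosen only so the arithmetic is easy to follow by hand.
x = np.array([1.0, 2.0])
W1 = np.array([[1.0, 0.0],
               [0.0, 1.0]])
W2 = np.array([[2.0, 0.0],
               [0.0, -1.0]])

print("GLU   :", glu(x, W1, W2))
print("SwiGLU:", swiglu(x, W1, W2))
</code></pre><p>With these made-up values the first gate logit is 2, so the Swish-style gate is about 1.76 and enlarges the feature, while the GLU gate for the same logit is about 0.88 and can only shrink it.</p><p>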
The result is a dynamic, input-dependent transformation that can both suppress and amplify features in a smooth and expressive way.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8pUm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8pUm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 424w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 848w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 1272w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8pUm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png" width="1424" height="872" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:872,&quot;width&quot;:1424,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:87140,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8pUm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 424w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 848w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 1272w, https://substackcdn.com/image/fetch/$s_!8pUm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe57352f0-b403-4bd1-8b87-50e3ddd06093_1424x872.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<p>For comparison, the diagram below shows the computation of GLU. In GLU, the gate logit is passed through a sigmoid and directly used to modulate the feature vector. There is no additional multiplication with the original gate logit, so there is no amplification step. 
The gate strictly scales features between 0 and 1, meaning it can only attenuate or pass them through, but not amplify them.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4_ju!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4_ju!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 424w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 848w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 1272w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4_ju!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png" width="1412" height="856" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:856,&quot;width&quot;:1412,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:79844,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4_ju!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 424w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 848w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 1272w, https://substackcdn.com/image/fetch/$s_!4_ju!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21980081-2e96-43c9-ad31-4dd6da1bb7ef_1412x856.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Batch</h2><p>Let us scale the calculation to a batch of six examples, x1 to x6. Each input vector has dimension 3, and the output dimension is 2. The two linear transformations are applied to every example in the batch, producing a 2-dimensional feature vector and a 2-dimensional gate logit for each input. After applying the sigmoid (and the additional amplification step in SwiGLU), each of the two output features receives its own independently scaled gate value. 
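</p><p>The batch calculation can be sketched as a loop over the six examples. This is a rough sketch in plain Python under stated assumptions: the weights and inputs below are hypothetical (they are not the numbers in the diagram), biases are omitted, and each weight matrix is stored as a list of output rows.</p>

```python
import math

def sigmoid(z):
    """Logistic function: maps any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def linear(x, weight):
    """Apply a bias-free linear map; weight is a list of rows, one per output."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]

def swiglu_batch(batch, w_feature, w_gate):
    """Apply SwiGLU to every example; gating is per example and per feature."""
    outputs = []
    for x in batch:
        feature = linear(x, w_feature)               # 2 features per example
        gate_logit = linear(x, w_gate)               # 2 gate logits per example
        gate = [g * sigmoid(g) for g in gate_logit]  # sigmoid, then amplification
        outputs.append([f * s for f, s in zip(feature, gate)])
    return outputs

# Hypothetical weights (2 outputs, 3 inputs) and six 3-dimensional inputs x1 to x6.
w_feature = [[1.0, 0.0, -1.0], [0.0, 1.0, 1.0]]
w_gate = [[1.0, 1.0, 0.0], [0.0, -1.0, 1.0]]
batch = [
    [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0],
    [1.0, 1.0, 0.0], [0.0, 1.0, 1.0], [1.0, 1.0, 1.0],
]

out = swiglu_batch(batch, w_feature, w_gate)
for row in out:
    print(row)
```

<p>The result has six rows and two columns, one gated value for each example and each feature.</p><p>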
In other words, gating happens per feature and per example, so every feature channel can be modulated differently across the six inputs.</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_lI9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_lI9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 424w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 848w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 1272w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_lI9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png" width="981" height="1570" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f75b5181-6318-4fd6-978a-022d1c684297_981x1570.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1570,&quot;width&quot;:981,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:130800,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33a7c758-12a9-405f-8d0e-8bf19fd12569_1202x1570.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_lI9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 424w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 848w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 1272w, https://substackcdn.com/image/fetch/$s_!_lI9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75b5181-6318-4fd6-978a-022d1c684297_981x1570.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Application</h2><p>The most common application of SwiGLU is in the feed-forward network (FFN) of the Transformer architectures behind frontier models, where it replaces the traditional ReLU activation used in the original Transformer paper. The FFN typically consists of two linear layers: the first expands the hidden dimension into a higher-dimensional space (for example, from 2 to 4), and the second projects it back to the original model dimension (from 4 back to 2), as illustrated in this example. 
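</p><p>A feed-forward block of this shape might look like the sketch below, written in plain Python. It assumes a model dimension of 2 and an expanded dimension of 4 as in the text, omits biases, and uses hypothetical names (<code>w_feature</code>, <code>w_gate</code>, <code>w_down</code>) and hypothetical weights.</p>

```python
import math

def sigmoid(z):
    """Logistic function: maps any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def linear(x, weight):
    """Apply a bias-free linear map; weight is a list of rows, one per output."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]

def swiglu_ffn(x, w_feature, w_gate, w_down):
    """Expand 2 to 4 in two parallel branches, gate element-wise, project 4 to 2."""
    feature = linear(x, w_feature)      # feature vector, dimension 4
    gate_logit = linear(x, w_gate)      # gate logit, dimension 4
    gated = [f * g * sigmoid(g) for f, g in zip(feature, gate_logit)]
    return linear(gated, w_down)        # back to the model dimension, 2

# Hypothetical weights chosen for illustration only.
w_feature = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]  # 4 rows, 2 columns
w_gate = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]      # 4 rows, 2 columns
w_down = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]          # 2 rows, 4 columns

y = swiglu_ffn([1.0, 2.0], w_feature, w_gate, w_down)
print(y)
```

<p>Both expanding projections read the same input, so the feature vector and the gate logit line up element by element before the final projection.</p><p>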
In this expanded space (dimension 4), the feature vector, the gate logit, and the resulting gated activation vector all share the same dimensionality, allowing element-wise modulation before the final projection back to the model size.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GN5j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GN5j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 424w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 848w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 1272w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GN5j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png" width="1232" height="1420" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1420,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:121092,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GN5j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 424w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 848w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 1272w, https://substackcdn.com/image/fetch/$s_!GN5j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce2e198f-1245-46fa-bfaf-935cad13d8b9_1232x1420.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Below is the layout of the feed-forward layer in the original Transformer paper for comparison. Notice the differences. First, the activation function is ReLU, a fixed, predefined nonlinearity. Second, the first projection is a single linear transformation&#8212;there is no parallel split into separate feature and gate branches. 
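</p><p>For contrast, the original layout can be sketched in the same plain-Python style. Again this is only an illustration: biases are omitted, and the names <code>w_up</code> and <code>w_down</code> and the weights are hypothetical.</p>

```python
def relu(z):
    """ReLU: keep positive values, replace negative values with zero."""
    return max(0.0, z)

def linear(x, weight):
    """Apply a bias-free linear map; weight is a list of rows, one per output."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]

def relu_ffn(x, w_up, w_down):
    """One expanding projection, a fixed nonlinearity, one projection back."""
    hidden = [relu(h) for h in linear(x, w_up)]  # single branch, no gate
    return linear(hidden, w_down)

# Hypothetical weights chosen for illustration only.
w_up = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]  # 4 rows, 2 columns
w_down = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]     # 2 rows, 4 columns

y = relu_ffn([1.0, 2.0], w_up, w_down)
print(y)
```

<p>Here the nonlinearity depends only on the hidden value itself; no second projection of the input decides how much of each hidden unit passes through.</p><p>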
In other words, the original FFN performs a simple Linear &#8594; ReLU &#8594; Linear sequence, without any dynamic gating mechanism.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vwPS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vwPS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 424w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 848w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 1272w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vwPS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png" width="1196" height="1184" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1184,&quot;width&quot;:1196,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:99767,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vwPS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 424w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 848w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 1272w, https://substackcdn.com/image/fetch/$s_!vwPS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb94ecd-3d09-49a2-ab4f-be31e0648f69_1196x1184.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Implementation: Standalone Layer</h2><h3>PyTorch</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VRMt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VRMt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 424w, 
https://substackcdn.com/image/fetch/$s_!VRMt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 848w, https://substackcdn.com/image/fetch/$s_!VRMt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 1272w, https://substackcdn.com/image/fetch/$s_!VRMt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VRMt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png" width="1232" height="1148" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/edccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1148,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:147898,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73cd5151-62a1-4d96-9aa3-75210e5eb3e2_1232x1188.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!VRMt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 424w, https://substackcdn.com/image/fetch/$s_!VRMt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 848w, https://substackcdn.com/image/fetch/$s_!VRMt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 1272w, https://substackcdn.com/image/fetch/$s_!VRMt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedccbabb-fdae-4593-baef-bfba6d3f4e50_1232x1148.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h3>Excel Emulation</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v2Kv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v2Kv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 424w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 848w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 1272w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v2Kv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png" width="1236" height="452" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:452,&quot;width&quot;:1236,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:32285,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v2Kv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 424w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 848w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 1272w, https://substackcdn.com/image/fetch/$s_!v2Kv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62484025-6ea5-4de3-ab71-a629a4facf7f_1236x452.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zk9R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zk9R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 424w, 
https://substackcdn.com/image/fetch/$s_!Zk9R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 848w, https://substackcdn.com/image/fetch/$s_!Zk9R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 1272w, https://substackcdn.com/image/fetch/$s_!Zk9R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zk9R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png" width="1230" height="382" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:382,&quot;width&quot;:1230,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:48759,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zk9R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 
424w, https://substackcdn.com/image/fetch/$s_!Zk9R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 848w, https://substackcdn.com/image/fetch/$s_!Zk9R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 1272w, https://substackcdn.com/image/fetch/$s_!Zk9R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f4b528-8f4e-45a6-becf-1aae041ecf72_1230x382.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MW7-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MW7-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 424w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 848w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 1272w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MW7-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png" width="1222" height="624" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:624,&quot;width&quot;:1222,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73413,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MW7-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 424w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 848w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 1272w, https://substackcdn.com/image/fetch/$s_!MW7-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6715ae25-7ec6-4a0b-b7d8-38a236f0d2c6_1222x624.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jed8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jed8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 424w, 
https://substackcdn.com/image/fetch/$s_!jed8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 848w, https://substackcdn.com/image/fetch/$s_!jed8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!jed8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jed8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png" width="1226" height="1040" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1040,&quot;width&quot;:1226,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:115888,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!jed8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 424w, https://substackcdn.com/image/fetch/$s_!jed8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 848w, https://substackcdn.com/image/fetch/$s_!jed8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!jed8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F423f9970-0c0c-4f81-beb7-6859cb0cfe1d_1226x1040.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jDtu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jDtu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 424w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 848w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 1272w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jDtu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png" width="1222" height="860" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:860,&quot;width&quot;:1222,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:97344,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jDtu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 424w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 848w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 1272w, https://substackcdn.com/image/fetch/$s_!jDtu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5b8488f-274f-474b-852c-27da4f44c39e_1222x860.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!92rC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!92rC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 424w, 
https://substackcdn.com/image/fetch/$s_!92rC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 848w, https://substackcdn.com/image/fetch/$s_!92rC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 1272w, https://substackcdn.com/image/fetch/$s_!92rC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!92rC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png" width="1230" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:1230,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:55744,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!92rC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 
424w, https://substackcdn.com/image/fetch/$s_!92rC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 848w, https://substackcdn.com/image/fetch/$s_!92rC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 1272w, https://substackcdn.com/image/fetch/$s_!92rC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd93a8d4-e875-46ea-b659-7c331f08e024_1230x448.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Implementation: Feed-Forward Layer in Frontier Models</h2><p>Below is a practical implementation of SwiGLU inside a feed-forward network. Instead of using two separate linear layers for the feature and gate branches, we combine them into a single &#8220;up&#8221; projection that outputs twice the hidden dimension. This allows the model to perform one larger matrix multiplication instead of two smaller ones. On modern GPUs, a single larger matmul is typically more efficient than two smaller ones. After this combined projection, the tensor is split into feature and gate branches, the gating and amplification are applied, and the result is projected back to the model dimension. This structure reflects how SwiGLU is implemented in most frontier Transformer models for both performance and scalability.</p><h3>PyTorch</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_DCj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_DCj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 424w, https://substackcdn.com/image/fetch/$s_!_DCj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 848w, 
https://substackcdn.com/image/fetch/$s_!_DCj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 1272w, https://substackcdn.com/image/fetch/$s_!_DCj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_DCj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png" width="1098" height="582" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:582,&quot;width&quot;:1098,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80663,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_DCj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 424w, https://substackcdn.com/image/fetch/$s_!_DCj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 
848w, https://substackcdn.com/image/fetch/$s_!_DCj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 1272w, https://substackcdn.com/image/fetch/$s_!_DCj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1251ce44-796d-4e16-9bdb-54b334e86654_1098x582.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!QXNY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QXNY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 424w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 848w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 1272w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QXNY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png" width="1084" height="618" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:618,&quot;width&quot;:1084,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:88540,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QXNY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 424w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 848w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 1272w, https://substackcdn.com/image/fetch/$s_!QXNY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fb580ae-74d8-430c-89db-988eeb2b4cee_1084x618.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UBmv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UBmv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 424w, 
https://substackcdn.com/image/fetch/$s_!UBmv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 848w, https://substackcdn.com/image/fetch/$s_!UBmv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 1272w, https://substackcdn.com/image/fetch/$s_!UBmv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UBmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png" width="1098" height="518" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:518,&quot;width&quot;:1098,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:61428,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UBmv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 
424w, https://substackcdn.com/image/fetch/$s_!UBmv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 848w, https://substackcdn.com/image/fetch/$s_!UBmv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 1272w, https://substackcdn.com/image/fetch/$s_!UBmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb9bae19-f805-4bb4-a43f-5c23483af6a6_1098x518.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><h3>Excel Emulation</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fkyj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fkyj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 424w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 848w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 1272w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fkyj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png" width="1218" height="678" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:678,&quot;width&quot;:1218,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:70720,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Fkyj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 424w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 848w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 1272w, https://substackcdn.com/image/fetch/$s_!Fkyj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a24a31f-6d78-4b44-a88d-809cbe2df15d_1218x678.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qwRd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qwRd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 424w, 
https://substackcdn.com/image/fetch/$s_!qwRd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 848w, https://substackcdn.com/image/fetch/$s_!qwRd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 1272w, https://substackcdn.com/image/fetch/$s_!qwRd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qwRd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png" width="1156" height="1474" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1474,&quot;width&quot;:1156,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:171515,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!qwRd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 424w, https://substackcdn.com/image/fetch/$s_!qwRd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 848w, https://substackcdn.com/image/fetch/$s_!qwRd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 1272w, https://substackcdn.com/image/fetch/$s_!qwRd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd2d72f7-9e29-4184-9352-dffa47eb2eff_1156x1474.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5RNL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5RNL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 424w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 848w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 1272w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5RNL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png" width="1152" height="802" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:802,&quot;width&quot;:1152,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:71726,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5RNL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 424w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 848w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 1272w, https://substackcdn.com/image/fetch/$s_!5RNL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fc13e35-7a27-47b1-b1b7-a49d95b82381_1152x802.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1wZP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1wZP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 424w, 
https://substackcdn.com/image/fetch/$s_!1wZP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 848w, https://substackcdn.com/image/fetch/$s_!1wZP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 1272w, https://substackcdn.com/image/fetch/$s_!1wZP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1wZP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png" width="1160" height="1368" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1368,&quot;width&quot;:1160,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:172848,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!1wZP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 424w, https://substackcdn.com/image/fetch/$s_!1wZP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 848w, https://substackcdn.com/image/fetch/$s_!1wZP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 1272w, https://substackcdn.com/image/fetch/$s_!1wZP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2965529-0bb6-4dcd-af2f-ff92fc7977ee_1160x1368.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!J-UJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!J-UJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 424w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 848w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 1272w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!J-UJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png" width="1148" height="802" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/550a1d95-191c-46d1-be80-880db46179fa_1148x802.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:802,&quot;width&quot;:1148,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:84605,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!J-UJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 424w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 848w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 1272w, https://substackcdn.com/image/fetch/$s_!J-UJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F550a1d95-191c-46d1-be80-880db46179fa_1148x802.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aneO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aneO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 424w, 
https://substackcdn.com/image/fetch/$s_!aneO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 848w, https://substackcdn.com/image/fetch/$s_!aneO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 1272w, https://substackcdn.com/image/fetch/$s_!aneO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aneO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png" width="1146" height="496" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:496,&quot;width&quot;:1146,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:59180,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" 
srcset="https://substackcdn.com/image/fetch/$s_!aneO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 424w, https://substackcdn.com/image/fetch/$s_!aneO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 848w, https://substackcdn.com/image/fetch/$s_!aneO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 1272w, https://substackcdn.com/image/fetch/$s_!aneO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6c9f03b-4842-41f6-a8aa-7f4066b7d19e_1146x496.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XvxM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XvxM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 424w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 848w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 1272w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XvxM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png" width="1142" height="484" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0d9a439-910b-484f-801b-ce20232513db_1142x484.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:484,&quot;width&quot;:1142,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:58886,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XvxM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 424w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 848w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 1272w, https://substackcdn.com/image/fetch/$s_!XvxM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0d9a439-910b-484f-801b-ce20232513db_1142x484.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2vQX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2vQX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 424w,
https://substackcdn.com/image/fetch/$s_!2vQX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 848w, https://substackcdn.com/image/fetch/$s_!2vQX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 1272w, https://substackcdn.com/image/fetch/$s_!2vQX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2vQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png" width="1154" height="850" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:850,&quot;width&quot;:1154,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:96467,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2vQX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 
424w, https://substackcdn.com/image/fetch/$s_!2vQX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 848w, https://substackcdn.com/image/fetch/$s_!2vQX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 1272w, https://substackcdn.com/image/fetch/$s_!2vQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9b3d69-0872-40de-97d3-8d8b8188fba6_1154x850.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0mFK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0mFK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 424w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 848w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 1272w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0mFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png" width="1164" height="336"
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:336,&quot;width&quot;:1164,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:26330,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0mFK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 424w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 848w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 1272w, https://substackcdn.com/image/fetch/$s_!0mFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e36bd10-ff91-48da-ad28-434322bd43dc_1164x336.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><h2>Excel Blueprint</h2><p>&#128279; <a href="https://aibyhand-my.sharepoint.com/:x:/g/personal/tom_aibyhand_onmicrosoft_com/IQCj_hivSyCAQaD-RNF1UEXZAW-vBespvFfV64FQuruL1G4?e=GE9hBg">View the Excel Blueprint Online</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!X_Ax!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp"
srcset="https://substackcdn.com/image/fetch/$s_!X_Ax!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 424w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 848w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 1272w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!X_Ax!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png" width="1088" height="1418" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1418,&quot;width&quot;:1088,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:386889,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/190528790?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" 
class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!X_Ax!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 424w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 848w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 1272w, https://substackcdn.com/image/fetch/$s_!X_Ax!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88b5d1ef-13a2-48d1-8c30-682b27156a65_1088x1418.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>(Limited time preview)</p><p>This is part of the <a href="https://www.byhand.ai/about#%C2%A7excel-blueprints-essential-ai-math">Essential AI Math Blueprints</a> series. Get the full series by <a href="https://www.byhand.ai/subscribe">joining the AI by Hand Academy.</a></p>]]></content:encoded></item><item><title><![CDATA[AI Isn't Human]]></title><description><![CDATA[What becomes possible when we stop comparing AI to ourselves?]]></description><link>https://www.byhand.ai/p/ai-isnt-human</link><guid isPermaLink="false">https://www.byhand.ai/p/ai-isnt-human</guid><dc:creator><![CDATA[Prof. Tom Yeh]]></dc:creator><pubDate>Sat, 07 Mar 2026 16:40:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!OE7F!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="pullquote"><p>What becomes possible when we <strong>stop</strong> <strong>comparing</strong> AI to ourselves? The answer just might be the difference between automation and transformation. ~ Jaime Teevan</p></div><p>Jaime Teevan is the <a href="https://www.microsoft.com/en-us/research/people/teevan/">Chief Scientist and Technical Fellow</a> at Microsoft. Let me compare myself to her. I am always two steps behind. I was born two years later than her. I entered MIT two years later than her. I got my PhD two years later than her. I have two children, two fewer than her four.
&#128513;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d4gH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d4gH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 424w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 848w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 1272w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d4gH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png" width="1456" height="1048" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1048,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:6571160,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/189711335?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d4gH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 424w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 848w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 1272w, https://substackcdn.com/image/fetch/$s_!d4gH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff85bec85-b563-4682-b1f1-80729c1b5bc4_2386x1718.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">My messy desk at MIT back in 2007, the year Jaime finished her PhD. I was two years behind her.</figcaption></figure></div><p>As I was comparing myself to Jaime, I was also reminded how similar we were back at MIT. We shared the same thesis committee member, Prof. Rob Miller. We shared the same application focus&#8212;search. Jaime studied personalized search, whereas I studied image-based search. We shared the same duty to cook lunch for more than a hundred PhD students.</p><p>Fast-forward to 2020, Jaime started the Future of Work initiative at Microsoft. I received a new grant from NSF to study AI&#8217;s impact on the Future of Work.
In retrospect, we both underestimated the impact, another similarity.</p><p>In 2025, Jaime, along with her team of editors and contributors from inside and outside of Microsoft, published the <a href="https://aka.ms/nfw2025">fifth New Future of Work report</a>. Our team also published the third paper from my Future of Work NSF project, once more two steps behind.</p><p>P.S. I highly recommend that you all read this report: <a href="https://aka.ms/nfw2025">https://aka.ms/nfw2025</a></p><p>When I compare myself to Jaime, my human friend, my connection with her grows stronger. When I compare myself to AI, AI just knows more, writes better, counts faster, drives safer, draws nicer. But there&#8217;s no soul behind it. It simply has access to a lot more data. I cannot feel connected with AI. AI isn&#8217;t human.</p><p>Indeed, AI isn&#8217;t human. Last week Jaime wrote an article about this point. She kindly agreed to let me share this article with you.</p><h1>AI Isn&#8217;t Human</h1><p>Written by Jaime Teevan (<a href="https://www.linkedin.com/pulse/ai-isnt-human-jaime-teevan-kkixc">Link to her original article on LinkedIn</a>)</p><p><em>Early lessons from our first close encounter with a very different intelligence.</em></p><p>It&#8217;s tempting to think of AI as being &#8220;like us.&#8221; Generative AI models speak our language, respond fluidly, and feel conversational, so we naturally anthropomorphize them, going so far as to frame them as virtual colleagues or even friends. But leaning too hard on a human metaphor for AI minimizes what&#8217;s uniquely human and constrains our imagination as to how we can best use it. After all, we already know how to collaborate with people.
The real opportunity ahead is to explore what&#8217;s possible when we work with something that is fundamentally different.</p><p>AI doesn&#8217;t just mimic human intelligence, it introduces new capabilities that can transform how we learn, create, and collaborate. Here are five ways AI is different from humans, and why those differences matter.</p><h2>1. Scale of Intelligence</h2><p>Humans excel at nuanced judgment, but we&#8217;re bounded by time and attention. Psychologist George A. Miller famously found that most people can actively hold only around five to nine independent numbers in their head at once. No person can read thousands of survey responses or synthesize millions of product reviews in an instant. AI models have no such limits. Consider restaurant reviews. Where you once had to read through each review one by one to get a feel for a restaurant, AI can now surface patterns across hundreds of reviews, amplifying insights or perspectives that you might otherwise have missed. This isn&#8217;t just about efficiency; it&#8217;s a shift toward collective intelligence.</p><p><em>Implication: Use AI to aggregate insights from employees, customers, and partners, turning fragmented feedback into actionable strategy.</em></p><h2>2. Speed of Ideas</h2><p>Human communication is often asynchronous and inherently slow. Even in fast-moving organizations, feedback loops take time. We wait for inspiration to strike, and then we wait again for others to weigh in. AI, on the other hand, responds instantly, offering ideas on demand and immediate feedback. You can brainstorm a dozen directions in seconds, and then immediately test and refine them. AI becomes both your creative partner and your first reviewer, helping you move from blank page to viable concept without waiting for the next meeting or email reply. It&#8217;s now possible to replace the &#8220;waiting game&#8221; of traditional collaboration with immediate, context-aware input. 
Late at night or across time zones, you can get suggestions, iterate, and improve, all without waking anyone up.</p><p><em>Implication: Redesign the cadence of work. Route first drafts and first feedback to AI by default, then bring people in where judgment is needed.</em></p><h2>3. Continuity of Interaction</h2><p>People carry context from one conversation to the next. That continuity is valuable. It&#8217;s how we build relationships and shared understanding. But continuity can also constrain us. We may hesitate to ask the same question twice or to explore radically different directions for fear of seeming inconsistent. If I make a bad argument in this article, you can&#8217;t just wipe it from your mind and let me give it another go. With AI, it&#8217;s possible to erase history and start over &#8211; there&#8217;s no penalty to being curious. Want to explore a different framing or simulate an alternative scenario? Reset and rerun. AI won&#8217;t judge, remember, or cling to the past. This ability to start an interaction afresh makes AI a powerful tool for creative exploration.</p><p><em>Implication: Use AI to model multiple futures. Instead of debating one strategy, generate and compare several, stress-testing assumptions before committing.</em></p><h2>4. Externalization of Cognition</h2><p>Human reasoning is opaque. We infer intentions and logic from what others say or do, but we can&#8217;t scroll through someone&#8217;s thought process. That opacity has social value. It fosters trust, privacy, and connection. But it also limits how we learn from one another. AI makes this invisible process visible. You can review chain-of-thought, tweak prompts, and trace how changes influence outcomes. Ask AI to provide feedback on an email from the perspective of an engineer, a marketer, or a customer, and watch how the responses shift. This transparency makes it easier to experiment, iterate, and learn. 
What&#8217;s more, AI&#8217;s thinking isn&#8217;t just visible; it&#8217;s portable. Prompts, responses, and model configurations can be saved, reused, and passed between people, turning not just outputs but the reasoning behind them into something others can pick up and use.</p><p><em>Implication: Make thinking shareable. Capture and circulate high-impact prompts, workflows, and model setups so your organization learns faster than any one person can.</em></p><h2>5. Ownership of Outcomes</h2><p>Humans operate inside systems of accountability. We learn through discomfort, where mistakes sting, reputations are at stake, and judgment is shaped by social and organizational norms. AI, on the other hand, can operate boldly and at scale, but it can&#8217;t be held responsible. It can&#8217;t lose its job, face consequences, or feel the weight of a bad decision. This difference matters. As AI takes on more execution, the human role shifts toward deciding when to trust its output, when to override it, and how to design the surrounding guardrails. The risk isn&#8217;t that AI will make &#8220;wrong&#8221; choices but that it has no skin in the game. That&#8217;s why human oversight matters. We need to design AI workflows that align AI outputs with organizational values, ensuring creativity doesn&#8217;t come at the cost of compliance or ethics. The opportunity is to pair AI&#8217;s fearless exploration with human oversight to open up previously unimaginable paths that people can turn into direction.</p><p><em>Implication: Treat AI as a bold contributor, but not a responsible one. Assign clear human ownership for decisions to ensure AI operates in alignment with your organization&#8217;s values.</em></p><h2>The Leadership Opportunity</h2><p>AI isn&#8217;t here to replace human intelligence; it&#8217;s here to expand it in ways we&#8217;ve only begun to imagine. 
AI will inevitably change your organization, and it&#8217;s your ability to capitalize on what makes it different that will determine how meaningful that change is.</p><p>So stop using AI to do the things humans already do well, and start using it to scale insight, accelerate iteration, explore without fear, and externalize expertise. More than a new tool, AI is a new layer of organizational cognition, one that can capture, share, and amplify how your teams think.</p><p>The next time you catch yourself saying, &#8220;AI is like a person,&#8221; pause. Then ask: What becomes possible when we stop comparing AI to ourselves? The answer just might be the difference between automation and transformation.</p><p>~ Jaime Teevan, Chief Scientist, Microsoft</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OE7F!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OE7F!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 424w, https://substackcdn.com/image/fetch/$s_!OE7F!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 848w, https://substackcdn.com/image/fetch/$s_!OE7F!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 1272w, 
https://substackcdn.com/image/fetch/$s_!OE7F!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OE7F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png" width="1456" height="838" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:838,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1115572,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.byhand.ai/i/189711335?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OE7F!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 424w, https://substackcdn.com/image/fetch/$s_!OE7F!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 848w, 
https://substackcdn.com/image/fetch/$s_!OE7F!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 1272w, https://substackcdn.com/image/fetch/$s_!OE7F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e395b7b-ca5c-4c73-a213-b6e8340d2c80_2098x1208.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div class="file-embed-wrapper" data-component-name="FileToDOM"><div class="file-embed-container-reader"><div 
class="file-embed-container-top"><image class="file-embed-thumbnail-default" src="https://substackcdn.com/image/fetch/$s_!0Cy0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack.com%2Fimg%2Fattachment_icon.svg"></image><div class="file-embed-details"><div class="file-embed-details-h1">New Future Of Work Report 2025</div><div class="file-embed-details-h2">4.67MB &#8729; PDF file</div></div><a class="file-embed-button wide" href="https://www.byhand.ai/api/v1/file/d6f94ed5-5123-4e25-9445-32acffd0601a.pdf"><span class="file-embed-button-text">Download</span></a></div><a class="file-embed-button narrow" href="https://www.byhand.ai/api/v1/file/d6f94ed5-5123-4e25-9445-32acffd0601a.pdf"><span class="file-embed-button-text">Download</span></a></div></div><p> </p>]]></content:encoded></item></channel></rss>