AI by Hand ✍️

Feature Extraction + Head

Fine-Tuning series · 6 of 8

Prof. Tom Yeh
Apr 24, 2026
∙ Paid


A feature head is a small trainable MLP bolted onto a frozen pretrained backbone. Think of it as pursuing a PhD on top of a master's degree. The master's — your pretrained backbone — stays exactly as it was, with no review. You aren't re-taking Linear Algebra or Probability; you're building something specialized on top of it: the PhD adds its own coursework, its own nonlinearity, and its own thesis layer.

This is one step richer than a linear probe, which bolts on a single linear projection — like earning one certificate after the master's. Certificates are quick and cheap, but they can only form linear combinations of subjects you already know. A feature head, with multiple trainable layers, can form nonlinear connections and capture task-specific structure the probe can't.
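The difference is easy to see in code. Here's a minimal PyTorch sketch, using the dimensions from this post's example (64 frozen features in, 10 classes out) — the shapes are illustrative, not prescriptive:

```python
import torch.nn as nn

feat_dim, n_classes = 64, 10  # dimensions borrowed from this post's example

# Linear probe: one linear projection on frozen features. Like a
# certificate, it can only form linear combinations of what the
# backbone already knows.
linear_probe = nn.Linear(feat_dim, n_classes)

# Feature head: a small MLP. The hidden layer and the ReLU between
# W4 and W5 let it capture nonlinear, task-specific structure.
feature_head = nn.Sequential(
    nn.Linear(feat_dim, 20),   # W4: 20 × 64
    nn.ReLU(),
    nn.Linear(20, n_classes),  # W5: 10 × 20
)
```

Both sit on the same frozen features; the only difference is the hidden layer and the nonlinearity in between.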

And it's a tighter budget than Freezing Layers, where we still refreshed a top layer's weights directly. Here, every weight in the backbone is permanently frozen — no ΔW anywhere. All trainable parameters live in the head.

In the diagram, the backbone (gray, dashed) extracts features without changing. The head (red border) is the trainable MLP: a nonlinear mapping from those frozen features to task predictions. This is the standard recipe in computer vision — take a pretrained ResNet or ViT, freeze it, and train a task-specific head on top.
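As a concrete sketch of the recipe, here is a toy version in PyTorch with the layer sizes from the diagram (W1: 64×32, W2 and W3: 64×64 in the frozen backbone; W4: 20×64 and W5: 10×20 in the head). A real pipeline would swap the toy backbone for a pretrained ResNet or ViT; everything else stays the same:

```python
import torch
import torch.nn as nn

# Toy stand-in for a pretrained backbone, with this post's layer sizes:
# W1 (64×32), W2 and W3 (64×64).
backbone = nn.Sequential(
    nn.Linear(32, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
)

# The trainable head: W4 (20×64) and W5 (10×20).
head = nn.Sequential(
    nn.Linear(64, 20), nn.ReLU(),
    nn.Linear(20, 10),
)

# Freeze every backbone weight -- no ΔW anywhere in the pretrained model.
for p in backbone.parameters():
    p.requires_grad = False
backbone.eval()  # also fixes dropout/batch-norm behavior, if present

# Only the head's parameters ever reach the optimizer.
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)

x = torch.randn(8, 32)      # a batch of 8 inputs
with torch.no_grad():       # feature extraction: no gradients, no updates
    features = backbone(x)
logits = head(features)     # gradients flow only through the head
```

Because the backbone never changes, its features can even be precomputed once and cached — and the same frozen model can serve many different heads.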

How much did we save?

Full fine-tuning would update every weight in both the backbone and the head — W1 through W5:

64 × 32 (W1) + 2 × 64 × 64 (W2, W3) + 20 × 64 (W4) + 10 × 20 (W5) = 11720

parameters.

Freezing the backbone leaves only the head trainable — W4 and W5:

20 × 64 (W4) + 10 × 20 (W5) = 1480

parameters. That's about 7.92× fewer weights to train, and — because the backbone is shared — the same frozen model can support dozens of downstream tasks, each with its own tiny PhD head.
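The arithmetic above is easy to check in a few lines of plain Python (weight matrices only, biases ignored, as in the hand calculation):

```python
# Weight shapes as (out, in), matching the post's hand calculation.
backbone_shapes = [(64, 32), (64, 64), (64, 64)]  # W1, W2, W3 (frozen)
head_shapes = [(20, 64), (10, 20)]                # W4, W5 (trainable)

def count(shapes):
    """Total number of weights across a list of (out, in) matrices."""
    return sum(out * inp for out, inp in shapes)

full_finetune = count(backbone_shapes) + count(head_shapes)
head_only = count(head_shapes)

print(full_finetune)                        # 11720
print(head_only)                            # 1480
print(round(full_finetune / head_only, 2))  # 7.92
```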

The next lesson takes the idea in a different direction: instead of bolting a head onto the end, we'll sprinkle small trainable modules throughout the network — adapter layers.


← Previous: Linear Probe | Adapter Layers →
