Given input X, this exercise goes through the key equations behind the Self Attention layer and the Feed Forward layer, showing you how X is transformed into Y.
Share this post
Transformer by hand ✍️
Share this post
Given input X, this exercise goes through the key equations behind the Self Attention layer and the Feed Forward layer, showing you how X is transformed into Y.