Given input X, this exercise goes through the key equations behind the Self Attention layer and the Feed Forward layer, showing you how X is transformed into Y.
Transformer by hand ✍️
Given input X, this exercise goes through the key equations behind the Self Attention layer and the Feed Forward layer, showing you how X is transformed into Y.