3 Comments
User's avatar
Judy's avatar

There might be a mistake? ∂L / ∂z1 should be [ 1 , -2, 2 , -1] ?

Expand full comment
Breno Brito's avatar

Looks like ∂L / ∂z2 = [ 1 , 0 ] and ∂L / ∂z1 = [ 1 , 0 , 2 , 0 ] instead because of ReLU, no?

Expand full comment
Judy's avatar

It seems that ReLU is not applied to gradients. Instead, ReLU only works on activation of the output.

Expand full comment