6 Comments
User's avatar
Luca's avatar

Sorry, I cannot get how to calculate the mean at the beginning of the Bit linear layer.

Prof. Tom Yeh's avatar

Ah. It's 0.05. I made a mistake. I will fix it soon.

Prof. Tom Yeh's avatar

It's fixed now.

The mean is the sum of all six numbers and divide it by 6.

Thanks for pointing out! I'm glad that you actually tried!

Jason Rich Darmawan's avatar

How do you calculate the layernorm? I use z = x - \mu / \sqrt{ \sigma^2 + \epsilon } but got different result

Aamir Bader's avatar

can you do swin transformers by hand?