Great post! One question: isn’t LayerNorm just normalizing a vector?