Fix misleading comment in layer normalization
The comment states that the moments are calculated across the last dimension. This is not true for convolutional layers, where the moments are calculated over all dimensions except the one with index 0. I changed the comment from "Calculate the moments on the last axis (layer activations)." to "By default, compute the moments across all the dimensions except the one with index 0."
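To illustrate the difference between the two comments, here is a minimal NumPy sketch. It is not the library's code: the `moments` helper and the (batch, height, width, channels) input shape are assumptions chosen only for illustration.

```python
import numpy as np


def moments(x, axes):
    """Hypothetical helper: mean and variance of x over `axes`, keeping dims."""
    mean = x.mean(axis=axes, keepdims=True)
    var = x.var(axis=axes, keepdims=True)
    return mean, var


# Assumed convolutional activations: batch=2, height=3, width=3, channels=4.
x = np.arange(2 * 3 * 3 * 4, dtype=np.float64).reshape(2, 3, 3, 4)

# What the old comment described: moments over the last axis only.
mean_last, var_last = moments(x, axes=(-1,))

# What the new comment describes: moments over every axis except index 0.
all_but_first = tuple(range(1, x.ndim))
mean_all, var_all = moments(x, axes=all_but_first)

print(mean_last.shape)  # (2, 3, 3, 1): one mean per spatial position
print(mean_all.shape)   # (2, 1, 1, 1): one mean per example
```

For a 2-D input of shape (batch, features), "all axes except index 0" is just the last axis, so the two descriptions only diverge for higher-rank inputs such as convolutional activations.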