Update initializers.py (#10985)
According to [Understanding the difficulty of training deep feedforward neural networks](http://jmlr.org/proceedings/papers/v9/glorot10a/glorot10a.pdf) and the code, it is sqrt(2. / (in + out)).
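
For reference, a minimal sketch of the formula, assuming "it" refers to the standard deviation of a Glorot normal initializer and that `in` and `out` are the fan-in and fan-out of the weight tensor. The helper names `glorot_normal_stddev` and `sample_weights` are hypothetical and are not the code in initializers.py; the sampler uses a plain (untruncated) normal for illustration only.

```python
import math
import random


def glorot_normal_stddev(fan_in, fan_out):
    """Hypothetical helper: returns sqrt(2 / (fan_in + fan_out))."""
    return math.sqrt(2.0 / (fan_in + fan_out))


def sample_weights(fan_in, fan_out, seed=0):
    """Draw a fan_in x fan_out matrix from a zero-mean normal with that stddev.

    Assumption: a plain normal distribution; a real initializer may
    truncate or rescale the distribution.
    """
    rng = random.Random(seed)
    stddev = glorot_normal_stddev(fan_in, fan_out)
    return [[rng.gauss(0.0, stddev) for _ in range(fan_out)]
            for _ in range(fan_in)]
```

With `fan_in=3` and `fan_out=5`, `glorot_normal_stddev` evaluates sqrt(2 / 8), which is 0.5.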