Fix normalization in Shampoo when dealing with differently sized tensors.
Add M^1/2 to reduce condition numbers, before computing inverse pth root. PiperOrigin-RevId: 211162032
Loading
Please sign in to comment
Add M^1/2 to reduce condition numbers, before computing inverse pth root. PiperOrigin-RevId: 211162032