Use x*x instead of x^2 to calculate square in huber loss implementation. The...
Use x*x instead of x^2 to calculate square in huber loss implementation. The reason is d(x^y)/dy = x^y * log(x) and when x is zero, it becomes NaN. Even if y is constant, the check op would still report failure. PiperOrigin-RevId: 185208180
Loading
Please sign in to comment