[XLA:GPU] Use a fast approximation for tanh
Reuse the CPU implementation, which is in turn derived from Eigen. It claims to be accurate to within ±1%, which is good enough for fast math.

Refactor the CPU implementation into a common file and remove its VectorSupportLibrary dependency, which is not needed.

PiperOrigin-RevId: 206022260