[XLA:GPU] (Mostly) Disable fast-math optimizations
We remain performance neutral on the set of tracked benchmarks by allowing
fast-math like optimizations in two places:
1. We allow LLVM to generate div.approx when necessary by setting
-nvptx-prec-divf32=0
2. By default we emit llvm.min/llvm.max that has non IEEE behavior with NaNs.
PiperOrigin-RevId: 223610526
Loading
Please sign in to comment