[XLA:GPU] Elide the SequentialThunk when emitting scatter with no copy
When the scatter is emitted without a copy, the thunk sequence has only one element. Wrapping that in a SequentialThunk still yields two thunks, and HLO profiling gets confused if it sees two thunks for the same instruction and one of them claims to be the whole instruction.

PiperOrigin-RevId: 216448063
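The idea can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual XLA code: `Thunk`, `KernelThunk`, `SequentialThunk`, and `WrapThunks` here are simplified, hypothetical stand-ins that only mirror the naming in the message above.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a unit of GPU work; not the real XLA class.
struct Thunk {
  virtual ~Thunk() = default;
  virtual std::string Kind() const = 0;
};

// Hypothetical leaf thunk, e.g. the scatter kernel itself.
struct KernelThunk : Thunk {
  std::string Kind() const override { return "kernel"; }
};

// Hypothetical wrapper that runs its children in order.
struct SequentialThunk : Thunk {
  explicit SequentialThunk(std::vector<std::unique_ptr<Thunk>> thunks)
      : thunks_(std::move(thunks)) {}
  std::string Kind() const override { return "sequential"; }
  std::vector<std::unique_ptr<Thunk>> thunks_;
};

// Return the lone thunk directly when there is exactly one, so that only one
// thunk is associated with the instruction. Otherwise wrap the sequence.
std::unique_ptr<Thunk> WrapThunks(std::vector<std::unique_ptr<Thunk>> thunks) {
  if (thunks.size() == 1) {
    return std::move(thunks.front());
  }
  return std::make_unique<SequentialThunk>(std::move(thunks));
}
```

With a helper like this, the no-copy path yields a single thunk for the instruction, while the copy-then-scatter path still gets a wrapper around both steps.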