[XLA:GPU] Fix a problem in calculating the address of the memory used to implement small data type atomic operations.

To calculate the address of the enclosing 4-byte memory, output_address & (-2) was used, while it should be output_address & (-4). The -2 mask clears only the lowest address bit, so the result is 2-byte aligned rather than 4-byte aligned.

Add a test case.

PiperOrigin-RevId: 182251760
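
The difference between the two masks can be illustrated with a small standalone sketch. This is not the XLA code (which emits the address arithmetic as IR); the function names below are hypothetical and exist only for this illustration, and the addresses are made-up example values.

```cpp
#include <cstdint>

// Hypothetical helper (not an XLA function): round an address down to the
// start of the enclosing 4-byte word by clearing the two lowest bits.
constexpr std::uintptr_t EnclosingWordAddress(std::uintptr_t address) {
  return address & static_cast<std::uintptr_t>(-4);
}

// The previous behavior, for comparison: -2 clears only the lowest bit, so
// the result is aligned to 2 bytes, not 4.
constexpr std::uintptr_t BuggyEnclosingWordAddress(std::uintptr_t address) {
  return address & static_cast<std::uintptr_t>(-2);
}

// Byte position of the small element inside its enclosing 4-byte word.
constexpr std::uintptr_t ByteOffsetInWord(std::uintptr_t address) {
  return address & 3;
}

// With the -4 mask, every byte of the word at 0x1000 maps back to 0x1000.
static_assert(EnclosingWordAddress(0x1000) == 0x1000, "offset 0");
static_assert(EnclosingWordAddress(0x1001) == 0x1000, "offset 1");
static_assert(EnclosingWordAddress(0x1002) == 0x1000, "offset 2");
static_assert(EnclosingWordAddress(0x1003) == 0x1000, "offset 3");

// With the -2 mask, offsets 2 and 3 map to 0x1002, which is not the start
// of the enclosing 4-byte word.
static_assert(BuggyEnclosingWordAddress(0x1001) == 0x1000, "offset 1");
static_assert(BuggyEnclosingWordAddress(0x1002) == 0x1002, "offset 2");
static_assert(BuggyEnclosingWordAddress(0x1003) == 0x1002, "offset 3");
```

In this sketch the -2 mask happens to give the right answer for elements at byte offsets 0 and 1, and a wrong one only for offsets 2 and 3.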