[XLA:GPU] Extend the CustomCall for cudnn convolutions to represent tensor_ops_enabled.

The convolution algorithms returned from the stream executor carry a flag indicating whether tensor_ops is enabled. Auto-tuning uses this flag when it runs each candidate algorithm. However, the CustomCall that records the auto-tune result does not currently represent the flag, so the selected algorithm may be run with a different tensor_ops setting after auto-tuning than the one it was measured with.

This change adds a constant to the CustomCall for the cudnn convolution algorithm selected by auto-tuning, recording whether tensor_ops was enabled during auto-tuning. The convolution thunk uses this constant to run the algorithm with the same flag after auto-tuning.

PiperOrigin-RevId: 185458497
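
The idea described above can be illustrated with a minimal, self-contained sketch. It is not the XLA implementation: `AlgorithmDesc`, `AutotuneCandidate`, `ConvCustomCallConfig`, `PickBestAlgorithm`, and `AlgorithmForThunk` are hypothetical names chosen for illustration, and the timing field is assumed. The point is only that the tensor_ops flag travels with the algorithm id from auto-tuning to execution.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an algorithm descriptor returned by the stream
// executor: an algorithm id plus the tensor_ops flag.
struct AlgorithmDesc {
  int64_t algo_id;
  bool tensor_ops_enabled;
};

// Hypothetical auto-tune measurement for one candidate algorithm.
struct AutotuneCandidate {
  AlgorithmDesc algorithm;
  double elapsed_ms;  // assumed timing result from running the candidate
};

// Hypothetical stand-in for the constants attached to the CustomCall.
// Before the change only the algorithm id would be recorded; the sketch
// records the tensor_ops flag next to it.
struct ConvCustomCallConfig {
  int64_t algorithm_id;
  bool tensor_ops_enabled;
};

// Picks the fastest candidate and records both the id and the flag.
// Precondition: candidates is non-empty.
ConvCustomCallConfig PickBestAlgorithm(
    const std::vector<AutotuneCandidate>& candidates) {
  const AutotuneCandidate* best = &candidates.front();
  for (const AutotuneCandidate& candidate : candidates) {
    if (candidate.elapsed_ms < best->elapsed_ms) {
      best = &candidate;
    }
  }
  return {best->algorithm.algo_id, best->algorithm.tensor_ops_enabled};
}

// What a thunk-like consumer would do: rebuild the descriptor from the
// recorded constants so the algorithm runs with the same flag.
AlgorithmDesc AlgorithmForThunk(const ConvCustomCallConfig& config) {
  return {config.algorithm_id, config.tensor_ops_enabled};
}
```

In this sketch, two candidates that share an algorithm id but differ in the tensor_ops flag stay distinguishable after selection, which is the situation the change is meant to handle.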