Commit 929e3ee9 authored Feb 12, 2018 by Bixia Zheng Committed by TensorFlower Gardener Feb 12, 2018

[XLA:GPU] Extend the CustomCall for cudnn convolutions to represent

tensor_ops_enabled.

The convolution algorithms returned from the stream executor have a flag
for whether tensor_ops is enabled. This flag is used when running each
algorithm during auto-tunning. However, this flag is not currently represented
in the CustomCall representing the auto-tune result. As a result, the algorithm
may be run differently after auto-tune.

This change adds a constant to the CustomCall for cudnn convolution algorithm
selected by auto-tune, to represent whether tensor_ops is enabled during
auto-tune. This information is used by convolution thunk to ensure that the
algorithm is run with the same flag after auto-tune.

PiperOrigin-RevId: 185458497

parent 96564330

Show whitespace changes

Inline Side-by-side

Please to comment