Enable CUDNN_TENSOR_OP_MATH for fp16 RNNs (#17367)
- Speeds up CUDNN RNNs with fp16 input/output when possible on supported GPUs. Computations will fall back to pseudo-fp16 if tensor op math is not supported. - Enabled by default, but can be disabled by setting the environment variable TF_DISABLE_CUDNN_RNN_TENSOR_OP_MATH=1.
Loading
Please sign in to comment