Commit 08313b87 authored by Eugene Zhulenev's avatar Eugene Zhulenev Committed by TensorFlower Gardener
Browse files

Optimize CuboidConvolutionBwdInput.

~25-30% speedup when compiled with AVX.

  * collapse inner dims before contraction
  * eval kernel tensor before contraction

PiperOrigin-RevId: 211651030
parent 11548e0a
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment