Custom Conv3DBackprop Input/Filter kernels.
Roughly 2x-3x speedup over the Eigen kernels when compiled with AVX, at the cost of memory overhead (temporary buffers must be allocated). The memory overhead is bounded: when memory requirements grow too large, the kernels fall back to the Eigen implementation.

PiperOrigin-RevId: 212734097
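
The fallback rule described above can be sketched roughly as follows. This is only an illustration, not the code from this change: the names `EstimateTempBufferBytes`, `ChoosePath`, and `Conv3DBackpropPath` are hypothetical, and the way the buffer size is estimated is an assumption. The commit message does not state the actual limit, so the limit is passed in as a parameter.

```cpp
#include <cstddef>

// Hypothetical names throughout; none of these come from the actual change.
enum class Conv3DBackpropPath { kCustom, kEigenFallback };

// Assumed size model: one temporary buffer holding `num_patches` patches of
// `patch_elems` elements each, with `elem_size` bytes per element.
inline std::size_t EstimateTempBufferBytes(std::size_t patch_elems,
                                           std::size_t num_patches,
                                           std::size_t elem_size) {
  return patch_elems * num_patches * elem_size;
}

// Use the custom kernel while the temporary buffer fits within `limit_bytes`;
// otherwise fall back to the Eigen implementation.
inline Conv3DBackpropPath ChoosePath(std::size_t temp_bytes,
                                     std::size_t limit_bytes) {
  return temp_bytes <= limit_bytes ? Conv3DBackpropPath::kCustom
                                   : Conv3DBackpropPath::kEigenFallback;
}
```

Under these assumptions, a caller would compute the estimate once per op invocation and dispatch on the result, so the extra memory used by the faster path never exceeds the chosen limit.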