Commit 63c0707c authored by A. Unique TensorFlower's avatar A. Unique TensorFlower Committed by TensorFlower Gardener
Browse files

Add vectorized version of div_no_nan using the new pcmp_eq and pandnot packet ops in Eigen.

Benchmark                          Base (ns)  New (ns) Improvement
------------------------------------------------------------------
BM_cpu_DivNoNan_scalar/4k              13574     12943     +4.6%
BM_cpu_DivNoNan_scalar/32k             31168     19213    +38.4%
BM_cpu_DivNoNan_scalar/128k            50737     42902    +15.4%
BM_cpu_DivNoNan_scalar/1M             137965    110706    +19.8%

PiperOrigin-RevId: 233085406
parent a3d38e48
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment