Commit 12f810aa authored by Bixia Zheng's avatar Bixia Zheng Committed by TensorFlower Gardener
Browse files

[XLA:GPU] Enhance unrolling heuristics for column reduction.

Previously, we enable unrolling only when the reduce operands are of small data
types. This change adds a simple analysis to count the number of tensors that
can be vectorized and can't be vectorized in order to decide whether unrolling
is beneficial for the kernel.

Add test cases.

PiperOrigin-RevId: 228618121
parent c2aa5390
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment