Commit 3436665d authored by James Keeling's avatar James Keeling Committed by TensorFlower Gardener
Browse files

Split scan ops GPU code into multiple files

This file was a bottleneck during compilation, often taking many minutes to compile. In local testing this change reduces the wall-clock build time for the scan ops GPU kernels from 107s to 96s.

PiperOrigin-RevId: 228304727
parent 0ef4b190
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment