Split topk GPU code into multiple files
This file was a bottleneck during compilation, often taking many minutes to compile. In local testing this change reduces the wall-clock build time for the topk GPU kernels from 155s to 48s. PiperOrigin-RevId: 228302319
Loading
Please sign in to comment