Add GPU support for Bucketize op (#13922)
* Add GPU support for Bucketize op

  This fix adds GPU support for the `Bucketize` op. Before this PR, only a CPU implementation was available. This PR adds a GPU implementation with a CUDA kernel.

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Add GPU kernel for Bucketize op.

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Update test cases to invoke GPU implementation of Bucketize

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Move Eigen header to the top

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Address review feedback

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Use binary search to find the upper bound on GPU, similar to std::upper_bound

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Use CudaDeviceArrayOnHost instead of pinned memory

  Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
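
The upper-bound binary search mentioned in the commit list can be sketched on the host as below. This is a minimal C++ illustration only, not the CUDA kernel from this PR: the names `UpperBoundIndex` and `BucketizeSketch` are hypothetical, and the sketch assumes `float` inputs and boundaries sorted in ascending order. In a GPU version, each thread would presumably run the same loop for one input element.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: index of the first boundary strictly greater than
// `value`, i.e. the same result as std::upper_bound(...) - begin.
// Written as a plain loop so the same logic could run inside a device kernel.
inline int32_t UpperBoundIndex(const float* boundaries, int32_t count,
                               float value) {
  int32_t first = 0;
  int32_t remaining = count;
  while (remaining > 0) {
    const int32_t step = remaining / 2;
    const int32_t mid = first + step;
    if (!(value < boundaries[mid])) {
      // boundaries[mid] <= value: the answer lies to the right of mid.
      first = mid + 1;
      remaining -= step + 1;
    } else {
      // boundaries[mid] > value: keep searching the left half.
      remaining = step;
    }
  }
  return first;
}

// Hypothetical host-side stand-in for the op: maps every input value to the
// index of the bucket it falls into.
inline std::vector<int32_t> BucketizeSketch(
    const std::vector<float>& input, const std::vector<float>& boundaries) {
  // Assumption of this sketch: boundaries are sorted in ascending order.
  assert(std::is_sorted(boundaries.begin(), boundaries.end()));
  std::vector<int32_t> output(input.size());
  for (size_t i = 0; i < input.size(); ++i) {
    output[i] = UpperBoundIndex(boundaries.data(),
                                static_cast<int32_t>(boundaries.size()),
                                input[i]);
  }
  return output;
}
```

With boundaries `{0, 10, 100}`, a value equal to a boundary lands in the bucket to its right (for example, `10` maps to bucket `2`), which is the behaviour that upper-bound semantics give.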