Commit ec2f8761 authored by Yangzihao Wang's avatar Yangzihao Wang Committed by TensorFlower Gardener
Browse files

Improved transpose operator's performance

Use specialized GPU kernels on tensors when the permutation can be reduced to {0,2,1}, {2,1,0} or {1,0}.
Change: 151147354
parent 4f20605a
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment