Commit 81cabadc authored by Yao Zhang, committed by TensorFlower Gardener

Use the host implementation of the vec permute op if the input is on the host.

Note that the op still needs to be placed on the GPU so that it stays within the
same partition as the neighboring ops; as a result, no unnecessary send and
recv ops are created.
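
The placement argument above can be illustrated with a toy model. This is only a sketch and is not TensorFlow code: the node names, the device strings, and the helper count_send_recv_pairs are hypothetical, and the model assumes that every edge crossing a device boundary costs exactly one send/recv pair.

```python
# Toy model (not TensorFlow code): an edge whose endpoints sit on different
# devices crosses a partition boundary and is assumed to need one send/recv pair.

def count_send_recv_pairs(edges, placement):
    """Return the number of edges whose endpoints are on different devices."""
    return sum(1 for src, dst in edges if placement[src] != placement[dst])

# Hypothetical three-node chain: producer -> vec_permute -> consumer.
edges = [("producer", "vec_permute"), ("vec_permute", "consumer")]

# Option A: place the permute op on the CPU because its input lives on the host.
on_cpu = {"producer": "GPU:0", "vec_permute": "CPU:0", "consumer": "GPU:0"}

# Option B: keep the op on the GPU and run its host implementation there.
on_gpu = {"producer": "GPU:0", "vec_permute": "GPU:0", "consumer": "GPU:0"}

pairs_on_cpu = count_send_recv_pairs(edges, on_cpu)
pairs_on_gpu = count_send_recv_pairs(edges, on_gpu)
print(pairs_on_cpu, pairs_on_gpu)
```

In this model, moving the op to the CPU cuts both of its edges, while keeping it on the GPU leaves the chain in a single partition.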

PiperOrigin-RevId: 193457328
parent b7479a80