Always copy tensors to the remote CPU when executing a remote kernel.
The copy to the appropriate device will happen when the kernel is executing. Without this, its possible we'll copy tensors to an accelerator device when the op expects host memory inputs. PiperOrigin-RevId: 231696485
Loading
Please sign in to comment