Commit 4e7413a2 authored by A. Unique TensorFlower's avatar A. Unique TensorFlower Committed by TensorFlower Gardener
Browse files

[TF:XLA] Use XLA CPU runtime functions to speed up R2 dot in the HLO evaluator.

This CL adds a fast-path for R2 dot. For now the fast implementation has certain limitations:

1. Only operands with default layout, and
2. float type

It uses the XLA's CPU runtime functions which invoke eigen.

PiperOrigin-RevId: 225372611
parent 3dfbc353
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment