Commit 3ddc925c authored Jun 07, 2018 by A. Unique TensorFlower Committed by TensorFlower Gardener Jun 07, 2018

Improve performance of HloComputation::MakeInstructionPostOrder

Previously it used the same infrastructure as HloInstruction::Accept
what caused a high overhead for large models due to the excess amount of
work it have to do to support modifying the graph under iteration and due
to the lack of caching on graphs with multiple sinks.

The new code is a very simple implementation of an iterative DFS based
topological sort.

PiperOrigin-RevId: 199606688

parent c70b7128

Show whitespace changes

Inline Side-by-side

Please to comment