This CL avoids superfluous construction of backward graphs for nested function calls. It defers construction of the backward graph until a tape is actually used to compute a gradient.

PiperOrigin-RevId: 222916814
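The deferral described above can be illustrated with a small, library-free sketch. This is not the code changed by this CL: `LazyBackward`, `Tape`, `make_square_backward`, and `build_log` are hypothetical names invented for illustration, and the "backward graph" is reduced to a plain Python closure. The sketch only assumes that recording a call on a tape is cheap and that building its backward function is the expensive step worth postponing.

```python
# Hypothetical illustration of lazy backward construction; not TensorFlow code.

build_log = []  # records each time a backward function is actually built


class LazyBackward:
    """Stand-in for a function call whose backward pass is built on demand."""

    def __init__(self, name, build_backward):
        self.name = name
        self._build_backward = build_backward
        self._backward = None  # nothing is built at recording time

    def backward(self):
        # Build the backward function on first use, then reuse it.
        if self._backward is None:
            self._backward = self._build_backward()
        return self._backward


def make_square_backward():
    # Pretend this is expensive: it stands for building a backward graph.
    build_log.append("square")
    return lambda x, upstream: 2.0 * x * upstream


class Tape:
    """Minimal tape: remembers (op, input) pairs in call order."""

    def __init__(self):
        self.records = []

    def record(self, op, x):
        self.records.append((op, x))

    def gradient(self, upstream=1.0):
        # Only here are the backward functions requested (and thus built).
        grad = upstream
        for op, x in reversed(self.records):
            grad = op.backward()(x, grad)
        return grad


square = LazyBackward("square", make_square_backward)

tape = Tape()
x = 3.0
tape.record(square, x)   # inner call: x -> x**2
inner = x * x
tape.record(square, inner)  # outer call: x**2 -> x**4
outer = inner * inner

built_before = list(build_log)  # no backward function has been built yet
grad = tape.gradient()          # d(x**4)/dx at x = 3
built_after = list(build_log)   # built once, reused for both recorded calls
```

In this sketch, a tape that records calls but is never asked for a gradient triggers no backward construction at all, which is the saving the message describes.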