Commit a1801ecd authored by Russell Power, committed by TensorFlower Gardener

Add experimental asynchronous checkpoint hook.

This triggers checkpoints in a separate thread while allowing training to
continue.  This can effectively parallelize checkpointing and training for
workloads like TPUEstimator, where the weights are only updated after a number
of device iterations.
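
The idea described above can be illustrated with a minimal sketch in plain Python. It is not the code from this commit and does not use TensorFlow: the class `AsyncCheckpointSketch`, its `after_step`/`end` methods, and the `save_fn` callback are hypothetical names chosen for illustration. The sketch assumes the training state can be copied cheaply before the background thread writes it out.

```python
import threading
import time


class AsyncCheckpointSketch:
    """Hypothetical hook that saves state in a background thread every N steps."""

    def __init__(self, save_fn, save_steps):
        self._save_fn = save_fn        # callable(step, snapshot); stands in for a real saver
        self._save_steps = save_steps  # checkpoint interval, in training steps
        self._thread = None            # the in-flight save thread, if any
        self.saved_steps = []          # steps whose checkpoints finished writing

    def after_step(self, step, state):
        if step % self._save_steps != 0:
            return
        # Wait for any in-flight save so at most one checkpoint is written at a time.
        if self._thread is not None:
            self._thread.join()
        # Copy the state so the training loop can keep mutating the original.
        snapshot = dict(state)
        self._thread = threading.Thread(target=self._save, args=(step, snapshot))
        self._thread.start()

    def _save(self, step, snapshot):
        self._save_fn(step, snapshot)
        self.saved_steps.append(step)

    def end(self):
        # Block until the last checkpoint has been written.
        if self._thread is not None:
            self._thread.join()


def run_demo():
    written = {}

    def slow_save(step, snapshot):
        time.sleep(0.01)  # simulate slow checkpoint I/O
        written[step] = snapshot

    hook = AsyncCheckpointSketch(slow_save, save_steps=2)
    state = {"w": 0}
    for step in range(1, 7):
        state["w"] += 1  # stand-in for one training step
        hook.after_step(step, state)
    hook.end()
    return hook.saved_steps, written
```

Joining the previous thread before starting a new one keeps checkpoints ordered and bounds the number of concurrent writers to one. Training only stalls if a save takes longer than the interval between checkpoints.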

PiperOrigin-RevId: 214670991
parent 2116c664