Reputation: 2064
I want to use tf.identity to copy the loss and variables either before or after the optimization step.
Here is the before case:
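Roughly something like this (a sketch of the same construction I use in the test below, where the identity copies are control dependencies of the training op):
loss_ident = tf.identity(loss)  # copy of the loss before the update
x_ident = tf.identity(x)        # copy of the variable before the update
with tf.control_dependencies([loss_ident, x_ident]):
    train_op = optim.minimize(loss)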
Here is the after case:
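And roughly like this (a sketch, assuming the copies are instead created under a control dependency on the training op):
train_op = optim.minimize(loss)
with tf.control_dependencies([train_op]):
    loss_ident = tf.identity(loss)  # copy of the loss after the update
    x_ident = tf.identity(x)        # copy of the variable after the update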
By "copy", I mean to create nodes in the computation graph to store the current values of loss and variables with tf.identity
.
Somehow, the copy ends up reflecting the value after the optimization step rather than before it (see the test below). How can I fix this?
I could evaluate the loss again right after the optimization step, but that wastes one evaluation of the loss in every cycle.
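For example, a sketch of that wasteful variant (the names loss_copy and x_copy are just placeholders):
sess.run(train_op)                       # optimization step
loss_copy, x_copy = sess.run([loss, x])  # extra forward pass just to read the values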
Test:
If copying of the loss and variables always happens before the optimization step, then the copies made in step 1 and step 2 should be the same.
Otherwise, the copies made in step 1 and step 2 can differ.
import numpy as np
import tensorflow as tf
x = tf.get_variable('x', initializer=np.array([1], dtype=np.float64))
loss = x * x
optim = tf.train.AdamOptimizer(1)
## Control Dependencies ##
loss_ident = tf.identity(loss) # <-- copy loss
x_ident = tf.identity(x) # <-- copy variable
with tf.control_dependencies([loss_ident, x_ident]):
    train_op = optim.minimize(loss)
## Run ##
init_op = tf.global_variables_initializer()
with tf.Session() as sess:
    sess.run(init_op)
    for i in range(1000):
        # step 1
        a_, x1_ = sess.run([loss, x_ident])
        # step 2
        b_, x2_ = sess.run([loss_ident, x_ident, train_op])[:-1]
        print("loss:", a_, b_)
        assert np.allclose(a_, b_)
        print("variables:", x1_, x2_)
        assert np.allclose(x1_, x2_)
Result:
           step 1   step 2
loss:      [1.]     [1.]
variables: [1.]     [1.58114875e-07]   # <-- not the same
AssertionError
Unfortunately, the copies of the variable in step 1 and step 2 are different. Therefore, copying of the variable does not always happen before the optimization step.
Upvotes: 1
Views: 64
Reputation: 11333
I am not entirely sure why the control dependencies don't work with the Tensors, but you can get it to work with Variables and tf.assign(). Here's my solution. From my understanding, all you need is for the copy to happen before train_op. From the few quick tests I did, this seems to work.
import numpy as np
import tensorflow as tf

tf.reset_default_graph()

x = tf.get_variable('x', initializer=np.array([1], dtype=np.float64))
x_ident = tf.get_variable('x_ident', initializer=np.array([1], dtype=np.float64))
loss = x * x
loss_ident = tf.get_variable('loss', initializer=np.array([1.0]), dtype=tf.float64)
optim = tf.train.AdamOptimizer(1)

## Control Dependencies ##
loss_ident = tf.assign(loss_ident, loss, name='loss_assign')  # <-- copy loss
x_ident = tf.assign(x_ident, x, name='x_assign')              # <-- copy variable
with tf.control_dependencies([x_ident, loss_ident]):
    train_op = optim.minimize(loss)

## Run ##
init_op = tf.global_variables_initializer()
with tf.Session() as sess:
    sess.run(init_op)
    for i in range(10):
        # step 1
        a, x1 = sess.run([loss_ident, x_ident])
        # step 2
        b, x2, _ = sess.run([loss_ident, x_ident, train_op])
        print('ab', a, b)
        print('x1x2', x1, x2)
        assert np.allclose(a, b)
        assert np.allclose(x1, x2)
Hopefully, this is what you're looking for.
Upvotes: 1