Reputation: 587
I want to train a GAN with TensorFlow and then export the generator and the discriminator as tensorflow_hub modules.
For that:
- I define my GAN architecture with tensorflow
- train it and save checkpoints
- create a module_spec with different tags like:
(set(), {'batch_size': 8, 'model': 'gen'})
({'bs8', 'gen'}, {'batch_size': 8, 'model': 'gen'})
({'bs8', 'disc'}, {'batch_size': 8, 'model': 'disc'})
- export with module_spec at tf_hub_path using a checkpoint_path that I saved during training
Then I can load my generator with:
hub.Module(tf_hub_path, tags={"gen", "bs8"})
But when I try to load the discriminator with the analogous command:
hub.Module(tf_hub_path, tags={"disc", "bs8"})
I get the error:
ValueError: Tensor discriminator/linear/bias is not found in b'/tf_hub/variables/variables' checkpoint {'generator/fc_noise/kernel': [2, 48], 'generator/fc_noise/bias': [48]}
So I concluded that the discriminator's variables were not saved in the module on disk. I went through the possible sources of error I could think of.
First, I checked whether the checkpoint actually contained all the variables in my graph:
# Inside my model class: list everything the latest checkpoint contains
checkpoint_path = tf.train.latest_checkpoint(self.model_dir)
inspect_list = tf.train.list_variables(checkpoint_path)
print(inspect_list)
[('disc_step_1/beta1_power', []),
('disc_step_1/beta2_power', []),
('discriminator/linear/bias', [1]),
('discriminator/linear/bias/d_opt', [1]),
('discriminator/linear/bias/d_opt_1', [1]),
('discriminator/linear/kernel', [3, 1]),
('discriminator/linear/kernel/d_opt', [3, 1]),
('discriminator/linear/kernel/d_opt_1', [3, 1]),
('gen_step/beta1_power', []),
('gen_step/beta2_power', []),
('generator/fc_noise/bias', [48]),
('generator/fc_noise/bias/g_opt', [48]),
('generator/fc_noise/bias/g_opt_1', [48]),
('generator/fc_noise/kernel', [2, 48]),
('generator/fc_noise/kernel/g_opt', [2, 48]),
('generator/fc_noise/kernel/g_opt_1', [2, 48]),
('global_step', []),
('global_step_disc', [])]
So all the variables are correctly saved in the checkpoint, yet only the two variables related to the generator were exported into the TF Hub module on disk.
Finally, I suppose that my error comes from:
module_spec.export(tf_hub_path, checkpoint_path=checkpoint_path)
It seems that only the "gen" variant is taken into account when exporting the variables from checkpoint_path. I also checked that the variable names in module.variable_map matched the names listed in the checkpoint. Here is the variable map for the module with tag "disc":
print(module.variable_map)
{'discriminator/linear/bias': <tf.Variable 'module_8/discriminator/linear/bias:0' shape=(1,) dtype=float32>, 'discriminator/linear/kernel': <tf.Variable 'module_8/discriminator/linear/kernel:0' shape=(3, 1) dtype=float32>}
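That correspondence can also be checked mechanically. Here is a minimal sketch using plain sets, with the names copied from the listings above (the real check would build the sets from module.variable_map and tf.train.list_variables):

```python
# Names the 'disc' module variant needs (keys of module.variable_map):
needed = {'discriminator/linear/bias', 'discriminator/linear/kernel'}
# Names actually present in the exported module on disk (from the ValueError):
exported = {'generator/fc_noise/kernel', 'generator/fc_noise/bias'}

missing = needed - exported
print(sorted(missing))  # both discriminator variables are absent
```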
Thanks for your help
Upvotes: 1
Views: 197
Reputation: 587
I found a way to handle this problem, even though I think it's not the cleanest way to do this:
The following line defines the default module, i.e. the graph used when hub.Module is called with no tags:
(set(), {'batch_size': 8, 'model': 'gen'})
In fact, I realized that this set of parameters determines which graph is exported by module_spec.export. That explains why I was able to access the generator's variables when importing the module, but not the discriminator's.
Thus, I decided to make this set of parameters the default:
(set(), {'batch_size': 8, 'model': 'both'})
And in the _module_fn passed to hub.create_module_spec, I defined the inputs (and, respectively, the outputs) of both the generator and the discriminator as inputs (respectively, outputs) of the module. Thus, when exporting the module_spec, all the variables of the graph are exported and accessible.
Upvotes: 1