Reputation: 1
I don't seem to understand how clusters work in the parameterServerStrategy in TensorFlow, and I need some clarifications.
I have read this tutorial, but they don't mention or explain clearly how to run parameterServerStrategy using multiple machines. I have a working version, but it is on a single machine, and the workers and the ps don't seem to do anything, it the chief that runs everything. I have tried to implement it on multiple machine where I used their global Ip:s and unused ports for the workers and ps, but the chief does not seem to find them.
Upvotes: 0
Views: 23