How do clusters work with ParameterServerStrategy in TensorFlow?

I don't understand how clusters work with ParameterServerStrategy in TensorFlow, and I need some clarification.

I have read this tutorial, but it does not clearly explain how to run ParameterServerStrategy across multiple machines. I have a working version on a single machine, but the workers and the parameter servers (ps) don't seem to do anything; it is the chief that runs everything. I have also tried to run it on multiple machines, using their public IP addresses and unused ports for the workers and the ps, but the chief does not seem to find them.
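To make the setup concrete, here is a minimal sketch of how such a cluster is usually described to each process through the `TF_CONFIG` environment variable. The addresses, the port, and the helper name `make_tf_config` are hypothetical placeholders for illustration, not my actual values, and the sketch only builds the configuration; it does not start TensorFlow.

```python
import json
import os

# Hypothetical addresses: replace with the real host:port of each machine.
CLUSTER = {
    "chief": ["10.0.0.1:2222"],
    "worker": ["10.0.0.2:2222", "10.0.0.3:2222"],
    "ps": ["10.0.0.4:2222"],
}


def make_tf_config(task_type, task_index, cluster=CLUSTER):
    """Return the TF_CONFIG JSON string for one task in the cluster.

    Every machine gets the same "cluster" dict; only "task" differs.
    """
    if task_type not in cluster:
        raise ValueError(f"unknown task type: {task_type!r}")
    if not 0 <= task_index < len(cluster[task_type]):
        raise ValueError(f"no {task_type} task with index {task_index}")
    return json.dumps(
        {"cluster": cluster, "task": {"type": task_type, "index": task_index}}
    )


# Example: what the second worker machine would set before starting TensorFlow.
os.environ["TF_CONFIG"] = make_tf_config("worker", 1)
```

As far as I understand, each worker and ps process would then build a `tf.distribute.cluster_resolver.TFConfigClusterResolver()`, start a `tf.distribute.Server` from its cluster spec, and call `server.join()`, while the chief passes the same kind of resolver to `tf.distribute.ParameterServerStrategy`. If the worker and ps processes are never started this way, that might explain why only the chief appears to do any work.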
