What's the right strategy to implement a recursive/iterative routine in dataflow?

Question

I need to solve something similar to the equation Ax=b, in which A is an extremely large (but sparse) matrix. (It's 97% sparse; but its size is approximately 1,000,000,000 x 33,000,000)

Without getting bogged down into unnecessary details, the solution is effectively an iterative solution in which one “guesses X”, and then one plugs back the guessed “X” Back into the equation, evaluates it, and then re-determines the “next version of X”. The matrix A and the vector “b” don't change in these iterations. (There may be anywhere from 1,000 to 100,000 iterations).

My concern is how to write this iterative logic in such a manner that we don't keep copying the matrix A ( and even the vector B) in every iteration. (In theory, even vector “x” should not need to be re-instantiated in every iteration, but that cost is pretty minor.) If this were a single threaded application, it's quite easy to implement the logic. But I am unsure how to do this efficiently in the dataflow architecture. ( I instinctively feel that perhaps “side inputs” may be helpful here; or, instead of passing portions of the matrix A back-and-forth and recopy the data, maybe one just passes a pointer to the object?)

What's the right strategy to implement a recursive/iterative routine in dataflow?

Answers (1)

Related Questions

What&#39;s the right strategy to implement a recursive/iterative routine in dataflow?

Answers (1)

Related Questions

What's the right strategy to implement a recursive/iterative routine in dataflow?