Reputation: 55
I am trying to join 5 flows in which the first one is the driver and the others are being left outer joined to driver. I have used Ab Initio in the past where we could use a single join component and specify the kind of join for each input flow. I couldn't find any such Step in the Pentaho and hence I have to rely on the Merge Join which left outer joins only two tables at a time and then take its result to join with the next and so on and so forth. I am planning to do all that in a single transformation. What I am worried about it since Pentaho runs all the Steps in parallel, it might start to run a join which is much later in the flow without waiting for an earlier join to complete. Is this a valid concern? if so how do you tackle it in a single transformation?
Upvotes: 0
Views: 1408
Reputation: 2440
That’s correct you can only join 2 steps at a time. Answer to your second point, no it will execute parallel but your second join will wait for your first joie to finish. So you will get a proper result only.
Upvotes: 1