SSIS - Reuse Ole DB source when matching Fact against lookup table twice

Question

I am pretty new to SSIS and BI in general, so first of all sorry if this is a newbie question.

I have my source data for the fact table in a csv, so I want to match the ids against the surrogate keys in lookup tables.

The data structure in the csv is like this

... userId, OriginStationId, DestinyStationId,..

What I am trying to accomplish is to match the data against my lookup table. So what I am doing is

Reading Lookup data using OLE DB Source
Reading my csv file
Sorting both inputs by the same field
Doing a left join by Id, in order to get the SK

This way, if there is no match (aka can't find the surrogate key) I can redirect that to a rejected csv and handle it later.

something like this:

(sorry for the spanish!)

I am doing this for each dimension, so I can handle each one with different error codes.

Since OriginStationId and DestinyStationId are two values from the same dimension (they both match against the same lookup table), I wanted to know if there's a way to avoid reading two times the data from the table (I mean, not to use two ole db sources to read twice the data from the same table).

I tried adding a second output to the sort but I am not allowed to. The same goes to adding another output from OLE DB Source.

I see there's an "cache option", is the best way to go ? (Although it would impy creating anyway another OLE DB source.. right?)

The third option I thought of was joining by the two fields, but since there is only one field in the lookup table (the same field) I am getting an error when I try to map both colums from my csv against the same column in my Lookup table

There are columns missing with the sort order 2 to 2

What is the best way to go for this ? Or I am thinking something incorrectly ? If something was not clear let me know and I'll update my question

Tab Alleman · Accepted Answer

Any time you wish you could have multiple outputs from a component that only allows one, all you have to do is follow that component with the Multicast component, whose sole purpose is to split a Data Flow stream into multiple outputs.

SSIS - Reuse Ole DB source when matching Fact against lookup table twice

Answers (2)

Related Questions