SSIS - How to improve a work flow

Question

Previously I have asked for a possible solution for a situation that I had to face in order to implement a sql query (which is implementing originally in access). I have reach a solution (after asking a lot) but I would like to know if anyone has another way to execute this query.

I have got two different tables, one in sql and another in oracle (S and O)

O(A, B, C) => PK=(A,B) and S(D,E,F) => PK = (D,E)

The query looks like this

SELECT A,B,C,E,F
FROM S INNER JOIN O ON
S.D = O.A      (Only one attribute of the PK in O)

S has over 10.000 registers and O more than 700 millions. Given this, is not logic to implement a merge join, or a look up because I will have only the first match between D and A. So I thought that it will be better to assemble the query in the Oracle side. To do this I have implemented an scheme like this.

enter image description here

With the sql I have executed this query:

with tmp(A) as ( select distinct D as A from S
)
select cast( select concat(' or A = ', A) 
             from tmp 
             for xml path('')) as nvarchar(max)) as ID

I am getting a string with the values that I gonna search on oracle. Finally in the data flow, I am creating an expression like this:

select A, B, C
from O
where A= '' + @ID

I downnload this values to sql server and then I am able to manipulate them as I wish.

The use of the foreach loop was necessary because I am storing the string of sql inside an object variable. I found that SSIS has some troubles with the nvarchar(max) variables.

Some considerations:

1) The Oracle database is administered for another area of the company and they only gives reading permissions over the tables.

2) The DBA of the sql server does not allow to download the O table on a staging area. Not possibilities of negotiations with him, besides, this tabla is updated every day with more registers. He only manages this server and does not have any authority over Oracle.

3) The solution that was given for some members of my team was to create a query in oracle between different tables that can give me the attributes of O that I need, as a result I could get more than 3 millions of register and not all of the attributes A are presented in S. Even more, some the values of D has been manipulated, so possibly they are not going to be present in O.

With this implementation I am getting more than 150.000 registers from Oracle. But I would like to know if another solution can be implemented or if there are other components that I can use to reach the same results. Believe me when I say that I have read, asked and searched a lot before to implement this flow.

SSIS - How to improve a work flow

Answers (1)

Related Questions