SSIS: flagging ALL the Data Quality issues in each row with Conditional Split

Question

I have been tasked with performing Data Quality checks on data from a SQL table, whereby I export problem rows into a separate SQL table.

So far I've used a main Conditional Split that goes into derived columns: 1 per conditional split condition. It is working whereby it checks for errors, and depending on which condition is failed first, the data is output with a DQ_TYPE column populated with a certain code (e.g. DQ_001 if it had an error with the Hours condition, DQ_002 if it hit an error with the Consultant Code condition, and so on).

The problem is that I need to be able to see all of the errors within each row. For example at the moment, if Patient 101 has a row in the SQL table that has errors in all 5 columns, it'll fail the first condition in Conditional Split and 1 row will get output into my results with the code DQ_001. I would instead need it to be output 5 times, once for each error that it encountered, i.e. 1 row with DQ_001, a 2nd row with DQ_002, a 3rd row with DQ_003 and so on.

The goal is that I will use the DataQualityErrors SQL table to create an SSRS report that groups on DQ_TYPE and we can therefore Pie Chart to show the distribution of which error DQ_00X codes are most prevalent.

Is this possible using straightforward toolbox functions? Or is this only available with complex Script tasks, etc.?

SSIS: flagging ALL the Data Quality issues in each row with Conditional Split

Answers (1)

Related Questions