In SQL Server, how to filter lots of elements across multiple columns

Question

I have a table, t1, with columns such as name, code1, code2,..., code20

There are, say, 100K rows.

I have another look up table, t2, which has one column, code; it has 10k rows and each row has a code. So, totally there are 10K codes in this 1-column table.

I need to filter out all the rows in t1 that have the codes in t2 from any column, i.e. columns code1 to code20. In other words, in each row in t1, once a column has one of the codes in t2, it should be captured.

Is there an easy way to do this? Thanks a lot!

Gordon Linoff · Accepted Answer

Here is a way to do it using not exists:

select t1.*
from t1
where not exists (select 1
                  from t2
                  where t2.code = t1.code1 or
                        t2.code = t1.code2 or
                        . . .
                        t2.code = t1.code20
                 );

It is tempting to use in as the condition in the nested select, but this behaves in a funky way with NULLs. The sequence of direct comparisons is easier.

That said, having 20 columns with the same type of data is usually a sign of poor table design. More typically, the data would be in some sort of association/junction table, with the 20 columns each appearing in their own row.

In SQL Server, how to filter lots of elements across multiple columns

Answers (2)

Related Questions