Select all rows where two columns have the same value?

Question

I'm working with SQL and was wondering how I would get all of the rows where values in 2 columns are equal. For example, imagine this table:

+----+---------+
| ID | Version |
+----+---------+
| AB |       1 |
| AB |       1 |
| BA |       2 |
| BA |       2 |
| CB |       1 |
+----+---------+

I want to select all rows where the IDs and versions match other rows with the same values in their ID and Version columns. In other words, I want to find duplicate values. So the desired output would be:

+----+---------+
| ID | Version |
+----+---------+
| AB |       1 |
| AB |       1 |
| BA |       2 |
| BA |       2 |
+----+---------+

How would I go about doing this as efficiently as possible in a table with over a million rows?

Gordon Linoff · Accepted Answer

The simplest method are probably window functions:

select t.*
from (select t.*,
             count(*) over (partition by id, version) as cnt
      from t
     ) t
where cnt >= 2;

If you have an index on (id, version) (or (version, id)), then the database engine should be able to take advantage of that.

Select all rows where two columns have the same value?

Answers (2)

Related Questions