Compare large MySQL tables

Question

I need to make a comparison between two (or more) tables with around 60.000 rows and about 60 columns.

In these tables there are two values on which I want to run a query. The purpose of the query is to count the rows which exists in TABLE_A but don't exist in TABLE_B based on two values in the row.

I've ran the following query:

SELECT id
FROM table_a ta
WHERE NOT EXISTS (
  SELECT id
  FROM table_b tb
  WHERE ta.value1=tb.value1 AND ta.value2=tb.value2
)

As said, I've tried the code above and some variations on it. But to run this query it takes ages before it's finished. I hope to find a solution which runs in under 10 seconds.

Next query I tried, and of which I thought was working:

SELECT value1, value2
FROM (
    SELECT ta.value1, ta.value2
    FROM table_a ta
    UNION ALL
    SELECT tb.value1, tb.value2
    FROM table_b tb
) result
GROUP BY value1, value2
HAVING COUNT(*) = 1
ORDER BY value1

The code shows me all differences between the two tables. So if valueX exists in TABLE_A but not in TABLE_B it's shown and vice versa.

So in short, I want to get all rows from TABLE_A which are not present in TABLE_B based on two values in the row.

Hope someone can help, thanks!

Sjoerd · Accepted Answer

After some trial and error I have improved the second block of code. I noticed an additional field in my table which I could use to further filter the results.

SELECT date, value1, value2
FROM (
    SELECT date, value1, value2
    FROM (
        SELECT ta.date, ta.value1, ta.value2
        FROM table_1 ta
        UNION ALL
        SELECT tb.date, tb.value1, tb.value2
        FROM table_2 tb
    ) filter
    GROUP BY value1, value2
    HAVING COUNT(*) = 1
) result
WHERE date='YYYY-MM-DD'

This code filters the results in under 4 seconds.

Anyway, thanks everyone for the trouble.

Compare large MySQL tables

Answers (2)

Related Questions