Optimize NOT IN subquery

Question

I'm using a MySQL database and trying to optimize the following query:

SELECT * FROM main 
WHERE (name IS NOT NULL AND name != '') 
AND user_id NOT IN
( SELECT user_id FROM process
WHERE sns_id = 1  AND (process.status=1 OR process.status=2) 
AND process.updated_at > 1392335789207) LIMIT 100;

Essentially I want to make sure all those user_id in the process table are not included when selecting rows from main.

I've tried optimizing using EXISTS but that doesn't seem to produce any better performance.

How could I optimize this query?

Justin · Accepted Answer

You can use Query:

SELECT *
FROM main
WHERE name IS NOT NULL
 AND name != ''
 AND user_id NOT EXISTS
    (SELECT user_id
     FROM process
     WHERE sns_id = 1
       AND process.status in (1,2)
       AND process.updated_at > 1392335789207
       AND process.user_id = main.user_id) 
LIMIT 100

You saying that there no duplicate user id, so you can try use this query which will be probably the fastest solution:

SELECT m.*
FROM main m
LEFT JOIN process p
 ON p.user_id = m.user_id
 AND p.sns_id = 1
 AND p.status in (1,2)
 AND p.updated_at > 1392335789207
WHERE m.name IS NOT NULL
 AND m.name != ''
 AND p.user_id is null
LIMIT 100

Optimize NOT IN subquery

Answers (2)

Related Questions