How to retain one row and remove duplicates in mysql?

Question

I have a mysql table with each row having like 20 fields. Among others, it has:

table: origin, destination, date, price

Now I want to remove any rows that are duplicate regarding only one set of specific fields: origin, destination, date.

I tried:

delete from mytable where id not in 
    (select id from (
            SELECT MAX(p.id) as id from mytable p group by p.origin, p.destination, p.date
    ) x)

Problem: this retains the rows with the highest id (means: last added).

Instead I'd like to retain only the row that has the lowest price. But how?

Sidenote: I cannot add an unique index, as the table is used for mass inserts by LOAD DATA and should there not throw errors. At time of load I don't know which row is the "bestprice" one.

Also I would not want to introduce any additional or temp tables copying one to another. Just modify the existing table.

Blank · Accepted Answer

Self-join solution:

delete t1
from yourtable t1
join yourtable t2
on t1.origin = t2.origin
and t1.destination = t2.destination
and t1.date = t2.date
and t1.price > t2.price

How to retain one row and remove duplicates in mysql?

Answers (2)

Related Questions