Many-to-many join table, bi-directional relationships and data integrity

Question

I have an old legacy table like this:

 =====================
| product1 | product2 |
 =====================
|        1 |        2 |
|        2 |        1 |
|        3 |        4 |
|        4 |        5 |
|        5 |        3 |
 =====================

It's a poorly designed table, I know, but it has to stay. The purpose of the table is to store relationships between products. The rule is that each product should have a reference to each similar product and vice-versa. Assuming that the example data is the full dataset, the first two rows are correct - 1 has a relationship with 2 and 2 has a relationship with 1. However, the last three rows are a mess. The correct way of representing those products' relationships would be:

 =====================
| product1 | product2 |
 =====================
|        3 |        4 |
|        3 |        5 |
|        4 |        3 |
|        4 |        5 |
|        5 |        3 |
|        5 |        4 |
 =====================

What I need is some kind of algorithm (or tool) so that I can identify and fix the incorrect data.

Many-to-many join table, bi-directional relationships and data integrity

Answers (1)

Related Questions