Find how many duplicate connections are in a list of edges [python]

Question

Given a list of edges such as,

edges = [[1, 2], [1, 3], [2, 3], [4, 5], [4, 6], [5, 6], [10, 11], [12, 9], [12, 10]]

I need to find how many duplicate connections are in the list.

In this example: the connections occur in the order

dup = 0

1-2

1-2-3

then [2,3] are already connected so we increment dup by 1

1-2-3, 4-5

1-2-3, 4-5-6

then [5,6] are already connected so again we increment dup by 1

1-2-3, 4-5-6, 10-11

1-2-3, 4-5-6, 9-12, 10-11

1-2-3, 4-5-6, 9-10-11-12

return dup = 2

The last step is where my method messes up , because it counts [12,10] as a duplicate, since my current method is to add the numbers into a dictionary and check if both x and y are in the dictionary then i increment dup by 1

But what I really need to do is check if x and y are already connected, and if they are then increment dup by 1

But I am stuck trying to find a way to do this.

Tagc · Accepted Answer

While researching this problem I came across a package called networkx. Makes this problem really simple, apparently. I love how 90% of programming is just relying on smart people to do all the hard work, because I sure couldn't do it.

import networkx as nx

def find_duplicate_edges(edges):
    graph = nx.Graph()
    for n1, n2 in edges:
        if graph.has_node(n1) and graph.has_node(n2) and nx.has_path(graph, n1, n2):
            yield n1, n2
        else:
            graph.add_edge(n1, n2)

if __name__ == '__main__':
    edges = [[1, 2], [1, 3], [2, 3], [4, 5], [4, 6], [5, 6], [10, 11], [12, 9], [12, 10]]
    for edge in find_duplicate_edges(edges):
        print(edge)

Output

(2, 3)
(5, 6)

Find how many duplicate connections are in a list of edges [python]

Answers (2)

Related Questions