Closest cluster distance (No centroid)

Question

I am interested in finding the distribution of nearest neighbor cluster distance in a spatial data set (lon, lat). My cluster criteria is simple, meaning that when two points are next to each other they belong to the same cluster and the minimum number of points in a cluster is one. To do so I am using sklearn.cluster.DBSCAN. After clustering, I want to find the distance to the closest cluster for each cluster and that's where I am having problems. Everything I have found calculates the nearest neighbor distance between the centroids of the clusters, and I want to use the boundaries instead.

Instead finding the blue distance, I want to find the black.

At the moment I am doing so by taking all the points from one cluster, then calculating the distance of every point of this cluster with all the points of the remaining clusters and finally taking the minimum distance. However, as you can imagine this is very inefficient and the calculation takes forever.

Does anyone knows how to properly do this?

Closest cluster distance (No centroid)

Answers (1)

Related Questions