Reputation: 1390
I am trying to separate a data set that has 2 clusters that do not overlap in anyway and a single data point that is away from these two clusters.
When I use kmeans() to get the 2 clusters, it splits one of the "valid" cluster into half and considers the single data point as a separate cluster.
Is there a way to specify minimum number of points for this? I am using MATLAB.
Upvotes: 0
Views: 832
Reputation: 306
There are several solutions:
More explanations on the inadequate behaviour of k-means can be found on Cross Validated site (see here for instance).
Upvotes: 1