Selection of initial medoids in PAM algorith

Question

I have read a couple of different articles on how PAM selects the initial medoids but I am getting conflicting views.

Some propose that the k first medoids are selected randomly, while others suggest that the algorithm selects initially the k representative medoids in the dataset (not clarifying how that "representativeness" happens though). Below I have listed these resources:

Medoid calculation

Drawbacks of K-Medoid (PAM) Algorithm

https://paginas.fe.up.pt/~ec/files_1112/week_06_Clustering_part_II.pdf

https://www.datanovia.com/en/lessons/k-medoids-in-r-algorithm-and-practical-examples/

1) My question would be if someone could explain in more detail how the algorithm selects the initial k medoids as from what I understand different initial selection can lead to different results.

2) Also is that one of the reasons of using CLARA (apart from minimizing computing time and RAM storage problem) - that is to find medoids through resampling that are the "optimal" options?

I am using R as a parenthesis, with the function pam(). Open to other functions in other libraries if there is a better alternative I am not aware of.

Selection of initial medoids in PAM algorith

Answers (1)

Related Questions