newnewbie
newnewbie

Reputation: 1003

easy thing about clustering in R

According to the results I am getting ( I do not see that in the API), hclust works by using each row of a given matrix as a vector. Is there any way to work it so that it works with columns instead?

Besides, does dist work the same or does dist work with columns?

Upvotes: 0

Views: 92

Answers (1)

zero323
zero323

Reputation: 330063

You can always apply hclust to transposed matrix:

# If you have observations matrix 
m <- matrix(1:100, nrow=20)
hc <- hclust(dist(t(m)))

Besides, does dist work the same or does dist work with columns?

General convention is variables in columns, observations in rows and that's how dist works:

dist                   package:stats                   R Documentation

Distance Matrix Computation

Description:
This function computes and returns the distance matrix computed by
using the specified distance measure to compute the distances
between the rows of a data matrix.

Update

hclust works by using each row of a given matrix as a vector.

Actually internal implementation of hclust shouldn't matter. You pass as an argument dissimilarity structure produced by dist, and I am almost sure, that all metrics implemented in dist produce proper symmetrical distance matrix.

Upvotes: 3

Related Questions