python compute distance matrix from dictionary data

Question

I want to compute a distance matrix from a dictionary data like the following:

y = {"a": ndarray1, "b": ndarry2, "c": ndarry3}

The value of each key ("a", "b", "c") is a np.ndarry with different size. And I have a dist() function that can compute the distance between y["a"] and y["b"] through dist(y["a"], y["b"]).

So that the resulting distance matrix would be:

+----------------------------------------------------------------+
|                a        b                        c             |
+----------------------------------------------------------------+
| a  | 0        mydist(ndarrya1, ndarray)  mydist(ndarray1, ndarray3) |
| b  |          0                        mydist(ndarray2, ndarray3) |
| c  |                                   0                        |
+----------------------------------------------------------------+

I have tried scipy.spatial.distance.pdist with pdist(y, mydist), but got an error saying that:

[X] = _copy_arrays_if_base_present([_convert_to_double(X)])
  File "/usr/local/lib/python2.7/dist-packages/scipy/spatial/distance.py", line 113, in _convert_to_double
X = X.astype(np.double)
TypeError: float() argument must be a string or a number

Can anyone tell me how to implement this pdist by myself? I want to use the pdist result for further hierarchical clustering.

python compute distance matrix from dictionary data

Answers (1)

Related Questions