Reputation: 303
I have this input file:
id, feature1, feature2, ...
0, 0, 1, 1, 0, 0, 0, ...
1, 0, 0, 1, 0, 1, 0, ...
2, 1, 0, 0, 0, 0, 0, ...
3, 0, 0, 0, 0, 1, 0, ...
I want to compute its distance matrix with a MapReduce job, using Hadoop or Mahout, but Mahout does not seem to have a method for calculating a distance matrix. What should I do?
Thank you for your help.
Upvotes: 0
Views: 367
Reputation: 519
You can calculate the distance between each pair of records yourself using Mahout. Use the distance method of a DistanceMeasure implementation such as EuclideanDistanceMeasure, but you first have to convert the input file into a SequenceFile.
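As a rough illustration of the pairwise step only, here is a minimal plain-Java sketch that runs in memory on one machine. It is not a MapReduce job and does not use Mahout. The class and method names are hypothetical, Euclidean distance is an assumed choice of measure, and the sample rows are just the visible part of the truncated input above. With Mahout on the classpath, the `euclidean` helper would be replaced by a call to `DistanceMeasure.distance(Vector, Vector)` on vectors read from the SequenceFile.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class, not part of Mahout or Hadoop.
class DistanceMatrixSketch {

    // Parse "id, f1, f2, ..." lines into feature rows, dropping the leading id column.
    static double[][] parseRows(List<String> lines) {
        double[][] rows = new double[lines.size()][];
        for (int r = 0; r < lines.size(); r++) {
            String[] cells = lines.get(r).split(",");
            rows[r] = new double[cells.length - 1];
            for (int c = 1; c < cells.length; c++) {
                rows[r][c - 1] = Double.parseDouble(cells[c].trim());
            }
        }
        return rows;
    }

    // Assumed measure: Euclidean distance between two equal-length rows.
    static double euclidean(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Fill a symmetric n x n matrix; the diagonal stays 0.
    static double[][] distanceMatrix(double[][] rows) {
        int n = rows.length;
        double[][] dist = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = i + 1; j < n; j++) {
                double d = euclidean(rows[i], rows[j]);
                dist[i][j] = d;
                dist[j][i] = d;
            }
        }
        return dist;
    }

    public static void main(String[] args) {
        // Only the visible columns of the sample input; the real file has more.
        List<String> lines = Arrays.asList(
                "0, 0, 1, 1, 0, 0, 0",
                "1, 0, 0, 1, 0, 1, 0",
                "2, 1, 0, 0, 0, 0, 0",
                "3, 0, 0, 0, 0, 1, 0");
        double[][] dist = distanceMatrix(parseRows(lines));
        for (double[] row : dist) {
            System.out.println(Arrays.toString(row));
        }
    }
}
```

The double loop compares every pair once and mirrors the result, so the work grows quadratically with the number of records. That is the part a MapReduce job would have to distribute for a large input.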
Upvotes: 1