Reputation: 25058
Do you know an application or algorithm to reduce dimensionality of big data, maybe using Map-Reduce
, or other api
, also:
Do you know some algorithms like
Singular Value decomposition
than
can be useful to reduce dimention of
data sets
how to use distributed computing to solve this???
Upvotes: 0
Views: 682
Reputation: 3443
Besides Mahout, you should take a look at SLEPc (which is a toolkit based on PETSc) for solving eigenvalue problems for very large sparse matrices. It uses MPI, so it will run on lots of different parallel and distributed architectures. There's also Gensim, written in Python. It's probably not as scalable as either Mahout or SLEPc but it's much easier to use.
Upvotes: 1