edgarmtze

Reputation: 25058

Dimensionality reduction with MapReduce, using distributed computing?

Do you know of an application or algorithm for reducing the dimensionality of big data, perhaps using MapReduce or another API? Also:

Upvotes: 0

Views: 682

Answers (2)

rmalouf

Reputation: 3443

Besides Mahout, you should take a look at SLEPc (which is a toolkit based on PETSc) for solving eigenvalue problems for very large sparse matrices. It uses MPI, so it will run on lots of different parallel and distributed architectures. There's also Gensim, written in Python. It's probably not as scalable as either Mahout or SLEPc but it's much easier to use.

Upvotes: 1

Niels Basjes

Reputation: 10652

Have a look at Apache Mahout, because it includes an SVD implementation.
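
To give a feel for why SVD-style reduction fits MapReduce, here is a minimal pure-Python sketch. It is not Mahout's API or its algorithm; the function names (`map_partition`, `reduce_grams`, `top_right_singular_vector`) and the toy data are made up for illustration. The idea is that each mapper computes a partial Gram matrix (A^T A) for its slice of rows, the reducer sums them, and the leading right singular vector is then found by power iteration on the small d x d result. It assumes the number of columns d is small enough to fit on one machine and that the data is not all zeros.

```python
from functools import reduce
import math

def map_partition(rows):
    """Map step: partial Gram matrix (sum of row outer products) for one partition."""
    d = len(rows[0])
    gram = [[0.0] * d for _ in range(d)]
    for row in rows:
        for i in range(d):
            for j in range(d):
                gram[i][j] += row[i] * row[j]
    return gram

def reduce_grams(a, b):
    """Reduce step: element-wise sum of two partial Gram matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def top_right_singular_vector(gram, iterations=200):
    """Power iteration on A^T A; converges to the leading right singular vector of A."""
    d = len(gram)
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iterations):
        w = [sum(gram[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

# Toy data split into two "partitions"; every row is a multiple of (2, 1).
partitions = [
    [[2.0, 1.0], [4.0, 2.0]],
    [[6.0, 3.0], [-2.0, -1.0]],
]

gram = reduce(reduce_grams, map(map_partition, partitions))
v = top_right_singular_vector(gram)

# Project each 2-D row onto the leading direction to get a 1-D representation.
projected = [sum(x * c for x, c in zip(row, v)) for part in partitions for row in part]
```

On a real cluster the map and reduce calls would be replaced by the framework's own jobs, and for more than one component you would want a proper distributed solver (Lanczos or stochastic SVD) rather than plain power iteration.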

Upvotes: 2
