Reputation: 16050
Is there any sense to use Spark (in particular, MLlib) on a single node (besides the goal of learning this technology)? Is there any improvement in speed?
Upvotes: 0
Views: 229
Reputation: 1665
Are you comparing this to using a non-Spark machine learning system?
It really depends what the capabilities are of the other library you might use.
If, for example, you've got all your training data stored in Parquet files, then Spark makes it very easy to read in those files and work with, whether that's on 1 machine or 100.
Upvotes: 1