How to convert JavaRDD<Row> to JavaRDD<List<String>>?

Question

JavaRDD<List<String>> documents = StopWordsRemover.Execute(lemmatizedTwits).toJavaRDD().map(new Function<Row, List<String>>() {
    @Override
    public List<String> call(Row row) throws Exception {
        List<String> document = new LinkedList<>();
        for(int i = 0; i



I tried to do this with the code above, but I get a WrappedArray:

[[WrappedArray(happy, holiday, beth, hope, wonderful, christmas, wish, best)], [WrappedArray(light, shin, meeeeeeeee, like, diamond)]]


How do I do this correctly?

zero323 · Accepted Answer

You can use the getList method:

Dataset<Row> lemmas = StopWordsRemover.Execute(lemmatizedTwits).select("lemmas");
JavaRDD<List<String>> documents = lemmas.toJavaRDD().map(row -> row.getList(0));

where lemmas is the name of the column with the lemmatized text. If there is only one column (which looks like the case here), you can skip select. If you know the index of the column, you can also skip select and pass that index to getList, but this is error-prone.

Your current code iterates over the Row, not over the field you're trying to extract.
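
To make the difference concrete, here is a minimal sketch in plain Java that does not use Spark at all. It assumes a row can be modelled as a List<Object> holding a single list-valued field; the class RowFieldDemo and the methods iterateOverRow and extractField are hypothetical names made up for this illustration, not part of the Spark API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RowFieldDemo {

    // Mimics the question's approach: walk the fields of the row itself.
    // Assumption: the row is a plain List<Object> standing in for a Spark Row.
    public static List<String> iterateOverRow(List<Object> row) {
        List<String> document = new ArrayList<>();
        for (Object field : row) {
            // The only field is the whole word list, so it is added as one string.
            document.add(String.valueOf(field));
        }
        return document;
    }

    // Mimics the answer's approach: pick the list-valued field, return its elements.
    @SuppressWarnings("unchecked")
    public static List<String> extractField(List<Object> row, int index) {
        return new ArrayList<>((List<String>) row.get(index));
    }

    public static void main(String[] args) {
        List<Object> row = new ArrayList<>();
        row.add(Arrays.asList("light", "shin", "like", "diamond"));

        System.out.println(iterateOverRow(row).size());  // prints 1
        System.out.println(extractField(row, 0).size()); // prints 4
    }
}
```

Iterating over the row produces a single element that wraps the whole word list, which is the same shape as the WrappedArray output in the question. Extracting field 0 produces the four words as separate elements, which is what row.getList(0) does for the real Row.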
