Mike Pone

Reputation: 19330

Converting CSV values to Vector in Spark Dataframe in Java

I have a CSV file with two columns

id, features

The id column is a string and the features column is a comma-delimited list of feature values for a machine learning algorithm, e.g. "[1,4,5]". I basically just need to call Vectors.parse() on each value to get a Vector, but I don't want to convert to an RDD first.
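
For reference, this is roughly what parsing a single value gives (a minimal sketch, assuming Spark 1.x and the MLlib linalg classes):

  import org.apache.spark.mllib.linalg.Vector;
  import org.apache.spark.mllib.linalg.Vectors;

  // "[1,4,5]" is the string stored in the features column
  Vector v = Vectors.parse("[1,4,5]");   // dense vector [1.0, 4.0, 5.0]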

I want to get this into a Spark DataFrame where the features column is an org.apache.spark.mllib.linalg.Vector.

I am reading this into a DataFrame with the Databricks spark-csv API, and I'm trying to convert the features column to a Vector.

Does anyone know how to do this in Java?

Upvotes: 0

Views: 596

Answers (1)

Mike Pone

Reputation: 19330

I found one way to do it with a UDF. Are there any other ways to do this?

  // sqlc is an org.apache.spark.sql.SQLContext; imports used below:
  // java.util.HashMap, org.apache.spark.mllib.linalg.Vector/Vectors/VectorUDT,
  // org.apache.spark.sql.DataFrame, org.apache.spark.sql.functions,
  // org.apache.spark.sql.api.java.UDF1, and org.apache.spark.sql.types.*

  HashMap<String, String> options = new HashMap<String, String>();
  options.put("header", "true");
  String input = args[0];

  // Register a UDF that parses the "[1,4,5]" string into an MLlib Vector
  sqlc.udf().register("toVector", new UDF1<String, Vector>() {
     @Override
     public Vector call(String t1) throws Exception {
        return Vectors.parse(t1);
     }
  }, new VectorUDT());

  // Read both columns as strings first
  StructField[] fields = {
     new StructField("id", DataTypes.StringType, false, Metadata.empty()),
     new StructField("features", DataTypes.StringType, false, Metadata.empty())};
  StructType schema = new StructType(fields);

  DataFrame df = sqlc.read().format("com.databricks.spark.csv")
        .schema(schema).options(options).load(input);

  // Replace the string column with the parsed Vector column
  df = df.withColumn("features", functions.callUDF("toVector", df.col("features")));
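
To sanity-check the conversion, printing the schema afterwards should show features as a vector type rather than a string (just a quick verification step, assuming the UDF registered correctly):

  df.printSchema();   // "features" should now be reported as a vector type instead of string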

Upvotes: 1
