Splitting <dbl [2]> result of Sparklyr as a spark object

Question

I have a problem splitting the output of a random forest model generated with sparklyr.

I'm using the following code to fit a model that predicts a {0 | 1} value, and then to predict the outcome for a specified validation set.

model <- ml_random_forest(tbl(sc, "train_set"), formulea)

prediction <- sdf_predict(model, tbl(sc, "validation_set")) %>%
  select(account_no, probability, prediction)

The resulting prediction object looks like this:

Source:   query [3.744e+06 x 3]
Database: spark connection master=yarn-client app=Dev - model v.11 local=FALSE

   account_no probability prediction
                    
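1     5053177   <dbl [2]>          1
2     6508441   <dbl [2]>          1
3     7805527   <dbl [2]>          1
4    10001696   <dbl [2]>          1
5    10004230   <dbl [2]>          1
6    10005647   <dbl [2]>          1
7    10006029   <dbl [2]>          1
8    10018558   <dbl [2]>          0
9    10019161   <dbl [2]>          1
10   10031652   <dbl [2]>          1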
# ... with 3.744e+06 more rows

How can I split the probability list within Spark, so that I get only the first number of each list? Something like this:

   account_no probability 
              
1     5053177   <0.9726>          
2     6508441   <0.1234>

Hope someone can help to solve this issue.

Greetings, Jitske
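
One possible approach, offered as a sketch rather than a confirmed answer: sparklyr provides sdf_separate_column(), which splits a vector column into separate scalar columns without collecting the data into R. The example below assumes that `prediction` is the Spark table from the question, that `probability` is a Spark vector column with two elements, and that the installed sparklyr version includes sdf_separate_column(). The names `probability_0` and `probability_1` are made up for illustration, and which element belongs to which class depends on how Spark indexed the labels, so that should be checked against the data.

```r
library(sparklyr)
library(dplyr)

# Assumption: `prediction` is the Spark table built in the question, with
# columns account_no, probability (a two-element vector) and prediction.
prediction_split <- prediction %>%
  # Split the vector column into two numeric columns, still inside Spark.
  # The new column names are hypothetical.
  sdf_separate_column(
    "probability",
    into = c("probability_0", "probability_1")
  ) %>%
  # Keep only the account number and the first element of the vector.
  select(account_no, probability_0)

# Inspect a few rows; this triggers the Spark job.
head(prediction_split)
```

Because every step runs on the Spark side, the result is still a Spark table and can be registered or written out like any other. Newer sparklyr releases may already return per-class probability columns from ml_predict(), in which case the split is not needed.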
