Weka command line and strings

Question

I'd like to do some text classification (Naive Bayes) with Weka using the Simple CLI (command line), but I have one problem: Naive Bayes in Weka can't handle string attributes, so they have to be converted first. How can I convert the strings in my ARFF file from the command line?

sentences.arff example

@relation 'data set'

@attribute text string
@attribute class {swedish,'?',english}

@data
'detta är en svensk text',swedish
'this is an english text',english
'what is the name of this book?',english
'vilken färg är en liten stuga?',swedish
'you are the best',english
'en enstaka fjäder i hatten fördröjer livet ett tag',swedish
'detta är en annan svensk text',swedish

I'm using the following command to create a model:

java weka.classifiers.bayes.NaiveBayes -t data.arff -d data.model


Answers (1)
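
A minimal sketch of one way to do this, assuming weka.jar is on the CLASSPATH and the file is saved as data.arff, as in the command from the question. The idea is to wrap NaiveBayes in FilteredClassifier so that the StringToWordVector filter turns the string attribute into word attributes during training and again at prediction time. The output names data_vector.arff and vector.model are placeholders chosen for illustration.

```shell
# Train NaiveBayes behind a StringToWordVector filter. The saved model
# includes the filter, so new text is converted the same way later.
java weka.classifiers.meta.FilteredClassifier \
  -t data.arff -d data.model \
  -F weka.filters.unsupervised.attribute.StringToWordVector \
  -W weka.classifiers.bayes.NaiveBayes

# Alternative: convert the file once, then train on the converted copy.
java weka.filters.unsupervised.attribute.StringToWordVector \
  -c last -i data.arff -o data_vector.arff

# StringToWordVector usually places the class attribute first in its
# output, so check the header of data_vector.arff and adjust -c if needed.
java weka.classifiers.bayes.NaiveBayes -t data_vector.arff -c 1 -d vector.model
```

In the Simple CLI window, each command should go on a single line without the backslashes. The FilteredClassifier route is usually the safer of the two, because the word dictionary built from the training data is reused for any test data, whereas filtering training and test files separately can produce incompatible attribute sets.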
