Reputation: 5297
I am trying to figure out how to do this but Google does not seem to find me an answer.
I have a nice dataset that I am able to generate a pruned and unpruned decision trees in Weka. From this I can get the 10-fold cross-validation information which is nice.
But I would like to run statistical tests between the two decision trees, i.e. T-Test or Wilcoxon, using R. I have been suggested to use the DMwR and RWeka package but as I have no prior experience with this language, reading the RWeka docs and googling for tutorials or other explanations, I am coming up empty handed.
Upvotes: 1
Views: 313
Reputation: 6087
As far as I know, you can run a t-test using WEKA's Experimenter. Where you select the dataset and the algorithms (they could be the same algorithm with different parameters) and then perform a t-test.
About Wilcoxon test, what I usually do is to "save" each model generated by WEKA (they would be Java Objects) and I read these objects in my Java program (maybe you can do it in R) and I perform the test in a Java program.
Upvotes: 1