Optimize Decision Tree Parameters in RapidMiner

Question

I am working with a rather large dataset (770K records , 2K attributes , almost all of these attributes are binomial but in integer form) ,

I want to apply decision tree on the data with a 10-fold cross validation, but I've some problems :

1.Why does decision tree (e.g. with depth of 10) takes so much time to be trained ? actually I balance the data (as it's imbalanced) to 40% of the original size (~320K records) before training the tree , but it still takes a lot of time , is there any other version of Decision Tree which result the same performance and takes less time ? (Does making the attributes in binomial form makes it faster ?)

2.How can I optimize parameter of decision tree ? Should I optimize it on the the whole X-validation ?

Optimize Decision Tree Parameters in RapidMiner

Answers (1)

Related Questions