Why Does LIBSVM Grid Search Slow Down?

Question

LIBSVM, Java, grid search, performance, slows down

I have been working with the Java version of LIBSVM. In outline, I do a naive grid search for the optimal C and gamma pairs, take the resulting trained model files, and then perform cross-validation against 10 k-fold data sets to find the best parameters.

I noticed what seemed to be anecdotal slowdowns as svm_predict is called repeatedly during the grid search. At first I thought this was simply a fluke, but I have been testing carefully, and the tests indicate that the processing time for svm_predict grows rapidly, apparently exponentially, with the number of times it has been called.

The first time it is called, svm_predict takes ~15 milliseconds to perform the predictions on my machine. By the 500th sequential call, svm_predict takes ~541 milliseconds. By the 1000th sequential call, it takes ~8931 milliseconds. By the 1220th call, svm_predict is at ~21260 milliseconds per call.
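For reference, a minimal sketch of how per-call timings like the ones above can be collected. The names `CallTimer` and `timeCalls` are hypothetical, and the workload in `main` is only a placeholder standing in for the wrapped svm_predict call; it is not my actual harness.

```java
import java.util.function.IntConsumer;

class CallTimer {
    // Runs `task` `calls` times and returns the wall-clock time of each call in milliseconds.
    static double[] timeCalls(int calls, IntConsumer task) {
        double[] elapsedMs = new double[calls];
        for (int i = 0; i < calls; i++) {
            long start = System.nanoTime();
            task.accept(i); // hypothetical stand-in for one svm_predict invocation
            elapsedMs[i] = (System.nanoTime() - start) / 1_000_000.0;
        }
        return elapsedMs;
    }

    public static void main(String[] args) {
        // Placeholder workload (assumption): replace with the real prediction call.
        double[] ms = timeCalls(1000, i -> Math.sqrt(i));
        // Print every 250th sample to see whether latency drifts with the call count.
        for (int i = 0; i < ms.length; i += 250) {
            System.out.printf("call %d: %.3f ms%n", i + 1, ms[i]);
        }
    }
}
```

Recording every call separately, rather than only a total, is what makes a growing per-call cost visible.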

(NOTE: the increases in time do not appear to be related to the C-gamma pairs themselves. The processing time increases consistently even when the pairs are randomized; that is, the model itself is not increasing in complexity.)

I have run the software in a profiler and see no obvious memory leaks or any other memory issues: both heap and stack traces remain fairly stable or show oscillations well within the allocated memory limits. Testing also suggests that garbage collection does not affect performance at all.

My software "wraps" LIBSVM internally. The grid search merely runs through a range of C-gamma pairs one at a time, calling svm_predict on each training file to measure performance.
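The loop described above has roughly the following shape. This is a simplified sketch, not my actual code: `GridSearchSketch`, `search`, and the `scorer` callback are hypothetical names, the scorer stands in for "train, call svm_predict, measure accuracy", and the power-of-two exponent ranges used in `main` are an assumption (a commonly suggested starting grid), not necessarily the ranges I use.

```java
import java.util.function.DoubleBinaryOperator;

class GridSearchSketch {
    // Tries every (C, gamma) pair on a power-of-two grid and returns {bestC, bestGamma, bestScore}.
    static double[] search(int cExpMin, int cExpMax, int gExpMin, int gExpMax,
                           int step, DoubleBinaryOperator scorer) {
        double bestC = Double.NaN;
        double bestGamma = Double.NaN;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int ce = cExpMin; ce <= cExpMax; ce += step) {
            for (int ge = gExpMin; ge <= gExpMax; ge += step) {
                double c = Math.pow(2, ce);
                double gamma = Math.pow(2, ge);
                // Hypothetical stand-in for training plus svm_predict on held-out data.
                double score = scorer.applyAsDouble(c, gamma);
                if (score > bestScore) {
                    bestScore = score;
                    bestC = c;
                    bestGamma = gamma;
                }
            }
        }
        return new double[] {bestC, bestGamma, bestScore};
    }

    public static void main(String[] args) {
        // Toy scorer (assumption): peaks at C = 8 and gamma = 0.03125.
        DoubleBinaryOperator toy = (c, g) -> -(Math.abs(c - 8.0) + Math.abs(g - 0.03125));
        double[] best = search(-5, 15, -15, 3, 2, toy);
        System.out.printf("best C=%g gamma=%g score=%g%n", best[0], best[1], best[2]);
    }
}
```

In this sketch each iteration keeps only local state, so nothing should accumulate between pairs.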

Has anyone else seen this issue? Is there a fix? Grid search is very intensive anyway, but with times quickly climbing to 21 seconds per prediction, even a fairly basic search (400 C-gamma pairs) becomes very time-consuming, even on high-end equipment. Any advice?

NEW INFO (10 Oct 2014): I continue to test and can tentatively confirm that the issue appears to be LIBSVM slowing down with repeated calls to svm_predict during a grid search.

I also have a test harness to manually test svm_predict against previously generated MODEL and DATA files. That is, I can manually test the prediction for each model-data file pair. After 648 iterations of the grid search, the elapsed time to predict is 1183 milliseconds per file. Running a single instance of svm_predict manually on precisely the same model-data file pair takes 34 milliseconds. This confirms my concerns about svm_predict. Has anyone else seen this, or does anyone have a workable remedy to suggest?
