Reputation: 2365
I am trying to understand the implementation of the sklearn chi2 for feauture selection algorithm.
I think I understand the chi2 formula.
After getting this value we will see the table for 1 degree of freedom and according to ou need choose the p value.If chi2 value is greater than keep it otherwise ignore it.
My question is how does sklearn package choose this p-value on its own?It just requires X and y array as input.
http://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.chi2.html
and are the chi2 scores always normalized? link to the paper-http://courses.ischool.berkeley.edu/i256/f06/papers/yang97comparative.pdf
Upvotes: 0
Views: 1633
Reputation: 4150
The idea is to perform univariate feature selection:
So going back to your question, I think you misunderstood the following point:
Upvotes: 1