Reputation: 93
I am trying to understand how to properly use the R package FSelector, and in particular, its information.gain function. According to the documentation:
information gain = H(class) + H(attribute) - H(class,attribute)
What do these quantities mean? And how do they relate to the standard definition of information gain? As far as I know, the information gain due to an attribute is
information gain = H(S) - sum_i p(S_i) * H(S_i)
where H(.) is entropy, S is the unpartitioned set, S_i are the subsets of S induced by the attribute, and p(S_i) = |S_i| / |S|.
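To check my understanding, here is a small base-R sketch (no packages; the `entropy` helper and the toy vectors are my own) verifying that the two formulas give the same number:

```r
# Compare H(class) + H(attribute) - H(class, attribute)  [FSelector's formula]
# with    H(S) - sum_i p(S_i) * H(S_i)                   [the textbook formula]
# on a toy dataset.

entropy <- function(x) {
  p <- prop.table(table(x))
  -sum(p * log2(p))
}

# Toy data: a binary class and a three-valued attribute
cls  <- c("yes", "yes", "no", "no", "yes", "no")
feat <- c("a",   "a",   "b",  "b",  "c",   "c")

# FSelector's formula: the joint entropy is the entropy of the paired values
joint <- paste(cls, feat)
ig1 <- entropy(cls) + entropy(feat) - entropy(joint)

# Textbook formula: entropy of S minus the weighted entropies of the subsets S_i
p_i <- prop.table(table(feat))
h_i <- tapply(cls, feat, entropy)
ig2 <- entropy(cls) - sum(p_i * h_i[names(p_i)])

all.equal(ig1, ig2)  # TRUE: the two definitions coincide
```

Both expressions are the mutual information I(class; attribute), which is why they agree.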
I would also like to know if there are any other packages that use the concept of Information Gain.
Thank you for your help.
Upvotes: 3
Views: 14376
Reputation: 131
The idea behind FSelector and its functions is to choose the best subset of attributes in a data set; depending on the data set you are dealing with, some attributes may be unnecessary.
information.gain ranks the attributes by their information gain, a measure based on entropy (there is plenty of literature on the topic).
Here is an example using the famous IRIS dataset (See the full example at http://rgm3.lab.nig.ac.jp/RGM/R_rdfile?f=FSelector/man/information.gain.Rd&d=R_CC):
library(FSelector)
data(iris)

# Score each attribute by its information gain with respect to Species
weights <- information.gain(Species ~ ., iris)
print(weights)

# Keep the two highest-scoring attributes and build a formula from them
subset <- cutoff.k(weights, 2)
f <- as.simple.formula(subset, "Species")
print(f)
This means that the two most informative attributes are Petal.Width and Petal.Length.
Several other packages provide similar attribute-evaluation functions (RWeka, CORElearn, ...).
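For instance, here is a sketch of an equivalent ranking with CORElearn's attrEval() (assuming the package is installed; "InfGain" is one of its estimator names, but treat the exact call as an untested example rather than a drop-in replacement):

```r
# Hedged sketch: information-gain ranking of the iris attributes via CORElearn.
library(CORElearn)
data(iris)

# attrEval() evaluates each predictor against the class with the chosen estimator
weights <- attrEval(Species ~ ., iris, estimator = "InfGain")
print(weights)
```

As with FSelector, Petal.Length and Petal.Width should come out on top.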
Upvotes: 9