Reputation: 47
I have two data sets, one for training and one for testing. I am going to predict the values of a column with numerical type in test data set. In order to predict the value of an instance, I have to find the k nearest neighbors of that instance in training data set, and calculate the average of values. (waiting also can be used).
For example:
......a..................b....................10
......a..................b....................12
......c..................d....................16
......a..................b....................?
I need a method of data mining to give me the result = (10+12)/2 = 11 Which method should I use to get such a result? And do you know any good document which explains how to use that method?
Upvotes: 1
Views: 7540
Reputation: 2811
KNN in Weka is implemented as IBk. It is capable of predicting numerical and nominal values.
If you are using the Weka Explorer (GUI) you can find it by looking for the "Choose" button under the Classify tab. Once there navigate the folders:
classifiers -> lazy -> IBk
Once you select IBk, click on the box immediately to the right of the button. This will open up a large number of options. If you then click on the button "More" in the options window, you will see all of the options explained. If you need more of an explanation of the classifier they even list the academic paper that the classifier is based on. You can do this for all of the classifiers to obtain additional information.
Upvotes: 5