Reputation: 3
I want to use Weka to find outliers by locating the attribute values more than 3 standard deviations from the mean. I'm sure there's a way to do it.
I already know how to search for outliers using the InterquartileRange filter--that just isn't what I want to do in this case. And Weka gives me the the mean and SD, so figuring out my thresholds is easy enough. I just don't know what filter to use, and searching hasn't worked (I get answers to much more advanced questions).
Upvotes: 0
Views: 249
Reputation: 2608
If you compute the thresholds yourself, you could use the SubsetByExpression filter to discard instances that fall outside these values (for a particular attribute).
Upvotes: 0