Reputation: 105
Can you tell me which unit the parameter d is measured in for the DBOutlierDetection algorithm (or DBOutlierScore)? Centimetres? Millimetres?
I have to somehow compare the neighbourhood covered by parameter d with LOF's k.
Upvotes: 2
Views: 60
Reputation: 8715
It depends on the distance measure that you set with -algorithm.distancefunction.
The parameter is a distance; the semantic meaning of the distance depends on your data and distance function.
For example, if your data are latitude, longitude pairs
Similarly, if you are using Euclidean distance, and your
You can normalize or standardize the data. For example, if you normalize by mean and standard deviation, the unit of the measure disappears. Using Euclidean distance on such data then has the unit of "standard deviations". But that unit does not make much sense anymore on a multimodal distribution, as is common in outlier detection and clustering.
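To illustrate the point about units, here is a minimal Python sketch (not ELKI code; the sample points and the helper names `euclidean` and `standardize` are made up for the example). It measures the same pair of points once in centimetres and once in millimetres, then again after standardizing each attribute by its mean and standard deviation.

```python
import math
import statistics


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def standardize(rows):
    """Z-score each attribute: subtract its mean, divide by its standard deviation."""
    columns = list(zip(*rows))
    means = [statistics.mean(c) for c in columns]
    stdevs = [statistics.pstdev(c) for c in columns]
    return [
        tuple((v - m) / s for v, m, s in zip(row, means, stdevs))
        for row in rows
    ]


# Hypothetical sample data: the same four points, in centimetres and in millimetres.
points_cm = [(1.0, 2.0), (2.0, 4.0), (3.0, 3.0), (10.0, 12.0)]
points_mm = [tuple(10.0 * v for v in p) for p in points_cm]

# On raw data the distance carries the unit of the data,
# so a threshold d chosen for centimetres is wrong for millimetres.
d_cm = euclidean(points_cm[0], points_cm[3])
d_mm = euclidean(points_mm[0], points_mm[3])

# After standardization the original unit cancels out,
# and the distance is expressed in "standard deviations".
z_cm = standardize(points_cm)
z_mm = standardize(points_mm)
d_z_cm = euclidean(z_cm[0], z_cm[3])
d_z_mm = euclidean(z_mm[0], z_mm[3])

print(d_cm, d_mm)      # the second value is ten times the first
print(d_z_cm, d_z_mm)  # the two values agree up to rounding
```

So a value of d only has a meaning relative to how the data were scaled and which distance function was used; the algorithm itself does not attach a unit to it.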
Upvotes: 2