A New Local Distance-Based Outlier Detection Approach for Scattered Real-World Data

Date Added: Mar 2010
Format: PDF

Detecting outliers, objects that are grossly different from or inconsistent with the rest of the dataset, is a major challenge in real-world KDD applications. Existing outlier detection methods are ineffective on scattered real-world datasets because of implicit data patterns and parameter setting issues. To address these issues, the authors define a novel Local Distance-based Outlier Factor (LDOF) that measures the outlierness of objects in scattered datasets. LDOF uses the location of an object relative to its neighbours to determine the degree to which the object deviates from its neighbourhood.
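
The abstract does not give the formula, so the following is only a minimal sketch of one way such a factor could be computed. It assumes LDOF is the ratio of an object's average distance to its k nearest neighbours over the average pairwise distance among those neighbours, with Euclidean distance. The function name `ldof`, the parameter `k`, and the sample points are illustrative choices, not taken from the paper.

```python
import math
from itertools import combinations


def ldof(points, index, k):
    """Sketch of a local distance-based outlier factor (assumed formulation).

    Returns the mean distance from points[index] to its k nearest
    neighbours, divided by the mean pairwise distance among those
    neighbours. Values well above 1 suggest the object lies outside
    its neighbourhood.
    """
    if k < 2 or k >= len(points):
        raise ValueError("k must satisfy 2 <= k < len(points)")

    target = points[index]
    # Distance from the target to every other point, nearest first.
    by_distance = sorted(
        (math.dist(target, other), j)
        for j, other in enumerate(points)
        if j != index
    )
    nearest = by_distance[:k]

    # Average distance from the target to its k nearest neighbours.
    knn_distance = sum(d for d, _ in nearest) / k

    # Average distance over all pairs of those neighbours.
    pairs = list(combinations([points[j] for _, j in nearest], 2))
    inner_distance = sum(math.dist(a, b) for a, b in pairs) / len(pairs)

    if inner_distance == 0:
        # All neighbours coincide, so the ratio is undefined; treat as extreme.
        return math.inf
    return knn_distance / inner_distance


# Four points on a unit square plus one point far away from them.
sample = [(0, 0), (0, 1), (1, 0), (1, 1), (5, 5)]
scores = [ldof(sample, i, k=3) for i in range(len(sample))]
```

In this toy dataset the corner points of the square score close to 1, while the distant point `(5, 5)` scores several times higher, which is the behaviour the abstract describes: the score reflects how far an object sits from its neighbourhood relative to how spread out that neighbourhood is.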