Argument free clustering via boundary extraction for massive point-data Sets

Journal Publication ResearchOnline@JCU
Estivill-Castro, Vladimir;Lee, Ickjai
Abstract

Minimizing the need for user-specified arguments results in less costly Geographical Data Mining. For massive data sets, the need to find best-fit arguments in semi-automatic clustering is not the only concern, the manipulation of data to find arguments opposes the philosophy of ‘‘let the data speak for themselves’’ that underpins exploratory data analysis. Our new approach consists of effective and efficient methods for discovering cluster boundaries in point-data sets. Parameters are not specified by users. Rather, values for parameters are revealed from the proximity structures of Voronoi modeling, and thus, an algorithm, AUTOCLUST, calculates them from the Delunay Diagram. We detect clusters of different densities and sparse clusters near to high-density clusters. Multiple bridges linking clusters are identified and removed. All this within O(n log n) time, where n is the number of data points. We contrast AUTOCLUST with algorithms for clustering large georeferenced sets of points. These comparisons confirm the virtues of our approach.

Journal

N/A

Publication Name

N/A

Volume

26

ISBN/ISSN

1873-7587

Edition

N/A

Issue

4

Pages Count

N/A

Location

N/A

Publisher

N/A

Publisher Url

N/A

Publisher Location

N/A

Publish Date

N/A

Url

N/A

Date

N/A

EISSN

N/A

DOI

10.1016/S0198-9715(01)00044-8