Argument free clustering via boundary extraction for massive point-data Sets
Journal Publication ResearchOnline@JCUAbstract
Minimizing the need for user-specified arguments results in less costly Geographical Data Mining. For massive data sets, the need to find best-fit arguments in semi-automatic clustering is not the only concern, the manipulation of data to find arguments opposes the philosophy of ‘‘let the data speak for themselves’’ that underpins exploratory data analysis. Our new approach consists of effective and efficient methods for discovering cluster boundaries in point-data sets. Parameters are not specified by users. Rather, values for parameters are revealed from the proximity structures of Voronoi modeling, and thus, an algorithm, AUTOCLUST, calculates them from the Delunay Diagram. We detect clusters of different densities and sparse clusters near to high-density clusters. Multiple bridges linking clusters are identified and removed. All this within O(n log n) time, where n is the number of data points. We contrast AUTOCLUST with algorithms for clustering large georeferenced sets of points. These comparisons confirm the virtues of our approach.
Journal
N/A
Publication Name
N/A
Volume
26
ISBN/ISSN
1873-7587
Edition
N/A
Issue
4
Pages Count
N/A
Location
N/A
Publisher
N/A
Publisher Url
N/A
Publisher Location
N/A
Publish Date
N/A
Url
N/A
Date
N/A
EISSN
N/A
DOI
10.1016/S0198-9715(01)00044-8