Automatic aspect discrimination in data clustering
Horta, Danilo, and Campello, Ricardo J.G.B. (2012) Automatic aspect discrimination in data clustering. Pattern Recognition, 45 (12). pp. 4370-4388.
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets.
Item ID: | 47652 |
---|---|
Item Type: | Article (Research - C1) |
ISSN: | 1873-5142 |
Keywords: | aspect discrimination; attribute weighting; cluster validation; clustering |
Funders: | National Council for Scientific and Technological Development (CNPq), São Paulo Research Foundation (FAPESP) |
Date Deposited: | 10 Mar 2017 01:07 |
FoR Codes: | 01 MATHEMATICAL SCIENCES > 0104 Statistics > 010401 Applied Statistics @ 100% |
SEO Codes: | 97 EXPANDING KNOWLEDGE > 970101 Expanding Knowledge in the Mathematical Sciences @ 100% |
Downloads: |
Total: 7 |
More Statistics |