A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment

Campello, R.J.G.B. (2007) A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Letters, 28 (7). pp. 833-841.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: http://dx.doi.org/10.1016/j.patrec.2006....
 
122
1


Abstract

A fuzzy extension of the Rand index [Rand, W.M., 1971. Objective criteria for the evaluation of clustering methods. J. Amer. Statist. Assoc. 846-850] is introduced in this paper. The Rand index is a traditional criterion for assessment and comparison of different results provided by classifiers and clustering algorithms. It is able to measure the quality of different hard partitions of a data set from a classification perspective, including partitions with different numbers of classes or clusters. The original Rand index is extended here by making it able to evaluate a fuzzy partition of a data set - provided by a fuzzy clustering algorithm or a classifier with fuzzy-like outputs against a reference hard partition that encodes the actual (known) data classes. A theoretical formulation based on formal concepts from the fuzzy set theory is derived and used as a basis for the mathematical interpretation of the Fuzzy Rand Index proposed. The fuzzy counterparts of other (five) related indexes, namely, the Adjusted Rand Index of Hubert and Arabic, the Jaccard coefficient, the Minkowski measure, the Fowlkes-Mallows Index, and the r statistics, are also derived from this formulation.

Item ID: 47619
Item Type: Article (Research - C1)
ISSN: 1872-7344
Keywords: fuzzy clustering, fuzzy classification, external validity indexes, Rand index, adjusted Rand index, Jaccard coefficient, Minkowski measure, Fowlkes-Mallows index, Gamma statistics
Funders: Brazilian National Research Council (CNPq), São Paulo Research Foundation (FAPESP)
Projects and Grants: CNPq Grant no. #307554/2003-1, FAPESP Grant no. #06/50231-5
Date Deposited: 08 Mar 2017 07:40
FoR Codes: 01 MATHEMATICAL SCIENCES > 0104 Statistics > 010401 Applied Statistics @ 100%
SEO Codes: 97 EXPANDING KNOWLEDGE > 970101 Expanding Knowledge in the Mathematical Sciences @ 100%
Downloads: Total: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page