A comparative study on the use of correlation coefficients for redundant feature elimination
Jaskowiak, Pablo A., Campello, Ricardo J.G.B., Covões, Thiago F., and Hruschka, Eduardo R. (2010) A comparative study on the use of correlation coefficients for redundant feature elimination. In: Proceedings of the 11th Brazilian Symposium on Neural Networks. pp. 13-18. From: SBRN 2010: 11th Brazilian Symposium on Neural Networks, 23-28 October 2010, São Paulo, Brazil.
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
Simplified Silhouette Filter (SSF) is a recently introduced feature selection method that automatically estimates the number of features to be selected. To do so, a sampling strategy is combined with a clustering algorithm that seeks clusters of correlated (potentially redundant) features. It is well known that the choice of a similarity measure may have great impact in clustering results. As a consequence, in this application scenario, this choice may have great impact in the feature subset to be selected. In this paper we study six correlation coefficients as similarity measures in the clustering stage of SSF, thus giving rise to several variants of the original method. The obtained results show that, in particular scenarios, some correlation measures select fewer features than others, while providing accurate classifiers.
Item ID: | 47968 |
---|---|
Item Type: | Conference Item (Research - E1) |
ISBN: | 978-0-7695-4210-2 |
Keywords: | feature selection; correlation coefficients; classification |
Funders: | CNPq, FAPESP |
Date Deposited: | 12 Jul 2017 02:42 |
FoR Codes: | 01 MATHEMATICAL SCIENCES > 0104 Statistics > 010401 Applied Statistics @ 100% |
SEO Codes: | 97 EXPANDING KNOWLEDGE > 970101 Expanding Knowledge in the Mathematical Sciences @ 100% |
Downloads: |
Total: 1 |
More Statistics |