An unsupervised self-organizing learning with support vector ranking for imbalanced datasets

Nguwi, Yok-Yen, and Cho, Siu-Yeung (2010) An unsupervised self-organizing learning with support vector ranking for imbalanced datasets. Expert Systems and Application, 37 (12). pp. 8303-8312.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: http://dx.doi.org/10.1016/j.eswa.2010.05...
 
24
5


Abstract

The aim of computational learning algorithm is to establish grounds that work for any types of data, once and for all. However, majority of the classifiers have their base from balanced datasets. This paper discusses the issues related to imbalanced data distribution problem and the common strategy to deal with imbalance datasets. We propose a model capable of handling imbalance datasets well in which other typical classifiers fail to do so. The model adopted a derivation of supportvector machines in selecting variables so that the problem of imbalanced data distribution can be relaxed. Then, we used an Emergent Self-Organizing Map (ESOM) to cluster the ranker features so as to provide clusters for unsupervised classification. This work progresses by examining the efficiency of the model in evaluating imbalanced datasets. Experimental results show that the criterion based on weight vector derivative achieves good results and performs consistently well over imbalance datasets. In general, our approach outperforms other classification methods which are unable to handle the imbalanced data distribution in the testing datasets.

Item ID: 21657
Item Type: Article (Research - C1)
ISSN: 1873-6793
Keywords: imbalanced datasets; support vector ranking; emergent self-organizing map
Date Deposited: 25 Jun 2012 06:36
FoR Codes: 08 INFORMATION AND COMPUTING SCIENCES > 0801 Artificial Intelligence and Image Processing > 080105 Expert Systems @ 100%
SEO Codes: 89 INFORMATION AND COMMUNICATION SERVICES > 8999 Other Information and Communication Services > 899999 Information and Communication Services not elsewhere classified @ 100%
Downloads: Total: 5
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page