Bagging support vector machine for classification of SELDI-TOF mass spectra of ovarian cancer serum samples

Zhang, Bailing, Pham, Tuan D., and Zhang, Yanchun (2007) Bagging support vector machine for classification of SELDI-TOF mass spectra of ovarian cancer serum samples. In: Orgun, Mehmet A., and Thornton, John, (eds.) AI 2007: Advances in Artificial Intelligence. Lecture Notes in Computer Science, 4830 . Springer, Berlin, Germany, pp. 820-826.

[img] PDF (Published Version)
Restricted to Repository staff only

View at Publisher Website: http://dx.doi.org/10.1007/978-3-540-7692...
 
7
1


Abstract

There has been much progresses recently about the identification of diagnostic proteomic signatures for different human cancers using surface-enhanced laser desorption ionization time-of-flight (SELDI-TOF) mass spectrometry. To identify proteomic patterns in serum to discriminate cancer patients from normal individuals, many classification methods have been experimented, often with successful results. Most of these earlier studies, however, are based on the direct application of original mass spectra, together with dimension reduction methods like PCA or feature selection methods like T-tests. Because only the peaks of MS data correspond to potential biomarkers, it is important to study classification methods using the detected peaks. This paper investigates ovarian cancer identification from the detected MS peaks by applying Bagging Support Vector Machine as a special strategy of bootstrap aggregating (Bagging). In bagging SVM, each individual SVM is trained independently, using randomly chosen training samples via a bootstrap technique. The trained individual SVMs are aggregated to make a collective decision in an appropriate way, for example, the majority voting. Bagged SVM demonstrated a 94% accuracy with 95% sensitivity and 92% specificity respectively by using the detected peaks. The efficiency can be further improved by applying PCA to reduce the dimension.

Item ID: 2846
Item Type: Book Chapter (Research - B1)
ISBN: 978-3-540-76926-2
ISSN: 1611-3349
Keywords: patterern recognition; bioinformatics
Date Deposited: 24 Sep 2009 23:57
FoR Codes: 08 INFORMATION AND COMPUTING SCIENCES > 0801 Artificial Intelligence and Image Processing > 080109 Pattern Recognition and Data Mining @ 50%
08 INFORMATION AND COMPUTING SCIENCES > 0801 Artificial Intelligence and Image Processing > 080106 Image Processing @ 50%
SEO Codes: 89 INFORMATION AND COMMUNICATION SERVICES > 8902 Computer Software and Services > 890205 Information Processing Services (incl. Data Entry and Capture) @ 100%
Downloads: Total: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page