Filter feature selection based Boolean Modelling for Genetic Network Inference

Gamage, Hasini Nakulugamuwa, Chetty, Madhu, Shatte, Adrian, and Hallinan, Jennifer (2022) Filter feature selection based Boolean Modelling for Genetic Network Inference. BioSystems, 221. 104757.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1016/j.biosystems.202...
 
1


Abstract

The reconstruction of Gene Regulatory Networks (GRNs) from time series gene expression data is highly relevant for the discovery of complex biological interactions and dynamics. Various computational strategies have been developed for this task, but most approaches have low computational efficiency and are not able to cope with high-dimensional, low sample-number, gene expression data. In this paper, we introduce a novel combined filter feature selection approach for efficient and accurate inference of GRNs. A Boolean framework for network modelling is used to demonstrate the efficacy of the proposed approach. Using discretized microarray expression data, the genes most relevant to each target gene are first filtered using ReliefF, an instance-based feature ranking method that is here applied for the first time to GRN inference. Then, further gene selection from the filtered-gene list is done using a mutual information-based min-redundancy max-relevance criterion by eliminating irrelevant genes. This combined method is executed on resampled datasets to finalize the optimal set of regulatory genes. Building upon our previous research, a Pearson correlation coefficient-based Boolean modelling approach is utilized for the efficient identification of the optimal regulatory rules associated with selected regulatory genes. The proposed approach was evaluated using gene expression datasets from small-scale and medium-scale real gene networks, and was observed to be more effective than Linear Discriminant Analysis, performed better than the individual feature selection methods, and obtained improved Structural Accuracy with a higher number of true positives than other state-of-the-art methods, while outperforming these methods with respect to Dynamic Accuracy and efficiency.

Item ID: 81614
Item Type: Article (Research - C1)
ISSN: 1872-8324
Keywords: Gene Regulatory Networks; Feature selection; Boolean network; ReliefF; Min-redundancy max-relevance; Linear discriminant analysis
Copyright Information: © 2022 Elsevier B.V. All rights reserved.
Date Deposited: 24 Jan 2024 02:25
FoR Codes: 31 BIOLOGICAL SCIENCES > 3102 Bioinformatics and computational biology > 310202 Biological network analysis @ 80%
31 BIOLOGICAL SCIENCES > 3102 Bioinformatics and computational biology > 310299 Bioinformatics and computational biology not elsewhere classified @ 10%
46 INFORMATION AND COMPUTING SCIENCES > 4611 Machine learning > 461199 Machine learning not elsewhere classified @ 10%
SEO Codes: 28 EXPANDING KNOWLEDGE > 2801 Expanding knowledge > 280102 Expanding knowledge in the biological sciences @ 100%
Downloads: Total: 1
Last 12 Months: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page