Reconstruction of Large-Scale Gene Regulatory Networks Using Regression-based Models

No Thumbnail Available
Mohamed Salleh F.H.
Zainudin S.
Raih M.F.
Journal Title
Journal ISSN
Volume Title
Institute of Electrical and Electronics Engineers Inc.
Research Projects
Organizational Units
Journal Issue
Gene regulatory networks (GRN) reconstruction is the process of identifying gene regulatory interactions from experimental data through computational analysis. GRN reconstruction-related works have boosted many major discoveries in finding drug targets for the treatment of human diseases, including cancer. However, reconstructing GRNs from gene expression data is a challenging problem due to high-dimensionality and very limited number of observations data, severe multicollinearity and the tendency of generating cascade errors. These problems lead to the reduced performance of GRN inference methods, hence resulting in the method being unreliable for scientific usage. We propose a method called P-CALS (Principal Component Analysis and Partial Least Squares) that is derived from the combination of PCA (Principal Component Analysis) with PLS (Partial Least Squares). The performance of P-CALS is assessed to the genome-scale GRN of E. coli, S. cerevisiae and an in-silico datasets. We discovered that P-CALS achieved satisfactory results as all of the sub-networks from diverse datasets achieved AUROC values above 0.5 and gene relationships were discovered at the most complex network tested in the experiments. � 2018 IEEE.
Big data; Complex networks; Diseases; Escherichia coli; Gene expression; Least squares approximations; Multivariant analysis; Regression analysis; Computational analysis; Gene regulatory networks; Large-scale gene regulatory networks; Multi variate analysis; Partial least square (PLS); PCA (principal component analysis); Regression-based model; Regulatory interactions; Principal component analysis