Prognostic Risk Model of Colorectal Cancer Constructed by Bioinformatics Method and Functional Study of the Gene CLCA1 & SPINK4

Main Authors: Shiying Li, Haowei Liu, Yongming Zhao, Yingxing Zeng, Jun Xu
Format: Article
Terbitan: , 2020
Subjects:
Online Access: https://zenodo.org/record/3932694
Daftar Isi:
  • Abstract: Background Colorectal cancer (CRC) is the most common malignant tumor of digestive system. The metastases is the main cause of mortality in CRC patients, of whom the initial diagnosis is about 25%. In our study, we aimed to identify potential gene biomarkers based on RNA sequencing data to predict and improve CRC patient survival. Method In this study, by screening differentially expressed genes of colon cancer related to liver metastasis, a survival prognostic risk model was constructed by bioinformatics analysis. Here, we conducted our data mining analysis for CRC by integrating the differentially expressed genes acquired from Gene Expression Omnibus (GEO) database by primary tumor versus liver metastasis (GSE81582,GSE41258,GSE49355,GSE68468) into The Cancer Genome Atlas (TCGA) database which includes 415 primary tumor and 132 liver metastasis tissue. At the same time, we used transwell, RT-PCR and western to examine the effects of CLCA1 and SPINK4 on the migration of colorectal cancer cells at the cell level. Results We identified intersections of 197 genes (117 up-regulated and 80 down-regulated) between GEO data and TCGA data. Differentially expressed genes in TCGA-COAD by single factor cox analysis, lasso cycle training and multifactor cox analysis composed a survival prognosis prediction model consisted of 7 genes ORM1, CLCA1, C8B, SPINK4, ALDOB, GAMT, C8G. And results of transwell experiments showed that high expression of CLCA1 and SPINK4 can inhibit the migration ability of colon cancer cells LOVO and SW620, meanwhile western blotting showed that the high expression of both genes can upregulate the expression of epithelial phenotypic marker E-cadherin, and Vimentin expression is down-regulated. Conclusion In this study, 197 differentially expressed genes were selected and a relatively robust survival prognosis prediction model was constructed. The model consisted of seven genes: GAMT, C8G, ORM1, CLCA1, C8B, SPINK4, and ALDOB. At the same time, we found that CLCA1 and SPINK4 are closely related to survival prognosis. The predictive model nomogram will enable patients with CRC to be more accurately managed in trials testing new drugs and in clinical practice.
  • Author contribution: Shiying Li, article writing; Haowei Liu, contributed towards data collection and analysis; Yongming Zhao,investigation; Shiying Li, reference search;Yingxing Zeng, Formal analysis;Jun Xu Supervision and editing proof reading and project administration. Abbreviations: CRC, Colorectal cancer; GEO, Gene Expression Omnibus; TCGA, The Cancer Genome Atlas; EMT, Epithelial-mesenchymal transition; PCA, Principal component analysis; NCI, National Cancer Institute; NHGRI, National Human Genome Research Institute; mAb, monoclonal antibodies; PPI, protein protein interaction; GEPIA, Gene Expression Profiling Interactive Analysis.