利用生物資訊學分析大腸直腸癌的關鍵基因
Abstract
大腸直腸癌(CRC)目前是世界上常見的癌症之一,近年來發病趨勢逐漸上升且該疾病致死率也高居前列。雖然對於CRC的研究早已展開,但是該惡性腫瘤的分子機制並未被定性。本研究的目的是通過生物資訊分析確定並驗證了CRC相關的核心基因。為了篩選CRC中的差異表達基因 (DEGs),我們從Gene Expression Omnibus (GEO)上下載了數據集GSE103512,使用R軟體進行了數據分析,得到了235個DEGs,其中54個上調,181個下調。我們把分析出的DEGs導入了DAVID進行了Gene ontology (GO)和 Kyoto Encyclopedia of Genes and Genomes pathway(KEGG)分析。發現這些DEGs顯著參與癌症相關的function和pathway。我們使用string數據庫以及cytoscape可視化工具構建了PPI (protein-protein interaction)網路。通過cytoscape的cytoHubba工具,篩選出了10個關鍵基因(IGF1, DCN, CXCL8, FOS, CXCL12, C3, MYH11, CYR61, CXCR4),把這些hub genes代入gepia2.0數據庫中進行Overall Survival分析,發現CXCL8低表現量或C3高表現量會產生不良的生存結果。通過cytoscape的MCODE工具,我們從PPI網路中篩選出了3組module。把每組module中的基因進行KEGG分析,發現在module中同樣包含CXCL12,IGF1,CXCR4這3個關鍵基因,且顯著參與了癌症相關的pathway。本研究中發掘的核心基因在一定程度上有助於理解CRC發展的分子機制,其中CXCL12,IGF1,CXCR4參與了疾病發展過程的關鍵pathway,通過Overall Survival分析CXCL8和C3的表現量高低會對CRC造成顯著的預後差異。
Colorectal cancer (CRC) is currently one of the common cancers in the world. In recent years, the CRC has gradually increased . the mortality rate of CRC is also at the forefront. Although the research on CRC has already started, the molecular mechanism has not been fully elucidated. This study aims to identify and verify the core genes related to CRC through bioinformatics analysis. To screen the differentially expressed genes (DEGs) in CRC, we downloaded the dataset GES103512 from Gene Expression Omnibus (GEO), and analyzed the data using R , and obtained 235 DEGs, 54 of which were up-regulated and 181 were down-regulated. We imported the analyzed DEGs into DAVID for Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) analysis. We found that these DEGs are significantly involved in cancer-related functions and pathways. We built a PPI (protein-protein interaction) network by using the STRING database and Cytoscape software. Through Cytoscape's APP cytoHubba, 10 hub genes (IGF1, DCN, CXCL8, FOS, CXCL12, C3, MYH11, CYR61, CXCR4) were screened out, and these hub genes were substituted into the GEPIA2.0 database for Overall Survival analysis and found that CXCL8 Low expression or C3 high expression can produce poor survival outcomes. Through the MCODE APP of Cytoscape, we have screened out 3 groups of modules from the PPI network. KEGG analysis of the genes in each group of modules found that the module also contains the three hub genes CXCL12, IGF1, and CXCR4, and they are significantly involved in the pathway of cancer. The core genes discovered in this study help to understand the molecular mechanism of CRC development to a certain extent. Among them, CXCL12, IGF1, and CXCR4 are involved in the key pathways of the disease development process. The expression levels of CXCL8 and C3 will cause significant prognostic differences.
Colorectal cancer (CRC) is currently one of the common cancers in the world. In recent years, the CRC has gradually increased . the mortality rate of CRC is also at the forefront. Although the research on CRC has already started, the molecular mechanism has not been fully elucidated. This study aims to identify and verify the core genes related to CRC through bioinformatics analysis. To screen the differentially expressed genes (DEGs) in CRC, we downloaded the dataset GES103512 from Gene Expression Omnibus (GEO), and analyzed the data using R , and obtained 235 DEGs, 54 of which were up-regulated and 181 were down-regulated. We imported the analyzed DEGs into DAVID for Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) analysis. We found that these DEGs are significantly involved in cancer-related functions and pathways. We built a PPI (protein-protein interaction) network by using the STRING database and Cytoscape software. Through Cytoscape's APP cytoHubba, 10 hub genes (IGF1, DCN, CXCL8, FOS, CXCL12, C3, MYH11, CYR61, CXCR4) were screened out, and these hub genes were substituted into the GEPIA2.0 database for Overall Survival analysis and found that CXCL8 Low expression or C3 high expression can produce poor survival outcomes. Through the MCODE APP of Cytoscape, we have screened out 3 groups of modules from the PPI network. KEGG analysis of the genes in each group of modules found that the module also contains the three hub genes CXCL12, IGF1, and CXCR4, and they are significantly involved in the pathway of cancer. The core genes discovered in this study help to understand the molecular mechanism of CRC development to a certain extent. Among them, CXCL12, IGF1, and CXCR4 are involved in the key pathways of the disease development process. The expression levels of CXCL8 and C3 will cause significant prognostic differences.
Description
Keywords
關鍵基因, 大腸直腸癌, 差異表達基因, 生物信息学分析, hub genes, Colorectal cancer, differentially expressed genes, bioinformatics analysis