Literature Data Mining and Enrichment Analysis Reveal A Genetic Network of 423 Genes for Renal Cancer
نویسندگان
چکیده
Background: Renal cancer (RC) is a type of cancer that starts in the cells of the kidneys. Around 208,500 new cases of renal cancer worldwide are diagnosed yearly, accounting for just under 2% of all cancers. People who have a family history of RC have an increased risk of developing the disease. Recent years, an increased number of researches have been reported hundreds of genes related to the development of the disease. However, no systemic study has summarized these findings and has provided an objective view of these genes reportedly associated with RC. Methods: We conducted a literature data mining (LDM) of over 1,100 articles covering publications from 1988 to April 2016, where 423 genes were reported to be associated with RC. We then performed a gene set enrichment analysis (GSEA) and a sub-network enrichment analysis (SNEA) to study the functional profile and pathogenic significance of these genes with RC. Lastly, we performed a network connectivity analysis (NCA) to study the associations between the reported genes. Literature and enrichment metrics analyses were used to discover genes with specific significance to the disease. Results: 329/423 genes enriched 100 pathways (p<1.2e-10), demonstrating multiple associations with RC. Ten genes (IL6, VEGFA, HIF1A, EGFR, PTEN, TP53, FGF2, CTNNB1, HMOX1, and BRCA1) were identified as the top genes associated with leukemia in terms of both functional diversity and replication frequency. Additionally, three novel genes, CD274, NOTCH1, and CREB1, were found to play roles within many significant RC related pathways, suggesting that they were worthy of further study. Moreover, SNEA and NCA results indicated that many of these genes work as a functional network that plays roles in the pathogenesis of other RC related disorders. Conclusion: Our results suggest that the genetic causes of RC were linked to a genetic network composed of a large group of genes. The gene lists, together with the literature and enrichment metrics provided in this study, can serve as a groundwork for further biological/genetic studies in the field.
منابع مشابه
Obsessive-Compulsive Disorder Interactome Profile Analysis: A Perspective From Molecular Mechanism
Introduction: Obsessive-Compulsive Disorder (OCD) is one of the complex neuropsychiatric conditions. This disorder disables individuals in many different aspects of their personal and social life. Interactome analysis may provide a better understanding of this disorder’s molecular origin and its underlying mechanisms. Methods: In this study, the OCD-associated genes were extracted from the lit...
متن کاملSystematic enrichment analysis of microRNA expression profiling studies in endometriosis
Objective(s): The purpose of this study was to conduct a meta-analysis on human microRNAs (miRNAs) expression data of endometriosis tissue profiles versus those of normal controls and to identify novel putative diagnostic markers. Materials andMethods: PubMed, Embase, Web of Science, Ovid Medline were used to search for endometriosis miRNA expression profiling studies of endometriosis. The miRN...
متن کاملExploring Gene Signatures in Different Molecular Subtypes of Gastric Cancer (MSS/ TP53+, MSS/TP53-): A Network-based and Machine Learning Approach
Gastric cancer (GC) is one of the leading causes of cancer mortality, worldwide. Molecular understanding of GC’s different subtypes is still dismal and it is necessary to develop new subtype-specific diagnostic and therapeutic approaches. Therefore developing comprehensive research in this area is demanding to have a deeper insight into molecular processes, underlying these subtypes. In this st...
متن کاملIdentification of key genes and pathways involved in vitiligo vulgaris by gene network analysis
Background and Aim: Vitiligo vulgaris is an acquired, chronic skin and hair condition characterized clinically by loss of melanin, which, if untreated, is typically progressive and irreversible. The aim of the present study was to identify potential genes involved in the pathogenesis of vitiligo. Methods: One dataset of mRNA expression in patients with vitiligo (GSE65127) were obtained from ...
متن کاملIntegrative Biomarker Discovery for Breast Cancer Metastasis from Gene Expression and Protein Interaction Data Using Error-tolerant Pattern Mining
Biomarker discovery for complex diseases is a challenging problem. Most of the existing approaches identify individual genes as disease markers, thereby missing the interactions among genes. Moreover, often only single biological data source is used to discover biomarkers. These factors account for the discovery of inconsistent biomarkers. In this paper, we propose a novel error-tolerant patter...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016