Integrated pathway clusters with coherent biological themes for target prioritisation.

Published on Jun 11, 2014in PLOS ONE3.24
路 DOI :10.1371/JOURNAL.PONE.0099030
Yi-An Chen10
Estimated H-index: 10
,
Lokesh P. Tripathi12
Estimated H-index: 12
+ 3 AuthorsKenji Mizuguchi40
Estimated H-index: 40
Sources
Abstract
Prioritising candidate genes for further experimental characterisation is an essential, yet challenging task in biomedical research. One way of achieving this goal is to identify specific biological themes that are enriched within the gene set of interest to obtain insights into the biological phenomena under study. Biological pathway data have been particularly useful in identifying functional associations of genes and/or gene sets. However, biological pathway information as compiled in varied repositories often differs in scope and content, preventing a more effective and comprehensive characterisation of gene sets. Here we describe a new approach to constructing biologically coherent gene sets from pathway data in major public repositories and employing them for functional analysis of large gene sets. We first revealed significant overlaps in gene content between different pathways and then defined a clustering method based on the shared gene content and the similarity of gene overlap patterns. We established the biological relevance of the constructed pathway clusters using independent quantitative measures and we finally demonstrated the effectiveness of the constructed pathway clusters in comparative functional enrichment analysis of gene sets associated with diverse human diseases gathered from the literature. The pathway clusters and gene mappings have been integrated into the TargetMine data warehouse and are likely to provide a concise, manageable and biologically relevant means of functional analysis of gene sets and to facilitate candidate gene prioritisation.
馃摉 Papers frequently viewed together
20113.24PLOS ONE
3 Authors (Yi-An Chen, ..., Kenji Mizuguchi)
2011PSB: Pacific Symposium on Biocomputing
5 Authors (Sevin Turcan, ..., Donna K. Slonim)
References30
Newest
The accurate representation of all aspects of a metabolic network in a structured format, such that it can be used for a wide variety of computational analyses, is a challenge faced by a growing number of researchers. Analysis of five major metabolic pathway databases reveals that each database has made widely different choices to address this challenge, including how to deal with knowledge that is uncertain or missing. In concise overviews, we show how concepts such as compartments, enzymatic c...
Source
#1Partha K. Chandra (Tulane University)H-Index: 24
#2Lili Bao (Tulane University)H-Index: 6
Last. Srikanta Dash (Tulane University)H-Index: 34
view all 13 authors...
A stable and persistent Hepatitis C virus (HCV) replication cell culture model was developed to examine clearance of viral replication during long-term treatment using interferon-伪 (IFN-伪), IFN-位, and ribavirin (RBV). Persistently HCV-infected cell culture exhibited an impaired antiviral response to IFN-伪+RBV combination treatment, whereas IFN-位 treatment produced a strong and sustained antiviral response that cleared HCV replication. HCV replication in persistently infected cells induced chroni...
Source
#1Lokesh P. TripathiH-Index: 12
#2Hiroto KambaraH-Index: 62
Last. Kenji MizuguchiH-Index: 40
view all 11 authors...
Hepatitis C virus (HCV) is a major cause of chronic liver disease. HCV NS5A protein plays an important role in HCV infection through its interactions with other HCV proteins and host factors. In an attempt to further our understanding of the biological context of protein interactions between NS5A and host factors in HCV pathogenesis, we generated an extensive physical interaction map between NS5A and cellular factors. By combining a yeast two-hybrid assay with comprehensive literature mining, we...
Source
#1Hufeng Zhou (NUS: National University of Singapore)H-Index: 17
#2Jingjing Jin (NUS: National University of Singapore)H-Index: 17
Last. Limsoon Wong (NUS: National University of Singapore)H-Index: 62
view all 6 authors...
Pathway data are important for understanding the relationship between genes, proteins and many other molecules in living organisms. Pathway gene relationships are crucial information for guidance, prediction, reference and assessment in biochemistry, computational biology, and medicine. Many well-established databases--e.g., KEGG, WikiPathways, and BioCyc--are dedicated to collecting pathway data for public access. However, the effectiveness of these databases is hindered by issues such as incom...
Source
#6Yiran Chen (University of Texas Health Science Center at San Antonio)H-Index: 282
Background One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into...
Source
#1Fan Zhang (University of North Texas Health Science Center)H-Index: 11
#2Renee Drabier (University of North Texas Health Science Center)H-Index: 6
Background Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, and there is an urgent need for a better way to accurately interpret and distill such large amounts of data. Extensive pathway and network analysis allow for the discovery of highly significant pathways from a set of disease vs. healthy samples in the NGS and GWAS. Knowledge of activation of these processes will lead to elucidation of the comple...
Source
#1Gerald F. Watts (RPH: Royal Perth Hospital)H-Index: 116
#2A. Juniper (RPH: Royal Perth Hospital)H-Index: 4
Last. Peter O'Leary (UWA: University of Western Australia)H-Index: 32
view all 6 authors...
Familial hypercholesterolaemia (FH) is a condition that should be familiar to all health professionals involved in preventive medicine. FH is the most common and serious monogenic disorder of lipid metabolism that leads to premature coronary heart disease. However, most cases remain undetected or inadequately treated in our community. We provide an overview of FH, with emphasis on evidence for treatment, new models of care (MoCs) and health economic evaluations. Evidence for treatment is based o...
Source
#1Shoichi Ihara (Osaka University)H-Index: 5
#2Hiroshi KidaH-Index: 75
Last. Atsushi KumanogohH-Index: 71
view all 25 authors...
Stat3 mediates a complex spectrum of cellular responses, including inflammation, cell proliferation, and apoptosis. Although evidence exists in support of a positive role for Stat3 in cancer, its role has remained somewhat controversial because of insufficient study of how its genetic deletion may affect carcinogenesis in various tissues. In this study, we show using epithelium-specific knockout mice (Stat3 螖/螖 ) that Stat3 blunts rather than supports antitumor immunity in carcinogen-induced lun...
Source
#1Lokesh P. TripathiH-Index: 12
#2Hiroto KambaraH-Index: 62
Last. Kenji MizuguchiH-Index: 40
view all 9 authors...
Hepatitis C virus (HCV) causes chronic liver disease worldwide. HCV Core protein (Core) forms the viral capsid and is crucial for HCV pathogenesis and HCV-induced hepatocellular carcinoma, through its interaction with the host factor proteasome activator PA28纬. Here, using BD-PowerBlot high-throughput Western array, we attempt to further investigate HCV pathogenesis by comparing the protein levels in liver samples from Core-transgenic mice with or without the knockout of PA28纬 expression (abbrev...
Source
#4Yeongjun Jang (Korea Research Institute of Bioscience and Biotechnology)H-Index: 5
One of the biggest challenges in the study of biological regulatory networks is the systematic organization and integration of complex interactions taking place within various biological pathways. Currently, the information of the biological pathways is dispersed in multiple databases in various formats. hiPathDB is an integrated pathway database that combines the curated human pathway data of NCI-Nature PID, Reactome, BioCarta and KEGG. In total, it includes 1661 pathways consisting of 8976 dis...
Source
Cited By22
Newest
#1Elif Everest (ITU: Istanbul Technical University)H-Index: 2
#2Ege 脺lgen (Ac谋badem University)H-Index: 8
Last. Eda Tahir Turanli (Ac谋badem University)
view all 8 authors...
Source
#1Regan Odongo (GIT: Gebze Institute of Technology)H-Index: 2
#2Asuman Demiroglu-Zergeroglu (GIT: Gebze Institute of Technology)H-Index: 5
Last. Tunahan 脟ak谋r (GIT: Gebze Institute of Technology)H-Index: 1
view all 3 authors...
Background null Narrow spectrum of action through limited molecular targets and unforeseen drug-related toxicities have been the main reasons for drug failures at the phase I clinical trials in complex diseases. Most plant-derived compounds with medicinal values possess poly-pharmacologic properties with overall good tolerability, and, thus, are appropriate in the management of complex diseases, especially cancers. However, methodological limitations impede attempts to catalogue targeted process...
Source
#1Jos茅 Jaime Mart铆nez-Maga帽a (UJAT: Universidad Ju谩rez Aut贸noma de Tabasco)H-Index: 7
Last. Humberto Nicolini (UJAT: Universidad Ju谩rez Aut贸noma de Tabasco)H-Index: 40
view all 15 authors...
The combination of substance use and psychiatric disorders is one of the most common comorbidities. The objective of this study was to perform a genome-wide association study of this comorbidity (Com), substance use alone (Subs), and psychiatric symptomatology alone (Psych) in the Mexican population. The study included 3914 individuals of Mexican descent. Genotyping was carried out using the PsychArray microarray and genome-wide correlations were calculated. Genome-wide associations were analyze...
Source
#1German OsmakH-Index: 7
#2Ivan KiselevH-Index: 7
Last. O. O. Favorova (RSMU: Russian National Research Medical University)H-Index: 10
view all 4 authors...
MicroRNAs (miRNAs) are short, single-stranded, non-coding ribonucleic acid (RNA) molecules, which are involved in the regulation of main biological processes, such as apoptosis or cell proliferation and differentiation, through sequence-specific interaction with target mRNAs. In this study, we propose a workflow for predicting miRNAs function by analyzing the structure of the network of their target genes. This workflow was applied to study the functional role of miR-375 in the heart muscle (myo...
Source
#1Ying LinH-Index: 8
#2Shiva AfsharH-Index: 1
Last. Shizhong HanH-Index: 26
view all 5 authors...
Autism spectrum disorder (ASD) is a complex neurodevelopmental condition with a strong genetic basis. The role of de novo mutations in ASD has been well established, but the set of genes implicated to date is still far from complete. The current study employs a machine learning-based approach to predict ASD risk genes using features from spatiotemporal gene expression patterns in human brain, gene-level constraint metrics, and other gene variation features. The genes identified through our predi...
Source
#1Regan Odongo (GIT: Gebze Institute of Technology)H-Index: 2
#2Asuman Demiroglu Zergeroglu (GIT: Gebze Institute of Technology)H-Index: 1
Last. Tunahan 脟ak谋r (GIT: Gebze Institute of Technology)H-Index: 15
view all 3 authors...
Background: Plant-derived natural products possess poly-pharmacologic mechanisms of action with good tolerability and thus are appropriate in the management of complex diseases, especially cancers. However, methodological limitations impede attempts to catalogue targeted processes and infer systemic mechanisms of action. Integrative systems biology approaches are better suited in these cases due to their analytical comprehensiveness. Method: The transcriptome data from drug-treated breast cancer...
Source
#1Yi-An ChenH-Index: 10
#2Lokesh P. TripathiH-Index: 12
Last. Kenji MizuguchiH-Index: 40
view all 6 authors...
Biological data analysis is the key to new discoveries in disease biology and drug discovery. The rapid proliferation of high-throughput 鈥渙mics鈥 data has necessitated a need for tools and platforms that allow the researchers to combine and analyse different types of biological data and obtain biologically relevant knowledge. We had previously developed TargetMine, an integrative data analysis platform for target prioritisation and broad-based biological knowledge discovery. Here we describe the ...
Source
#1Ercan Bastu (Ac谋badem University)H-Index: 15
#2Irem Demiral (Harvard University)H-Index: 5
Last. John Yeh (Harvard University)H-Index: 21
view all 9 authors...
The aim of this prospective cohort study was to identify altered biologic processes in the endometrium that may be potential markers of receptive endometrium in patients with repeated implantation ...
Source
#1Yi-An ChenH-Index: 10
#2Lokesh P. TripathiH-Index: 12
Last. Kenji MizuguchiH-Index: 40
view all 3 authors...
: Most biological processes including diseases are multifactorial and determined by a complex interplay of various genetic and environmental factors. This chapter aims to provide a user guide to data querying, analysis, and visualization with TargetMine and the associated auxiliary toolkit. We have also discussed some of the commonly used data queries for the researchers who are interested in gene set analysis within a data warehouse framework. Overall, TargetMine provides a convenient web brows...
Source
#1Daniel Domingo-Fern谩ndez (Fraunhofer Society)H-Index: 11
#2Charles Tapley Hoyt (Fraunhofer Society)H-Index: 11
Last. Martin Hofmann-Apitius (Fraunhofer Society)H-Index: 28
view all 5 authors...
Although pathways are widely used for the analysis and representation of biological systems, their lack of clear boundaries, their dispersion across numerous databases, and the lack of interoperability impedes the evaluation of the coverage, agreements, and discrepancies between them. Here, we present ComPath, an ecosystem that supports curation of pathway mappings between databases and fosters the exploration of pathway knowledge through several novel visualizations. We have curated mappings be...
Source
This website uses cookies.
We use cookies to improve your online experience. By continuing to use our website we assume you agree to the placement of these cookies.
To learn more, you can find in our Privacy Policy.