Mining the diseasome

Urbach, Davnah; Moore, Jason H

doi:10.1186/1756-0381-4-25

Editorial
Open access
Published: 09 September 2011

Mining the diseasome

Davnah Urbach¹ &
Jason H Moore^1,2,3

BioData Mining volume 4, Article number: 25 (2011) Cite this article

6046 Accesses
7 Citations
6 Altmetric
Metrics details

Over the last ten years, genome-wide association studies (GWAS) have reported over 4000 single nucleotide polymorphisms associated to more than 200 traits [1, 2]. Despite providing us with a slightly better understanding of the genetic architecture of common diseases, generating avalanches of new hypotheses, and fostering timid progress in pharmacogenomics [3], genetic associations studies haven't yet revolutionized clinical practice [4]. Hence, although such studies are still published at a remarkable pace, the notion of "post-GWAS" functional characterization of risk loci [2] is gradually gaining in popularity. Indeed, deciphering the function of disease-associated genetic variants is likely to get us closer to achieving an understanding of disease architecture that will ultimately be translatable into clinical applications. Despite this gradual change in research priorities, the field of medical genomics remains fairly conservative: the "single gene single disease" paradigm largely prevails, to the detriment of the avant-garde notion of "diseasome" [5] and of human disease network ("HDN") in particular, and attempts to truly integrate clinical information (e.g., age at onset or reduction in life span) and molecular data are scarce. Here we call for a revival of the notion of disease network, and recall how superimposing layers of clinical data and biological information to such networks may help identify novel disease genes. An inspiring read in that context is the recent paper by Barabási and coworkers on network medicine [6].

Diseases are traditionally considered as discrete entities and classified accordingly. However, the networks of genes accountable for particular disease phenotypes most certainly overlap, with individual genes simultaneously serving the cause of multiple disorders [5, 7]. Clinically distinct diseases have genes in common, like nodes in a network have links in common, and DNs capture this analogy by representing diseases with nodes and the genes they share with links. In such a network representation, breast cancer and pancreatic cancer for instance are two nodes connected by TP53 [5]. What the concept of DN implies is that many susceptibility loci hitherto associated to distinct diseases are in fact likely to contribute to the genetic architecture of several disorders. Hence, rather than initiating genetic association studies with no a priori hypothesis about where in the genome to look for potential candidate risk loci, the information captured by HDNs may serve the purpose of anchoring the search for susceptibility loci in genomic regions known to harbor genetic variants predictive of other "linked" diseases. Subsequently, the human interactome [6], i.e., the compendium of molecular, phenotypic and genetic interactions, or genome-wide regulatory networks [8] can serve as maps to navigate the genome in search of further susceptibility loci.

Additional indices on where to start exploring the genome for susceptibility loci can be inferred from general principles of human diseases and clinical data. For example, a considerable fraction of diseases with onset early in life appear to result from defects in enzyme-encoding genes, whereas diseases with onset during adulthood appear to be caused by alterations in genes encoding modifiers of protein functions [9]. Thus, clinical information such as age at onset or severity can serve as valuable expert knowledge to narrow down the genomic search space to genes or genetic domains that are biologically and clinically meaningful. Additionally, and although this is not always the case, co-morbid disorders often share genes [6]. Hence, using well-established susceptibility loci for co-morbid disorders as a starting point in genetic association studies may further enhance the success rate of these endeavors.

Recent years have come with major advancements in candidate gene prioritization and our understanding of the genetic architecture of human diseases is undoubtedly progressing. Here we have suggested that biological and clinical information may serve as valuable expert knowledge for genetic association studies and that disease networks may provide useful guidance prior to and during data mining.

References

Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA: Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA. 2009, 106: 9362-9367. 10.1073/pnas.0903103106.
Article CAS PubMed PubMed Central Google Scholar
Freedman ML: Principles for the post-GWAS functional characterization of cancer risk loci. Nat Genet. 2011, 43: 513-518. 10.1038/ng.840.
Article CAS PubMed PubMed Central Google Scholar
Daly AK: Genome-wide association studies in pharmacogenomics. Nat Rev Genet. 2010, 11: 241-246. 10.1038/nrg2751.
Article CAS PubMed Google Scholar
Collins FS: Has the revolution arrived?. Nature. 2010, 464: 674-675. 10.1038/464674a.
Article CAS PubMed Google Scholar
Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabási AL: The human disease network. Proc Nat Acad Sci USA. 2007, 104: 8685-8690. 10.1073/pnas.0701361104.
Article CAS PubMed PubMed Central Google Scholar
Barabási AL, Gulbahce N, Loscalzo J: Network medicine: a nework-based approach to human disease. Nat Rev Genet. 2011, 12: 56-68. 10.1038/nrg2918.
Article PubMed PubMed Central Google Scholar
Ahmed SSSJ, Ahameethunisa AR, Santosh W, Chakravarthy S, Kumar S: Systems biological approach on neurological disorders: a novel molecular connectivity to aging and psychiatric diseases. BMC Syst Biol. 2011, 5:
Google Scholar
Cowper-Sal Iari R, Cole MD, Karagas MR, Lupien M, Moore JH: Layers of epistasis: genome-wide regulatory networks and network approaches to genome-wide association studies. WIREs Syst Biol Med. 2010
Google Scholar
Jimenez-Sanchez G, Childs B, Valle D: Human disease genes. Nature. 2001, 409: 853-855. 10.1038/35057050.
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Dartmouth College, Institute for Quantitative Biomedical Sciences, One Medical Center Dr, Lebanon, NH, 03756, USA
Davnah Urbach & Jason H Moore
Dartmouth Medical School, Department of Genetics, One Medical Center Dr, Lebanon, NH, 03756, USA
Jason H Moore
Dartmouth Medical School, Department of Community and Family Medicine, One Medical Center Dr, Lebanon, NH, 03756, USA
Jason H Moore

Authors

Davnah Urbach
View author publications
You can also search for this author in PubMed Google Scholar
Jason H Moore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jason H Moore.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Urbach, D., Moore, J.H. Mining the diseasome. BioData Mining 4, 25 (2011). https://doi.org/10.1186/1756-0381-4-25

Download citation

Received: 22 July 2011
Accepted: 09 September 2011
Published: 09 September 2011
DOI: https://doi.org/10.1186/1756-0381-4-25

Mining the diseasome

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

BioData Mining

Contact us