A New Comprehensive Catalog of the Human Virome Reveals Hidden Associations with Chronic Diseases

Published on Nov 1, 2020 in bioRxiv
DOI: 10.1101/2020.11.01.363820
Authors: Michael J. Tisza (estimated H-index: 7), Christopher B. Buck (estimated H-index: 52)
Abstract: While there have been remarkable strides in microbiome research, the viral component of the microbiome has generally presented a more challenging target than the bacteriome. This is despite the fact that many thousands of shotgun sequencing runs from human metagenomic samples exist in public databases, and all of them contain large amounts of viral sequence. The lack of a comprehensive database for human-associated viruses, along with inadequate methods for high-throughput identification of highly divergent viruses in metagenomic data, has historically stymied efforts to characterize virus sequences comprehensively. In this study, a new high-specificity and high-sensitivity bioinformatic tool, Cenote-Taker 2, was applied to thousands of human metagenome datasets, uncovering over 50,000 unique virus operational taxonomic units. Publicly available case-control studies were re-analyzed, and over 1,700 strong virus-disease associations were found.