Ryan Connor
National Institutes of Health
Test case, Evolutionary ecology, World Wide Web, Genome, Metagenomics, Macroecology, Workflow, Atlas (topology), Data proliferation, Ecology, Human virome, Structure (mathematical logic), Host (network), Biological data, Geography, Information retrieval, Virus, Vertebrate, Node (computer science), Diversity (politics), Human immunodeficiency virus (HIV), Ecology (disciplines), Computer science, Python (programming language), Event (computing), Genome size, Big data, Evolutionary biology, Cloud computing, Metadata, CRISPR
Publications 4
#1 Colin J. Carlson (Georgetown University) H-Index: 18
#2 Rory Gibb (Lond: University of London) H-Index: 7
Last. Hannah K Frank (Tulane University)
15 authors in total
Data cataloguing viral diversity on Earth have been fragmented across sources, disciplines, formats, and various degrees of open collation, posing challenges for research on macroecology, evolution, and public health. Here, we solve this problem by establishing a dynamically-maintained database of vertebrate-virus associations, called The Global Virome in One Network (VIRION). The VIRION database has been assembled through both reconciliation of static datasets and integration of dynamically-upd...
#1 Rory Gibb (Lond: University of London) H-Index: 7
Last. Timothée Poisot (UdeM: Université de Montréal) H-Index: 27
13 authors in total
The fields of viral ecology and evolution have rapidly expanded in the last two decades, driven by technological improvements, and motivated by efforts to discover potentially zoonotic wildlife viruses under the rubric of pandemic prevention. One consequence has been a massive proliferation of host-virus association data, which comprise the backbone of research in viral macroecology and zoonotic risk prediction. These data remain fragmented across numerous data portals and projects, each with th...
Last. Ben Busby H-Index: 11
33 authors in total
Viruses represent important test cases for data federation due to their genome size and the rapid increase in sequence data in publicly available databases. However, some consequences of previously decentralized (unfederated) data are lack of consensus or comparisons between feature annotations. Unifying or displaying alternative annotations should be a priority both for communities with robust entry representation and for nascent communities with burgeoning data sources. To this end, during thi...
#1 Ryan Connor (NIH: National Institutes of Health) H-Index: 1
#2 Rodney Brister (NIH: National Institutes of Health) H-Index: 1
Last. Ben Busby (NIH: National Institutes of Health) H-Index: 11
38 authors in total
A wealth of viral data sits untapped in publicly available metagenomic data sets when it might be extracted to create a usable index for the virological research community. We hypothesized that work of this complexity and scale could be done in a hackathon setting. Ten teams comprising over 40 participants from six countries assembled to create a crowd-sourced set of analysis and processing pipelines for a complex biological data set in a three-day event on the San Diego State University camp...
5 Citations