Rapid development of cloud-native intelligent data pipelines for scientific data streams using the HASTE Toolkit
Abstract
Background Large streamed datasets, characteristic of life science applications, are often resource-intensive to process, transport and store. We propose a pipeline model, a design pattern for scientific pipelines, where an incoming stream of scientific data is organized into a tiered or ordered “data hierarchy". We introduce the HASTE Toolkit, a proof-of-concept cloud-native software toolkit based on this pipeline model, to partition and...
Paper Details
Title
Rapid development of cloud-native intelligent data pipelines for scientific data streams using the HASTE Toolkit
Published Date
Mar 1, 2021
Journal
Volume
10
Issue
3
Citation AnalysisPro
You’ll need to upgrade your plan to Pro
Looking to understand the true influence of a researcher’s work across journals & affiliations?
- Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
- Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.
Notes
History