Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs

Published: Jun 14, 2017
Abstract
This paper presents a software framework for solving large numbers of relatively small matrix problems using GPUs. Our approach combines novel and existing HPC techniques to methodically apply performance analysis, kernel design, low-level optimizations, and autotuning to exceed in performance proprietary vendor libraries. As a case study, we discuss the fundamental matrix operations defined by the Basic Linear Algebra Subprograms (BLAS)...
Paper Details
Title
Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs
Published Date
Jun 14, 2017
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.