Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices

Volume: 81, Pages: 1 - 21
Published: Jan 1, 2019
Abstract
Expressing scientific computations in terms of BLAS, and in particular the general dense matrix-matrix multiplication (GEMM), is of fundamental importance for obtaining high performance portability across architectures. However, GEMMs for small matrices of sizes smaller than 32 are not sufficiently optimized in existing libraries. We consider the computation of many small GEMMs and its performance portability for a wide range of computer...
Paper Details
Title
Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices
Published Date
Jan 1, 2019
Volume
81
Pages
1 - 21
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.