LIBXSMM: accelerating small matrix multiplications by runtime code generation

Pages: 981 - 991
Published: Nov 13, 2016
Abstract
Many modern highly scalable scientific simulations packages rely on small matrix multiplications as their main computational engine. Math libraries or compilers are unlikely to provide the best possible kernel performance. To address this issue, we present a library which provides high performance small matrix multiplications targeting all recent x86 vector instruction set extensions up to Intel AVX-512. Our evaluation proves that speed-ups of...
Paper Details
Title
LIBXSMM: accelerating small matrix multiplications by runtime code generation
Published Date
Nov 13, 2016
Pages
981 - 991
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.