Enabling Highly Efficient Batched Matrix Multiplications on SW26010 Many-core Processor

Volume: 17, Issue: 1, Pages: 1 - 23
Published: Mar 4, 2020
Abstract
We present a systematic methodology for optimizing batched matrix multiplications on the SW26010 many-core processor of the Sunway TaihuLight supercomputer. Five surrogate algorithms and a machine learning–based algorithm selector are proposed to fully exploit the computing capability of the SW26010 and to cope with the sophisticated algorithm characteristics of batched matrix multiplications. Experimental results show that the algorithm selector is able to...