Time and energy modeling of high–performance Level-3 BLAS on x86 architectures

Volume: 55, Pages: 77 - 94
Published: Jun 1, 2015
Abstract
We present accurate piece-wise models for the time and energy costs of high performance implementations of both the matrix multiplication (gemm) and the triangular system solve with multiple right-hand sides (trsm) on x86 architectures. Our methodology decouples the costs due to the floating-point arithmetic/data movement occurring in the higher levels of the cache hierarchy from those of packing/data transfers between the main memory and the...
Paper Details
Title
Time and energy modeling of high–performance Level-3 BLAS on x86 architectures
Published Date
Jun 1, 2015
Volume
55
Pages
77 - 94
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.