Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning

Published: Jan 26, 2017
Abstract
In this paper, we present a methodology to understand GPU microarchitectural features and improve performance for compute-intensive kernels. The methodology relies on a reverse engineering approach to crack the GPU ISA encodings in order to build a GPU assembler. An assembly microbenchmark suite correlates microarchitectural features with their performance factors to uncover instruction-level and memory hierarchy preferences. We use SGEMM as a...