e-journal
Studying Inter-Warp Divergence Aware Execution on GPUs Chulian
This letter quantitatively studies the benefits of inter-warp divergence aware execution on GPUs. To that end, the letter first proposes a novel approach to quantify the inter-warp divergence by measuring the temporal similarity in execution progress of concurrent warps, which we call Warp Progression Similarity (WPS). Based on the WPS metric, this letter proposes a WPS-aware Scheduler (WPSaS) to optimize GPU throughput. The aim is to manage inter-warp divergence to hide memory access latency and minimize resource conflicts and temporal under-utilization in compute units allowing GPUs to achieve their peak throughput. Our results demonstrate that WPSaS improves throughput by 10% with a pronounced reduction in resource conflicts and temporal under-utilization.
Tidak ada salinan data
Tidak tersedia versi lain