No Travel? No Problem.

Remote Participation
Measurement and Analysis of GPU-Accelerated OpenCL Computations on Intel GPUs
Event Type
Workshop
Tags
Online Only
Heterogeneous Systems
Performance
Productivity Tools
Software Engineering
Registration Categories
W
TimeSunday, 14 November 202110:30am - 11am CST
LocationOnline
DescriptionIn this paper, we describe extensions to Rice University's HPCToolkit performance tools that support measurement and analysis of Intel's DPC++ programming model for GPU-accelerated systems atop an implementation of the industry-standard OpenCL framework for heterogeneous parallelism on Intel GPUs. HPCToolkit supports three techniques for performance analysis of programs atop OpenCL on Intel GPUs. First, HPCToolkit supports profiling and tracing of OpenCL kernels. Second, HPCToolkit supports CPU-GPU blame shifting for OpenCL kernel executions---a profiling technique that can identify code that executes on one or more CPUs while GPUs are idle. Third, HPCToolkit supports fine-grained measurement, analysis, and attribution of OpenCL GPU kernels, including instruction counts, execution latency, and SIMD waste. The paper describes these capabilities and then illustrates their application in case studies with two applications that offload computations onto Intel GPUs.
Back To Top Button