A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. - View it on GitHub
Star
0
Rank
13772035