A Modular Program Analysis Tool Framework for Emerging Accelerators.
AccelProf is a modular program analysis framework for accelerator workloads spanning NVIDIA CUDA, AMD ROCm, and modern deep-learning systems. It abstracts over heterogeneous profiling APIs and deep-learning frameworks, providing a unified interface for capturing and analyzing runtime events at multiple levels. Its extensible architecture enables researchers and practitioners to rapidly prototype custom analysis tools with minimal overhead.
# Download
git clone --recursive https://github.com/AccelProf/AccelProf.git
git submodule update --init --recursive
# Check dependences
bash ./bin/utils/check_build_env.sh
# Build and install
./bin/build
# Set env
export ACCEL_PROF_HOME=$(pwd)
export PATH=${ACCEL_PROF_HOME}/bin:${PATH}Analyze an accelerator application:
accelprof -v -t app_analysis <executable> [args...]Full user and developer documentation: 👉 https://accelprofdocs.readthedocs.io
- [CGO’26] PASTA: A Modular Program Analysis Tool Framework for Accelerators.
Mao Lin, Hyeran Jeon, and Keren Zhou.
Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization (CGO 2026).
Released under the MIT License.
See LICENSE for details.
