Skip to content

A modular program analysis tool framework for accelerators (NVIDIA, AMD, and DL workloads).

License

Notifications You must be signed in to change notification settings

FlagZhao/AccelProf

 
 

Repository files navigation

Documentation Status Static Badge Static Badge Static Badge

AccelProf

A Modular Program Analysis Tool Framework for Emerging Accelerators.

Overview

AccelProf is a modular program analysis framework for accelerator workloads spanning NVIDIA CUDA, AMD ROCm, and modern deep-learning systems. It abstracts over heterogeneous profiling APIs and deep-learning frameworks, providing a unified interface for capturing and analyzing runtime events at multiple levels. Its extensible architecture enables researchers and practitioners to rapidly prototype custom analysis tools with minimal overhead.

Installation

# Download
git clone --recursive https://github.com/AccelProf/AccelProf.git
git submodule update --init --recursive

# Check dependences
bash ./bin/utils/check_build_env.sh

# Build and install
./bin/build

# Set env
export ACCEL_PROF_HOME=$(pwd)
export PATH=${ACCEL_PROF_HOME}/bin:${PATH}

Basic Usage

Analyze an accelerator application:

accelprof -v -t app_analysis <executable> [args...]

Documentation

Full user and developer documentation: 👉 https://accelprofdocs.readthedocs.io

Paper

  • [CGO’26] PASTA: A Modular Program Analysis Tool Framework for Accelerators.
    Mao Lin, Hyeran Jeon, and Keren Zhou.
    Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization (CGO 2026).

License

Released under the MIT License. See LICENSE for details.

About

A modular program analysis tool framework for accelerators (NVIDIA, AMD, and DL workloads).

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 74.8%
  • Python 9.8%
  • C++ 7.8%
  • CMake 3.8%
  • Makefile 3.8%