Skip to content

Releases: ROCm/rocHPL

v7.0.2

16 Jul 13:40
836c282

Choose a tag to compare

What's Changed

  • Fix for non-constexpr warpSize in ROCm 7.0+

Full Changelog: v7.0.1...v7.0.2

v7.0.1

21 May 21:41

Choose a tag to compare

What's Changed

Full Changelog: v7.0.0...v7.0.1

v7.0.0

27 Mar 20:02

Choose a tag to compare

What's Changed

  • GPU-side panel factorization
    • This is a significant change to the usual overlapped CPU+GPU code path.
    • New optimal run parameters may be required on some systems.
  • As GPU panel factorization uses the Cooperative Groups API, oversubscription of GPUs is no longer supported
    • Users must ensure to run with at most one process per GPU.
  • CPU-GPU binding in run scripts changed to use NUMA affinity
  • Added Debian and Linuxmint into the distribution filter by @joelandman in #11

New Contributors

Full Changelog: v6.1.0...v7.0.0

v6.1.0

21 Jan 20:54

Choose a tag to compare

What's Changed

  • New iteration count input parameter to allow for repeated back-to-back runs
  • Support for ROCm 6.0+
  • Updated BLIS, UCX, and OpenMPI third party libs built by default.
  • Support for OpenMPI v5.0+
  • Auto-detect supported gfx archs via ROCm version
  • Support for up to 512 CPU cores on single node
  • Flux job manager support
  • Added toggle for enabling roctx tracing

Full Changelog: v6.0.0...v6.1.0

v6.0.0

07 Jul 17:50

Choose a tag to compare

Initial public release