Skip to content

Conversation

@jmellorcrummey
Copy link

we compiled the 'develop' versions of clr and hip as Ben recommended. the HIP and HSA assertions

ROCP_FATAL_IF(external_corr_ids.size() < (callback_contexts.size() + buffered_contexts.size()))

that I commented out out trip with each of external_corr_ids.size(), (callback_contexts.size(), and buffered_contexts.size() == 1. omitting them, our incomplete draft of rocprofiler-sdk support in hpctoolkit is being exercised as expected.

remove comment containing 'johnmc'
remove comment containing 'johnmc'
Copy link
Contributor

@jrmadsen jrmadsen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @jmellorcrummey, I think these should just move down below the tracing::populate_external_correlation_ids(…) function call that is a little further down in the code.

@jrmadsen
Copy link
Contributor

Actually, I was mistaken. It appears this condition can arise if you are using both callback and buffer tracing of an API in the same context:

extern_corr_ids.emplace(itr, empty_user_data);

extern_corr_ids.emplace(itr, empty_user_data);

which ends up with one external correlation id map entry, one callback entry, and one buffer entry. Are you intentionally doing this?

@jmellorcrummey
Copy link
Author

We are intentionally doing this at present. It should be allowed. That's why I think the assertion should be deleted.

For the future, we aim to switch to the external correlation id support rather than using the callback.

aovid the need to patch AMD's amdgpu driver for AFAR V
@jayhawk-commits
Copy link
Contributor

This pull request has merge conflicts that need to be resolved. It cannot be imported to the ROCm/rocm-systems repo automatically.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants