Properly Handle DynamicInferenceRequestRecord with latest Mcore #536

suiyoubi · 2025-12-15T14:45:26Z

Latest versions of Megatron-Core's inference engine return DynamicInferenceRequestRecord objects instead of InferenceRequest objects. The DynamicInferenceRequestRecord is a container class that holds multiple DynamicInferenceRequest objects (to support suspend/resume functionality) and doesn't have a direct generated_text attribute.

Added handling after generate() to merge DynamicInferenceRequestRecord objects in this PR

Signed-off-by: Ao Tang <aot@nvidia.com>

oyilmaz-nvidia · 2025-12-17T18:40:45Z

@suiyoubi Could you please fix the linting issues? Then we can start the CI.

copy-pr-bot · 2025-12-18T13:32:50Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

oyilmaz-nvidia · 2025-12-18T18:08:12Z

/ok to test bc17765

invoke merge for DynamicInferenceRequestRecord

e3c07e0

Signed-off-by: Ao Tang <aot@nvidia.com>

suiyoubi requested review from athitten, oyilmaz-nvidia and pthombre as code owners December 15, 2025 14:45

github-actions bot added deploy LLM labels Dec 15, 2025

copy-pr-bot bot had a problem deploying to nemo-ci December 15, 2025 14:45 Failure

ruff format

bc17765

copy-pr-bot bot had a problem deploying to nemo-ci December 18, 2025 18:08 Failure

copy-pr-bot bot temporarily deployed to test December 18, 2025 18:08 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci December 18, 2025 18:13 Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Properly Handle DynamicInferenceRequestRecord with latest Mcore #536

Properly Handle DynamicInferenceRequestRecord with latest Mcore #536

Uh oh!

suiyoubi commented Dec 15, 2025

Uh oh!

oyilmaz-nvidia commented Dec 17, 2025

Uh oh!

copy-pr-bot bot commented Dec 18, 2025

Uh oh!

oyilmaz-nvidia commented Dec 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Properly Handle DynamicInferenceRequestRecord with latest Mcore #536

Are you sure you want to change the base?

Properly Handle DynamicInferenceRequestRecord with latest Mcore #536

Uh oh!

Conversation

suiyoubi commented Dec 15, 2025

Uh oh!

oyilmaz-nvidia commented Dec 17, 2025

Uh oh!

copy-pr-bot bot commented Dec 18, 2025

Uh oh!

oyilmaz-nvidia commented Dec 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants