Skip to content
This repository was archived by the owner on Nov 19, 2025. It is now read-only.

Conversation

@arendu
Copy link
Collaborator

@arendu arendu commented Nov 1, 2024

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

  • Does the trainer resume and restore model state all states?
  • Does the trainer support all parallelism techniques(PP, TP, DP)?
  • Does the trainer support max_steps=-1 and validation?
  • Does the trainer only call APIs defined in alignable_interface.py?
  • Does the trainer have proper logging?

Additional Information

  • Related to # (issue)

Signed-off-by: arendu <adithya.r@gmail.com>
@github-actions github-actions bot added the Utils label Nov 1, 2024
arendu and others added 3 commits November 1, 2024 18:00
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: root <root@cw-dfw-h100-001-129-026.cm.cluster>
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: adithyare <adithyare@nvidia.com>
@arendu arendu requested a review from trias702 November 14, 2024 01:46
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: arendu <adithya.r@gmail.com>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants