docs: adds a known_errors.rst to improve UX #332

terrykong · 2024-10-02T00:21:36Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation? Make sure to also update the NeMo Framework User Guide which contains the tutorials

Checklist when contributing a new algorithm

Does the trainer resume and restore model state all states?
Does the trainer support all parallelism techniques(PP, TP, DP)?
Does the trainer support max_steps=-1 and validation?
Does the trainer only call APIs defined in alignable_interface.py?
Does the trainer have proper logging?

Additional Information

Related to # (issue)

Signed-off-by: Terry Kong <terryk@nvidia.com>

terrykong · 2024-10-04T20:48:45Z

Note to self: re-home known_errors.rst

docs: adds a known_errors.rst to improve UX

c5f9b39

Signed-off-by: Terry Kong <terryk@nvidia.com>

terrykong requested review from jgerh and ko3n1g October 2, 2024 00:21

github-actions bot added the documentation Improvements or additions to documentation label Oct 2, 2024

terrykong added r0.5.0 creates cherry-pick PR after merge (add before Run CICD label) and removed documentation Improvements or additions to documentation labels Oct 2, 2024

missing

3a94377

Signed-off-by: Terry Kong <terryk@nvidia.com>

github-actions bot added the documentation Improvements or additions to documentation label Oct 2, 2024

terrykong marked this pull request as draft October 4, 2024 23:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: adds a known_errors.rst to improve UX #332

docs: adds a known_errors.rst to improve UX #332

Uh oh!

terrykong commented Oct 2, 2024

Uh oh!

terrykong commented Oct 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

docs: adds a known_errors.rst to improve UX #332

Are you sure you want to change the base?

docs: adds a known_errors.rst to improve UX #332

Uh oh!

Conversation

terrykong commented Oct 2, 2024

What does this PR do ?

Changelog

Usage

Before your PR is "Ready for review"

Checklist when contributing a new algorithm

Additional Information

Uh oh!

terrykong commented Oct 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants