Nemotron5 features #403

arendu · 2024-11-14T18:44:35Z

What does this PR do ?

contains changes to support nemotron5

Containers feat: support new DPO data format and update SFT config to use override API #405

Changelog

Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation? Make sure to also update the NeMo Framework User Guide which contains the tutorials

Checklist when contributing a new algorithm

Does the trainer resume and restore model state all states?
Does the trainer support all parallelism techniques(PP, TP, DP)?
Does the trainer support max_steps=-1 and validation?
Does the trainer only call APIs defined in alignable_interface.py?
Does the trainer have proper logging?

Additional Information

Related to # (issue)

) Signed-off-by: Terry Kong <terryk@nvidia.com> Signed-off-by: NeMo-Aligner CI <nemo-aligner-ci@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

wip Signed-off-by: arendu <adithya.r@gmail.com> docs: 0.5.0 documentation updates (#346) Signed-off-by: ashors1 <ashors@nvidia.com> ci: Sign-off cherry pick (#366) Signed-off-by: Oliver Koenig <okoenig@nvidia.com> docs: main readme and sft docs (#367) Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Gerald Shen <119401249+gshennvm@users.noreply.github.com> docs: fix code block rendering (#369) Signed-off-by: ashors1 <ashors@nvidia.com> dpo and sft Signed-off-by: arendu <adithya.r@gmail.com> dpo support Signed-off-by: root <root@cw-dfw-h100-001-129-026.cm.cluster> mamba padding Signed-off-by: arendu <adithya.r@gmail.com> convenience script to remove old format of DPO data Signed-off-by: adithyare <adithyare@nvidia.com> pad to mult 256 Signed-off-by: arendu <adithya.r@gmail.com> copy dpo style cfg overrides Signed-off-by: arendu <adithya.r@gmail.com> remove _modify_config Signed-off-by: arendu <adithya.r@gmail.com> fix config issue Signed-off-by: Jiaqi Zeng <jiaqiz@nvidia.com> fix mamba config issue Signed-off-by: Jiaqi Zeng <jiaqiz@nvidia.com> is mamba default false Signed-off-by: arendu <adithya.r@gmail.com> revert cherry-pick-release-commit Signed-off-by: Terry Kong <terryk@nvidia.com> Revert "revert cherry-pick-release-commit" This reverts commit 911337c. undo .github/workflows Signed-off-by: Terry Kong <terryk@nvidia.com> revert docs changes that weren't supposed to be there Signed-off-by: Terry Kong <terryk@nvidia.com>

for more information, see https://pre-commit.ci Signed-off-by: NeMo-Aligner CI <nemo-aligner-ci@nvidia.com>

Signed-off-by: Terry Kong <terryk@nvidia.com>

Signed-off-by: arendu <adithya.r@gmail.com>

github-actions bot added Utils Algorithms labels Nov 14, 2024

feat: TRTLLM API handle tokenizers without pad_id (e.g., tiktoken) (#399

eb2db8b

) Signed-off-by: Terry Kong <terryk@nvidia.com> Signed-off-by: NeMo-Aligner CI <nemo-aligner-ci@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

terrykong force-pushed the nemotron5_features branch from 330d0c9 to bf8e6cc Compare November 21, 2024 23:18

github-actions bot added documentation Improvements or additions to documentation CI labels Nov 21, 2024

terrykong force-pushed the nemotron5_features branch from bf8e6cc to 9c1727f Compare November 21, 2024 23:19

github-actions bot removed CI documentation Improvements or additions to documentation labels Nov 21, 2024

terrykong force-pushed the nemotron5_features branch from 9789e6a to ddd829b Compare November 21, 2024 23:27

terrykong force-pushed the nemotron5_features branch from ddd829b to 5270081 Compare November 21, 2024 23:30

[pre-commit.ci] auto fixes from pre-commit.com hooks

6394adb

for more information, see https://pre-commit.ci Signed-off-by: NeMo-Aligner CI <nemo-aligner-ci@nvidia.com>

github-actions bot removed Utils Algorithms labels Nov 21, 2024

feat: dpo dataset new openai chat completion format

7723879

terrykong mentioned this pull request Nov 22, 2024

feat: support new DPO data format and update SFT config to use override API #405

Merged

8 tasks

remove dockerfile patches for nemotron-5 since nemo includes PRs needed

d97f299

Signed-off-by: Terry Kong <terryk@nvidia.com>

terrykong force-pushed the main branch from 5b07bec to 94a9594 Compare December 5, 2024 00:15

meta tokens support

a9c8b0a

Signed-off-by: arendu <adithya.r@gmail.com>

github-actions bot added the Utils label Dec 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Nemotron5 features #403

Nemotron5 features #403

Uh oh!

arendu commented Nov 14, 2024 •

edited by terrykong

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Nemotron5 features #403

Are you sure you want to change the base?

Nemotron5 features #403

Uh oh!

Conversation

arendu commented Nov 14, 2024 • edited by terrykong Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Changelog

Usage

Before your PR is "Ready for review"

Checklist when contributing a new algorithm

Additional Information

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

arendu commented Nov 14, 2024 •

edited by terrykong

Loading