@amaslenn

Summary

PyTorch to 25.12-py3
NeMo to 25.11.01

Test Plan

  1. CI

Additional Notes

Pytorch to 25.12-py3
Nemo to 25.11.01
@coderabbitai

coderabbitai bot commented Jan 12, 2026

📝 Walkthrough

Updated container image tags and small test parameters across multiple test and test-scenario TOML files: PyTorch images bumped to 25.12-py3, NeMo images to 25.11.01, copyright year ranges extended, and NCCL test params added to one config.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| NCCL / PyTorch test configs: conf/common/test/dse_nccl_all_gather.toml, conf/common/test/nccl_test.toml, conf/common/test/nccl_test_all_gather.toml, conf/common/test/osu_test.toml, conf/common/test/ucc_test.toml | Updated cmd_args.docker_image_url to nvcr.io#nvidia/pytorch:25.12-py3. Also updated copyright years. dse_nccl_all_gather.toml: added cmd_args.maxbytes = "4G", cmd_args.iters = "100", cmd_args.warmup_iters = ["5","50"], and [extra_cmd_args] with --stepfactor = "2". |
| NeMo test configs: conf/common/test/nemo_run_llama3_8b.toml, conf/common/test_scenario/dse_nemo_run_llama3_8b.toml | Updated cmd_args.docker_image_url to nvcr.io#nvidia/nemo:25.11.01. Updated copyright years. |
| Test scenario / container templates: conf/common/test_scenario/nccl_test.toml, conf/common/test_scenario/slurm_container.toml, conf/common/test_scenario/ucc_generator_test.toml | Updated Tests.cmd_args.docker_image_url to nvcr.io#nvidia/pytorch:25.12-py3 (some files previously used nvcr.io/nvidia/pytorch:25.09-py3). Updated copyright years. |

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

🐇 I nibble on toml and cheer,
Images bumped, the path is clear.
PyTorch climbs and NeMo sings,
Small new params and tiny things.
Hop—configs fresh for testing gears!

🚥 Pre-merge checks | ✅ 2
✅ Passed checks (2 passed)
| Check name | Status | Explanation |
| --- | --- | --- |
| Title check | ✅ Passed | The title 'Upgrade container versions for common examples' accurately summarizes the main change: updating container image versions across multiple configuration files. |
| Description check | ✅ Passed | The description is related to the changeset, specifying which container versions were upgraded (PyTorch to 25.12-py3 and Nemo to 25.11.01) and mentions the test plan. |



📜 Recent review details

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d7fd29a and f72a79a.

📒 Files selected for processing (10)
  • conf/common/test/dse_nccl_all_gather.toml
  • conf/common/test/nccl_test.toml
  • conf/common/test/nccl_test_all_gather.toml
  • conf/common/test/nemo_run_llama3_8b.toml
  • conf/common/test/osu_test.toml
  • conf/common/test/ucc_test.toml
  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
  • conf/common/test_scenario/nccl_test.toml
  • conf/common/test_scenario/slurm_container.toml
  • conf/common/test_scenario/ucc_generator_test.toml
🧰 Additional context used
🧠 Learnings (4)
📚 Learning: 2025-12-23T00:23:16.200Z
Learnt from: srivatsankrishnan
Repo: NVIDIA/cloudai PR: 764
File: src/cloudai/workloads/megatron_bridge/megatron_bridge.py:98-101
Timestamp: 2025-12-23T00:23:16.200Z
Learning: In src/cloudai/workloads/megatron_bridge/megatron_bridge.py, the nemo_run_repo GitRepo uses commit="main" intentionally. Nemo Run is a Slurm executor (not a framework) used by Megatron Bridge to launch recipes, and tracking the main branch is acceptable for this dependency.

Applied to files:

  • conf/common/test/nemo_run_llama3_8b.toml
  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
📚 Learning: 2026-01-05T22:24:31.807Z
Learnt from: srivatsankrishnan
Repo: NVIDIA/cloudai PR: 767
File: conf/experimental/megatron_bridge/test/gb300/megatron_bridge_qwen_30b.toml:28-37
Timestamp: 2026-01-05T22:24:31.807Z
Learning: In CloudAI Megatron-Bridge TOML configuration files, document and implement support for container_image to accept '#' as a separator in addition to '/'. For example, both 'nvcr.io/nvidia/nemo:25.11.01' and 'nvcr.io#nvidia/nemo:25.11.01' should be considered valid syntax. Update parsing/validation logic for container_image accordingly and add validation tests to cover both separator forms in all relevant TOML configs (e.g., under conf/**).

Applied to files:

  • conf/common/test/nemo_run_llama3_8b.toml
  • conf/common/test/nccl_test.toml
  • conf/common/test_scenario/nccl_test.toml
  • conf/common/test_scenario/slurm_container.toml
  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
  • conf/common/test/osu_test.toml
  • conf/common/test/ucc_test.toml
  • conf/common/test_scenario/ucc_generator_test.toml
  • conf/common/test/dse_nccl_all_gather.toml
  • conf/common/test/nccl_test_all_gather.toml
📚 Learning: 2025-12-18T17:54:44.004Z
Learnt from: allkoow
Repo: NVIDIA/cloudai PR: 742
File: doc/workloads/osu.rst:14-23
Timestamp: 2025-12-18T17:54:44.004Z
Learning: In the OSU benchmark workload (src/cloudai/workloads/osu_bench/), list-valued fields in OSUBenchCmdArgs (e.g., benchmark: str | List[str], message_size: Optional[str | List[str]]) are unrolled by the DSE (Design Space Exploration) flow before command generation, so the command generation strategy code receives only scalar values.

Applied to files:

  • conf/common/test/osu_test.toml
📚 Learning: 2025-12-17T22:24:51.805Z
Learnt from: srivatsankrishnan
Repo: NVIDIA/cloudai PR: 760
File: tests/standalone_command_gen_strategy/test_aiconfigurator_standalone_command_gen_strategy.py:33-122
Timestamp: 2025-12-17T22:24:51.805Z
Learning: In the NVIDIA/cloudai repository, avoid suggesting overly nitpick refactor comments such as test parametrization when there are only two test cases with different modes (e.g., agg vs disagg). Such refactoring suggestions are not needed unless explicitly requested.

Applied to files:

  • conf/common/test/dse_nccl_all_gather.toml
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: Run pytest (3.12)
  • GitHub Check: Run pytest (3.10)
🔇 Additional comments (10)
conf/common/test/nccl_test_all_gather.toml (1)

2-2: LGTM!

Copyright year update and PyTorch container version bump to 25.12-py3 are consistent with the PR objectives. The # separator in the image URL is valid per project conventions.

Also applies to: 22-22

conf/common/test/nemo_run_llama3_8b.toml (1)

2-2: LGTM!

Copyright year update and NeMo container version bump to 25.11.01 align with the PR objectives.

Also applies to: 22-22

conf/common/test/ucc_test.toml (1)

2-2: LGTM!

Copyright year update and PyTorch container version bump to 25.12-py3 are consistent with the PR objectives.

Also applies to: 22-22

conf/common/test_scenario/dse_nemo_run_llama3_8b.toml (1)

2-2: LGTM!

Copyright year update and NeMo container version bump to 25.11.01 are consistent with the PR objectives and the base test configuration in nemo_run_llama3_8b.toml.

Also applies to: 29-29

conf/common/test/nccl_test.toml (1)

2-2: LGTM!

Copyright year update and PyTorch container version bump to 25.12-py3 align with the PR objectives. The new test parameters (iters, warmup_iters, stepfactor) are consistent with other NCCL test configurations like nccl_test_all_gather.toml.

Also applies to: 22-28

conf/common/test_scenario/ucc_generator_test.toml (1)

2-2: LGTM!

Copyright year extended to 2026 and container image tag updated to 25.12-py3 as intended. This file uses the standard / separator for the image URL, which is valid per the repository conventions.

Also applies to: 33-33

conf/common/test_scenario/slurm_container.toml (1)

2-2: LGTM!

Copyright year extended and container image tag updated to 25.12-py3. The # separator format is supported per repository conventions.

Also applies to: 29-29

conf/common/test_scenario/nccl_test.toml (1)

2-2: LGTM!

Copyright year extended to 2026 and container image tag updated to 25.12-py3 for the nccl.scatter_perf test. Other tests in this scenario reference test definitions by name (nccl_base_test, nccl_test_all_gather) which should inherit the updated image from their respective test configuration files.

Also applies to: 71-71

conf/common/test/osu_test.toml (1)

2-2: LGTM!

Copyright year extended and container image updated to 25.12-py3. The addition of message_size = "1024" provides an explicit test parameter, which aligns with the pattern of extending test parameter coverage observed across this PR.

Also applies to: 22-26

conf/common/test/dse_nccl_all_gather.toml (1)

2-2: LGTM!

Copyright year extended and container image updated to 25.12-py3. The additional test parameters (maxbytes, iters, warmup_iters, --stepfactor) enhance the DSE configuration. Note that warmup_iters = ["5", "50"] defines a parameter sweep, which the DSE flow will unroll appropriately.

Also applies to: 22-31
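
The sweep behaviour mentioned above can be illustrated with a minimal sketch. It assumes the simplest possible strategy, an exhaustive Cartesian product over every list-valued field; the function name `unroll` is hypothetical and is not CloudAI's API, and the real DSE flow may select combinations differently.

```python
from itertools import product
from typing import Any, Dict, Iterator


def unroll(cmd_args: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield one scalar-only dict per combination of list-valued fields.

    Hypothetical helper for illustration only, not part of CloudAI.
    """
    keys = list(cmd_args)
    # Wrap scalars in a one-element list so product() treats every field uniformly.
    choices = [v if isinstance(v, list) else [v] for v in cmd_args.values()]
    for combo in product(*choices):
        yield dict(zip(keys, combo))


# Values taken from the dse_nccl_all_gather.toml summary in this review.
cmd_args = {"maxbytes": "4G", "iters": "100", "warmup_iters": ["5", "50"]}
variants = list(unroll(cmd_args))

for variant in variants:
    print(variant)
```

With a single two-element list, the sketch produces two scalar-only configurations, one per `warmup_iters` value, which matches the idea that command generation only ever sees scalar values.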


@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (10)
conf/common/test_scenario/dse_nemo_run_llama3_8b.toml (1)

2-2: Fix copyright year to satisfy CI.

The pipeline reports a copyright header year mismatch. Update the end year from 2025 to 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test_scenario/nccl_test.toml (1)

2-2: Fix copyright year to satisfy CI.

The pipeline reports a copyright header year mismatch. Update the end year from 2025 to 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/nccl_test.toml (1)

2-2: Fix copyright year to satisfy CI.

The pipeline reports a copyright header year mismatch. Update from 2025 to 2025-2026 (or just 2026 if this file was created this year).

Proposed fix
-# Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2025-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/nemo_run_llama3_8b.toml (1)

2-2: Fix copyright year to satisfy CI.

The pipeline reports a copyright header year mismatch. Update the end year from 2025 to 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/osu_test.toml (1)

2-2: Fix copyright year to satisfy CI.

The pipeline reports a copyright header year mismatch. Update from 2025 to 2025-2026.

Proposed fix
-# Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2025-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/dse_nccl_all_gather.toml (1)

2-2: Update copyright year to 2026.

The CI pipeline reports a copyright header year mismatch. The end year should be 2026.

Proposed fix
-# Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2025-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/ucc_test.toml (1)

2-2: Update copyright year to 2026.

The CI pipeline reports a copyright header year mismatch. The end year should be 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test_scenario/ucc_generator_test.toml (1)

2-2: Update copyright year to 2026.

The CI pipeline reports a copyright header year mismatch. The end year should be 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test_scenario/slurm_container.toml (1)

2-2: Update copyright year to 2026.

The CI pipeline reports a copyright header year mismatch. The end year should be 2026.

Proposed fix
-# Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2025-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
conf/common/test/nccl_test_all_gather.toml (1)

2-2: Update copyright year to 2026.

The CI pipeline reports a copyright header year mismatch. The end year should be 2026.

Proposed fix
-# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between bebc6ef and d7fd29a.

📒 Files selected for processing (10)
  • conf/common/test/dse_nccl_all_gather.toml
  • conf/common/test/nccl_test.toml
  • conf/common/test/nccl_test_all_gather.toml
  • conf/common/test/nemo_run_llama3_8b.toml
  • conf/common/test/osu_test.toml
  • conf/common/test/ucc_test.toml
  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
  • conf/common/test_scenario/nccl_test.toml
  • conf/common/test_scenario/slurm_container.toml
  • conf/common/test_scenario/ucc_generator_test.toml
🧰 Additional context used
🧠 Learnings (3)
📚 Learning: 2026-01-05T22:24:31.807Z
Learnt from: srivatsankrishnan
Repo: NVIDIA/cloudai PR: 767
File: conf/experimental/megatron_bridge/test/gb300/megatron_bridge_qwen_30b.toml:28-37
Timestamp: 2026-01-05T22:24:31.807Z
Learning: In CloudAI Megatron-Bridge TOML configuration files, document and implement support for container_image to accept '#' as a separator in addition to '/'. For example, both 'nvcr.io/nvidia/nemo:25.11.01' and 'nvcr.io#nvidia/nemo:25.11.01' should be considered valid syntax. Update parsing/validation logic for container_image accordingly and add validation tests to cover both separator forms in all relevant TOML configs (e.g., under conf/**).

Applied to files:

  • conf/common/test_scenario/ucc_generator_test.toml
  • conf/common/test_scenario/slurm_container.toml
  • conf/common/test/dse_nccl_all_gather.toml
  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
  • conf/common/test/nemo_run_llama3_8b.toml
  • conf/common/test_scenario/nccl_test.toml
  • conf/common/test/nccl_test_all_gather.toml
  • conf/common/test/ucc_test.toml
  • conf/common/test/osu_test.toml
  • conf/common/test/nccl_test.toml
📚 Learning: 2025-12-23T00:23:16.200Z
Learnt from: srivatsankrishnan
Repo: NVIDIA/cloudai PR: 764
File: src/cloudai/workloads/megatron_bridge/megatron_bridge.py:98-101
Timestamp: 2025-12-23T00:23:16.200Z
Learning: In src/cloudai/workloads/megatron_bridge/megatron_bridge.py, the nemo_run_repo GitRepo uses commit="main" intentionally. Nemo Run is a Slurm executor (not a framework) used by Megatron Bridge to launch recipes, and tracking the main branch is acceptable for this dependency.

Applied to files:

  • conf/common/test_scenario/dse_nemo_run_llama3_8b.toml
  • conf/common/test/nemo_run_llama3_8b.toml
📚 Learning: 2025-12-18T17:54:44.004Z
Learnt from: allkoow
Repo: NVIDIA/cloudai PR: 742
File: doc/workloads/osu.rst:14-23
Timestamp: 2025-12-18T17:54:44.004Z
Learning: In the OSU benchmark workload (src/cloudai/workloads/osu_bench/), list-valued fields in OSUBenchCmdArgs (e.g., benchmark: str | List[str], message_size: Optional[str | List[str]]) are unrolled by the DSE (Design Space Exploration) flow before command generation, so the command generation strategy code receives only scalar values.

Applied to files:

  • conf/common/test/osu_test.toml
🪛 GitHub Actions: CI
conf/common/test_scenario/ucc_generator_test.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test_scenario/slurm_container.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/dse_nccl_all_gather.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test_scenario/dse_nemo_run_llama3_8b.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/nemo_run_llama3_8b.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test_scenario/nccl_test.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/nccl_test_all_gather.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/ucc_test.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/osu_test.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.

conf/common/test/nccl_test.toml

[error] 2-2: Copyright header year range mismatch. Expected end year 2026, found 2025.
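
For readers who want to reproduce the header check locally, the following is a rough sketch of how such a check could work. The regular expression and the function names `header_end_year` and `check_header` are assumptions made for illustration; the repository's actual CI check is not shown in this conversation and may differ.

```python
import re
from typing import Optional

# Matches "Copyright (c) 2025 NVIDIA CORPORATION" and "Copyright (c) 2024-2025 NVIDIA CORPORATION".
HEADER_RE = re.compile(r"Copyright \(c\) (\d{4})(?:-(\d{4}))? NVIDIA CORPORATION")


def header_end_year(line: str) -> Optional[int]:
    """Return the last year named in the header, or None if no header is found."""
    match = HEADER_RE.search(line)
    if match is None:
        return None
    start, end = match.group(1), match.group(2)
    # A single-year header ("2025") has no explicit end year, so the start year is used.
    return int(end or start)


def check_header(line: str, expected_end_year: int) -> Optional[str]:
    """Return an error message modelled on the CI output above, or None if the header is fine."""
    found = header_end_year(line)
    if found is None:
        return "Copyright header not found."
    if found != expected_end_year:
        return (
            "Copyright header year range mismatch. "
            f"Expected end year {expected_end_year}, found {found}."
        )
    return None


old = "# Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved."
new = "# Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved."
print(check_header(old, 2026))
print(check_header(new, 2026))
```

The first call reports the mismatch and the second returns `None`, mirroring the before/after lines in the proposed fixes.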

🔇 Additional comments (11)
conf/common/test_scenario/nccl_test.toml (1)

71-71: PyTorch container version update looks good.

The update to pytorch:25.12-py3 aligns with the PR objectives.

conf/common/test/nccl_test.toml (1)

22-28: Container update and new test parameters look good.

The PyTorch container bump to 25.12-py3 and the addition of iters, warmup_iters, and stepfactor parameters align with the PR objectives and provide more explicit test configuration.

conf/common/test/nemo_run_llama3_8b.toml (1)

22-22: NeMo container version update looks good.

The update to nemo:25.11.01 aligns with the PR objectives.

conf/common/test/osu_test.toml (1)

22-22: PyTorch container version update looks good.

The update to pytorch:25.12-py3 aligns with the PR objectives.

conf/common/test_scenario/dse_nemo_run_llama3_8b.toml (1)

29-29: NeMo container version update looks good.

The update to nemo:25.11.01 aligns with project patterns. The # separator is valid per project conventions and used consistently across configuration files.

conf/common/test/dse_nccl_all_gather.toml (2)

22-22: Container version update looks good.

The PyTorch image upgrade from 25.06-py3 to 25.12-py3 aligns with the PR objectives.


26-31: Verify the new test parameters are intentional.

These new test parameters (maxbytes, iters, warmup_iters, extra_cmd_args with --stepfactor) extend beyond the container version upgrade scope stated in the PR description. Please confirm these additions are intentional.

conf/common/test/ucc_test.toml (1)

22-22: Container version update looks good.

The PyTorch image upgrade from 25.06-py3 to 25.12-py3 aligns with the PR objectives.

conf/common/test_scenario/ucc_generator_test.toml (1)

33-33: Container version update looks good.

The PyTorch image upgrade to 25.12-py3 aligns with the PR objectives. Note this file uses / separator while others use # — both are valid per the project conventions.

conf/common/test_scenario/slurm_container.toml (1)

29-29: Container version update looks good.

The PyTorch image upgrade from 25.06-py3 to 25.12-py3 aligns with the PR objectives.

conf/common/test/nccl_test_all_gather.toml (1)

22-22: Container version update looks good.

The PyTorch image upgrade from 25.06-py3 to 25.12-py3 aligns with the PR objectives.

@greptile-apps

greptile-apps bot commented Jan 12, 2026

Greptile Overview

Greptile Summary

This PR upgrades container versions in common example configurations to newer releases:

Container Version Updates:

  • PyTorch: 25.06-py3 → 25.12-py3 (8 files)
  • NeMo: 25.09.00 → 25.11.01 (2 files)

Scope and Changes:
The changes are limited to configuration files in conf/common/ directory, covering test definitions and test scenarios for various workloads including NCCL, UCC, OSU benchmarks, NeMo Run, and Slurm container tests. Copyright years have been appropriately updated to include 2026.

Technical Review:
All container version updates follow the established patterns in the codebase. The Docker image URLs use the standard nvcr.io#nvidia format (where # serves as a delimiter that gets replaced with / for Kubernetes and _ for cache filenames), with one pre-existing exception in ucc_generator_test.toml that uses / directly.
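
The delimiter handling described above can be sketched as follows. This is an illustration under stated assumptions, not CloudAI's implementation: the function names are hypothetical, and the extra replacement of `/` and `:` as well as the `.sqsh` suffix in the cache filename are guesses added only to make the result a plausible filename.

```python
def to_registry_url(docker_image_url: str) -> str:
    """Turn 'nvcr.io#nvidia/pytorch:25.12-py3' into 'nvcr.io/nvidia/pytorch:25.12-py3'.

    URLs that already use '/' pass through unchanged.
    """
    return docker_image_url.replace("#", "/")


def to_cache_filename(docker_image_url: str) -> str:
    """Build a filesystem-friendly cache name from an image URL.

    Only the '#' -> '_' replacement comes from the review text; replacing
    '/' and ':' and appending '.sqsh' are assumptions for this sketch.
    """
    safe = docker_image_url.replace("#", "_").replace("/", "_").replace(":", "_")
    return f"{safe}.sqsh"


for url in ("nvcr.io#nvidia/pytorch:25.12-py3", "nvcr.io/nvidia/pytorch:25.12-py3"):
    print(to_registry_url(url), to_cache_filename(url))
```

Under these assumptions both separator forms resolve to the same registry URL and the same cache filename, which is consistent with the reviewers treating the `/` form in ucc_generator_test.toml as a style inconsistency rather than a functional problem.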

Consistency Note:
While this PR updates the common examples, there are additional configuration files outside the conf/common/ directory that still reference the older container versions (e.g., files in conf/hook/, conf/release/nemo_acceptance/, and conf/release/spcx/l40s/). This appears intentional based on the PR scope focusing specifically on "common examples." Documentation files (doc/workloads/nccl.rst and doc/workloads/ucc.rst) also still reference the older PyTorch version, which was noted in previous review comments.

The updates are straightforward version bumps with no breaking API changes or functional modifications to the test configurations themselves.

Confidence Score: 5/5

  • This PR is safe to merge with minimal risk - it's a straightforward container version upgrade
  • The changes are limited to version number updates in configuration files with no code modifications. All updated files follow established patterns, copyright years are correctly updated, and the container version upgrades (PyTorch 25.06→25.12 and NeMo 25.09→25.11) represent routine updates without breaking changes. The only minor style issue is a pre-existing URL format inconsistency in one file that doesn't affect functionality.
  • Pay attention to conf/common/test_scenario/ucc_generator_test.toml which has a pre-existing URL format inconsistency (though not critical)

Important Files Changed

File Analysis

| Filename | Score | Overview |
| --- | --- | --- |
| conf/common/test/nemo_run_llama3_8b.toml | 5/5 | Updated NeMo container from 25.09.00 to 25.11.01 and copyright year to 2026. No functional issues. |
| conf/common/test_scenario/dse_nemo_run_llama3_8b.toml | 5/5 | Updated NeMo container from 25.09.00 to 25.11.01 and copyright year to 2026. No functional issues. |
| conf/common/test_scenario/ucc_generator_test.toml | 4/5 | Updated PyTorch from 25.09-py3 to 25.12-py3. Uses inconsistent URL format (nvcr.io/nvidia vs nvcr.io#nvidia). |

Sequence Diagram

sequenceDiagram
    participant User
    participant ConfigFile as Configuration Files
    participant TestDef as Test Definition
    participant DockerImage as Docker Image Handler
    participant Container as Container Registry
    
    User->>ConfigFile: Load test configuration
    ConfigFile->>TestDef: Parse docker_image_url
    Note over ConfigFile,TestDef: PyTorch: 25.06-py3 → 25.12-py3<br/>NeMo: 25.09.00 → 25.11.01
    TestDef->>DockerImage: Create DockerImage(url)
    DockerImage->>DockerImage: Replace # with _ for cache filename
    DockerImage->>Container: Pull/cache container image
    Container-->>DockerImage: Return container
    DockerImage-->>TestDef: Container ready
    TestDef-->>User: Execute test with updated container

@greptile-apps greptile-apps bot left a comment

4 files reviewed, 4 comments

@greptile-apps

greptile-apps bot commented Jan 12, 2026

Additional Comments (4)

doc/workloads/nccl.rst
The documentation example still references the old PyTorch container version 25.06-py3. Consider updating this to 25.12-py3 to match the newly updated common configuration files.

   docker_image_url = "nvcr.io#nvidia/pytorch:25.12-py3"

This will ensure that users following the documentation are using the same container versions as the common examples.

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!


doc/workloads/ucc.rst
The documentation example still references the old PyTorch container version 25.06-py3. Consider updating this to 25.12-py3 to match the newly updated common configuration files.

    docker_image_url = "nvcr.io#nvidia/pytorch:25.12-py3"

This will ensure consistency between documentation and the actual configuration examples.


doc/workloads/nccl.rst
The Test-in-Scenario example also references the old PyTorch container version. Consider updating for consistency:

     docker_image_url = "nvcr.io#nvidia/pytorch:25.12-py3"

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!


doc/workloads/ucc.rst
The Test-in-Scenario example also references the old PyTorch container version. Consider updating for consistency:

    docker_image_url = "nvcr.io#nvidia/pytorch:25.12-py3"

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!


@greptile-apps greptile-apps bot left a comment

1 file reviewed, 1 comment

@amaslenn amaslenn merged commit 320f510 into main Jan 12, 2026
5 checks passed
@amaslenn amaslenn deleted the am/container-versions branch January 12, 2026 11:10