feat: Always save final checkpoint in RLTrainer #690

AmeenP · 2026-01-06T10:41:18Z

Summary

Ensure a checkpoint is always saved at the end of training, regardless of the save_steps interval.

Problem

Short training runs (max_steps < save_steps) produce no checkpoints because the Hugging Face Trainer only saves at save_steps intervals. The cleanup job then finds no step_* folder and deletes the adapter record.
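To make the failure mode concrete, here is a simplified model of interval-only saving. It is an illustration, not the Trainer's actual code, and checkpoint_steps is a hypothetical helper name.

```python
def checkpoint_steps(max_steps: int, save_steps: int) -> list:
    """Steps at which an interval-only policy would write a checkpoint.

    Simplified model (assumption): a save happens whenever the global
    step is a multiple of save_steps, and at no other time.
    """
    return [step for step in range(1, max_steps + 1) if step % save_steps == 0]


# Short run: the interval is never reached, so nothing is saved.
print(checkpoint_steps(max_steps=50, save_steps=100))   # []

# Longer run: steps 100 and 200 are saved, but the final step (250) is not.
print(checkpoint_steps(max_steps=250, save_steps=100))  # [100, 200]
```

The second case shows that the gap is not limited to short runs: any run whose last step is not a multiple of save_steps ends without a checkpoint for its final state.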

Solution

Add a _save_final_checkpoint() method that:

- Runs in the _inner_training_loop finally block (before orchestrator cleanup)
- Saves to the broadcasts/step_{final_step}/ directory
- Skips the save if a checkpoint already exists at that step
- Only runs on the main process
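The behaviour listed above can be sketched as a standalone function. This is a minimal sketch and not the code from this PR. The parameters output_dir, is_main_process and save_fn are assumptions introduced for illustration, as is the choice to place broadcasts/ under output_dir.

```python
import os
from typing import Callable, Optional


def save_final_checkpoint(
    output_dir: str,
    final_step: int,
    is_main_process: bool,
    save_fn: Callable[[str], None],
) -> Optional[str]:
    """Save to broadcasts/step_{final_step}/ and return the path, or None if skipped.

    Hypothetical stand-in for _save_final_checkpoint(); save_fn represents
    whatever actually writes the adapter weights into the directory.
    """
    # Only the main process writes, so distributed ranks do not race.
    if not is_main_process:
        return None

    ckpt_dir = os.path.join(output_dir, "broadcasts", f"step_{final_step}")

    # A regular interval save may already have produced this step.
    if os.path.isdir(ckpt_dir):
        return None

    os.makedirs(ckpt_dir, exist_ok=True)
    save_fn(ckpt_dir)
    return ckpt_dir
```

Returning the path (or None when skipped) is a convenience for callers and tests; the method described in this PR may simply return nothing.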

Changes

- Add import os for path operations
- Add _save_final_checkpoint() method
- Call in _inner_training_loop finally block
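The ordering in the finally block matters: the final save has to happen before cleanup. A minimal sketch of that control flow follows, with run_training and its three callables as hypothetical names. The nested try/finally is an addition made here so that cleanup still runs if the save itself fails; the PR does not state whether it does this.

```python
from typing import Callable


def run_training(
    train: Callable[[], None],
    save_final_checkpoint: Callable[[], None],
    cleanup_orchestrator: Callable[[], None],
) -> None:
    """Run training, then always save a final checkpoint before cleanup."""
    try:
        train()
    finally:
        # Runs whether training finished normally or raised.
        try:
            save_final_checkpoint()
        finally:
            # Cleanup comes last, so it sees the checkpoint written above.
            cleanup_orchestrator()
```

Because the save sits in a finally block, an interrupted run also gets a checkpoint for the last completed step, and the original exception still propagates to the caller.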

Ensure a checkpoint is saved at the end of training, regardless of save_steps interval. This fixes the issue where short runs (max_steps < save_steps) would produce no adapters.

Changes:
- Add _save_final_checkpoint() method to RLTrainer
- Call it in _inner_training_loop finally block before cleanup
- Save to broadcasts/step_N/ to match cleanup script expectations

AmeenP closed this Jan 6, 2026
