#checkpoint-management

Here is 1 public repository matching this topic...

ML training orchestration for the Crucible ecosystem. Distributed training, hyperparameter optimization, checkpointing, model versioning, metrics collection, early stopping, LR scheduling, gradient accumulation, and mixed precision training with Nx/Scholar integration.

  • Updated Dec 27, 2025
  • Elixir
