Convert job to recipe for LLM_HF example #3888

ZiyueXu77 · 2025-12-11T17:50:30Z

Fixes # .

Description

From job api to recipe

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Quick tests passed locally by running ./runtest.sh.
In-line docstrings updated.
Documentation updated.

Copilot

Pull request overview

This PR converts the LLM HuggingFace example from using the imperative Job API to the declarative Recipe API, aligning with NVFLARE's preferred pattern for job configuration. The refactoring encapsulates the job configuration logic into a reusable LLMHFRecipe class while maintaining the same functional behavior.

Key changes:

Introduced LLMHFRecipe class that wraps the existing job configuration logic
Refactored main() function to instantiate the recipe and delegate execution
Streamlined argument parser help text for consistency
Fixed spelling error in client_ids help text

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

examples/advanced/llm_hf/job.py

greptile-apps · 2025-12-11T18:06:30Z

Greptile Overview

Greptile Summary

Refactored LLM HuggingFace example from imperative job API to Recipe pattern, wrapping the FedJob configuration logic in a LLMHFRecipe class that extends the base Recipe class.

Key changes:

Added LLMHFRecipe class that encapsulates all job configuration in __init__
Moved FedJob creation, controller setup, quantization filters, model persistor, and client runner configuration into the recipe constructor
Improved code organization by consolidating logic that was previously scattered in the main() function
Updated job names to include "_recipe" suffix (llm_hf_sft_recipe, llm_hf_peft_recipe)
Added type hints (List[str], Optional[str]) for better type safety
Enhanced error messages to mention case-insensitive support
Improved argument parser help text for clarity
Fixed ports default to be a list ["7777"] instead of string "7777" for consistency

Functional equivalence:

All existing functionality is preserved
Single-GPU, multi-GPU, and multi-node training modes continue to work as before
Quantization, WandB integration, and all other features remain intact
The refactored code follows the same pattern as other recently converted examples (sklearn-linear, sklearn-svm, sklearn-kmeans, random_forest)

Confidence Score: 5/5

This PR is safe to merge with minimal risk - it's a clean refactoring that follows established patterns
This refactoring follows the exact same pattern used in recent Recipe conversions (sklearn-linear, sklearn-svm, sklearn-kmeans, random_forest). All existing functionality is preserved, with no breaking changes to the API. The code quality is improved with better organization, type hints, and clearer help text. The changes are well-contained to a single file and maintain backward compatibility at the command-line level.
No files require special attention

Important Files Changed

File Analysis

Filename	Score	Overview
examples/advanced/llm_hf/job.py	5/5	Converted from imperative job API to Recipe class pattern, wrapping FedJob configuration in `LLMHFRecipe.__init__`, improving reusability and maintainability

Sequence Diagram

sequenceDiagram
    participant User as User/main()
    participant Recipe as LLMHFRecipe
    participant FedJob as FedJob
    participant Env as ExecEnv (SimEnv/ProdEnv)
    participant Run as Run

    User->>User: Parse arguments
    User->>User: Split GPUs & validate
    User->>Recipe: __init__(client_ids, num_rounds, etc.)
    activate Recipe
    Recipe->>FedJob: Create FedJob(name, min_clients)
    Recipe->>FedJob: to(FedAvg controller, "server")
    
    alt quantize_mode specified
        Recipe->>FedJob: to(ModelQuantizer, "server")
        Recipe->>FedJob: to(ModelDequantizer, "server")
    end
    
    Recipe->>FedJob: to(model_file, "server")
    Recipe->>FedJob: to(PTFileModelPersistor, "server")
    Recipe->>FedJob: to(IntimeModelSelector, "server")
    
    loop for each client
        Recipe->>FedJob: to(ScriptRunner, client_site)
        alt quantize_mode
            Recipe->>FedJob: to(quantizer/dequantizer, client_site)
        end
        Recipe->>FedJob: to(client_params, client_site)
    end
    
    Recipe->>Recipe: super().__init__(job)
    deactivate Recipe
    
    User->>FedJob: export_job(job_dir)
    
    alt startup_kit_location provided
        User->>Env: Create ProdEnv
    else simulation mode
        User->>Env: Create SimEnv
    end
    
    User->>Recipe: execute(env)
    Recipe->>Env: deploy(job)
    Env-->>Recipe: job_id
    Recipe->>Run: Create Run(env, job_id)
    Recipe-->>User: run
    
    User->>Run: get_status()
    Run-->>User: status
    User->>Run: get_result()
    Run-->>User: result

greptile-apps

_{1 file reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps

_{1 file reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

convert job to recipe

5dc5b01

Copilot AI review requested due to automatic review settings December 11, 2025 17:50

Copilot started reviewing on behalf of ZiyueXu77 December 11, 2025 17:51 View session

Copilot AI reviewed Dec 11, 2025

View reviewed changes

examples/advanced/llm_hf/job.py Outdated Show resolved Hide resolved

examples/advanced/llm_hf/job.py Outdated Show resolved Hide resolved

examples/advanced/llm_hf/job.py Outdated Show resolved Hide resolved

greptile-apps bot reviewed Dec 11, 2025

View reviewed changes

bug correction, further polish

946ae81

greptile-apps bot reviewed Dec 11, 2025

View reviewed changes

ZiyueXu77 requested review from YuanTingHsieh, holgerroth and nvkevlu December 11, 2025 19:16

holgerroth marked this pull request as draft December 16, 2025 18:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Convert job to recipe for LLM_HF example #3888

Convert job to recipe for LLM_HF example #3888

Uh oh!

ZiyueXu77 commented Dec 11, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot commented Dec 11, 2025 •

edited

Loading

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Convert job to recipe for LLM_HF example #3888

Are you sure you want to change the base?

Convert job to recipe for LLM_HF example #3888

Uh oh!

Conversation

ZiyueXu77 commented Dec 11, 2025

Description

Types of changes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot commented Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Overview

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

greptile-apps bot commented Dec 11, 2025 •

edited

Loading