@emergenz

No description provided.

@maharajamihir maharajamihir left a comment

Tested up to Qwen 32B. Works.

@emergenz
Author

I'm just wondering whether we should wait for upstream LoRA support (for the default codepath + RL) or not.

For some reason they seem to be constantly reverting their LoRA implementation lol

@emergenz
Author

I don't think they've found a 'proper' solution for LoRA-based RL with SGLang yet, as the current proposal still unloads and reloads the LoRA adapter from disk (surely we can do better than that).

@emergenz
Author

Question is whether we should:

  • Merge this PR now, wait for them to properly support LoRA, and then rebase upstream into main (which will likely be a pain due to the overlapping changes)
  • Or defer merging this PR for now, wait until proper LoRA support lands in upstream, merge upstream into this branch, and squash merge into main

emergenz and others added 6 commits December 31, 2025 17:47
* feat: support lora in ray-less SFT codepath

* chore: add assert

* chore: fail fast if no lora-compatible modules found

* fix: set default lora dropout to 0

* fix: skip dataloader checkpoint loading if non-existent

* chore: change lora defaults

* feat: checkpoint in run script

* feat: separate load and save paths

* fix: only store trainable params in optimizer

* fix: only load adapter weights on lora restore

* feat: support lora in the ray-full SFT codepath (+assert that grad check. is off)
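Several of the commits above (fail fast if no LoRA-compatible modules are found, only store trainable params in the optimizer, only load adapter weights on LoRA restore) rest on the same idea: separate the adapter parameters from the frozen base weights and touch only the former. The following is a minimal sketch of that idea in plain Python, with dicts standing in for state dicts. The function names and the `lora_` key marker are hypothetical and are not taken from this PR; the actual implementation may differ.

```python
# Hypothetical sketch: the names and the "lora_" key convention below are
# assumptions for illustration, not code from this PR.
LORA_MARKER = "lora_"  # assumed substring identifying adapter parameters


def select_adapter_state(state_dict):
    """Return only the entries whose names look like LoRA adapter weights."""
    adapter = {k: v for k, v in state_dict.items() if LORA_MARKER in k}
    if not adapter:
        # Fail fast instead of silently saving or training nothing.
        raise ValueError("no LoRA adapter parameters found in state dict")
    return adapter


def restore_adapter(model_state, checkpoint_state):
    """Overwrite adapter entries in model_state; leave base weights untouched."""
    adapter = select_adapter_state(checkpoint_state)
    unknown = sorted(set(adapter) - set(model_state))
    if unknown:
        raise KeyError(f"adapter keys missing from model: {unknown}")
    restored = dict(model_state)  # copy so the caller's dict is not mutated
    restored.update(adapter)
    return restored


# Usage: the checkpoint's base weight (9.0) is ignored; only adapters load.
model = {"layer.weight": 1.0, "layer.lora_A": 0.0, "layer.lora_B": 0.0}
ckpt = {"layer.weight": 9.0, "layer.lora_A": 0.5, "layer.lora_B": 0.25}
restored = restore_adapter(model, ckpt)
```

The same filter could in principle be reused to decide which parameters are handed to the optimizer, so that optimizer state is kept only for trainable adapter weights.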
@emergenz
Author

Merging now that we have a working workflow to keep in sync with upstream.

@emergenz
Author

Actually, let's first merge #10 into this branch and then merge the entire thing into main.

That way, we frontload most of the conflict resolution we would otherwise face down the line when LoRA is finally merged into upstream main.

@emergenz emergenz changed the title feat: ray-less SFT [DO NOT MERGE, WIP] feat: ray-less SFT Dec 31, 2025
3 participants