generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
GKDTrainer: Fix return_outputs in Liger kernel path and update tests
#4688
opened Dec 13, 2025 by
roycho96
Loading…
2 of 5 tasks
Move
prepare_model_for_kbit_training, enable_gradient_checkpointing, prepare_peft_model to experimental.utils
#4686
opened Dec 12, 2025 by
qgallouedec
Loading…
Move
get_reward function to experimental.utils
#4683
opened Dec 12, 2025 by
qgallouedec
Loading…
5 tasks
loss calculation for evaluation without training
#4673
opened Dec 11, 2025 by
SonuDixit
Loading…
5 tasks
Overwrite model default generation config used by model.generate
#4647
opened Dec 9, 2025 by
albertvillanova
Loading…
7 of 9 tasks
CPOTrainer - Incorrect handling of different length chosen/rejected p…
#4639
opened Dec 8, 2025 by
davmels
Loading…
Support async reward functions and parallelize call to reward functions.
#4567
opened Nov 24, 2025 by
pramodith
Loading…
3 of 5 tasks
Add cross-tokenizer distillation support for GKD and MiniLLM trainers
#4561
opened Nov 22, 2025 by
sambhavnoobcoder
Loading…
Add PSPO trust region method as alternative to clipping in GRPOTrainer
#4548
opened Nov 19, 2025 by
MCDwyer
Loading…
2 of 5 tasks
[GRPO] switch grpo liger loss to triton version
#4519
opened Nov 13, 2025 by
kashif
Loading…
1 of 8 tasks
adding [SimPER](https://arxiv.org/abs/2502.00883)
#4486
opened Nov 6, 2025 by
leeparkuky
Loading…
2 of 5 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.