Pull requests: huggingface/trl

fix self.current_gradient_accumulation_steps in GRPOTrainer
#3984 opened Aug 31, 2025 by ahatamiz
5 tasks done
🎯 Add Trackio integration documentation and update TOC
#3971 opened Aug 28, 2025 by qgallouedec
5 tasks
👷 Added Kernels on the Hub x TRL guide
#3969 opened Aug 28, 2025 by sergiopaniego
2 of 8 tasks
[GRPO]: Fix Multi-GPU training for Entropy based masking of tokens.
#3964 opened Aug 27, 2025 by pramodith
2 of 5 tasks
Dft
#3960 opened Aug 27, 2025 by 1485840691
5 tasks
fix bug when using dataset streaming by accelerate
#3950 opened Aug 25, 2025 by kaixuanliu
Docker update
#3931 opened Aug 20, 2025 by qgallouedec
5 tasks
[SFTTrainer]: Check for assistant mask up to max_length
#3930 opened Aug 20, 2025 by pramodith
3 of 5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
5 tasks
Test in distributed setting
#3902 opened Aug 15, 2025 by qgallouedec
5 tasks
BEMA for ref model
#3898 opened Aug 14, 2025 by qgallouedec
5 tasks
Implement DPOP
#3864 opened Aug 7, 2025 by 1485840691
Update profiling.py: fix scoping problems for wandb and mlflow
#3845 opened Aug 4, 2025 by markshinyounglee
5 tasks done
dynamic temperature
#3844 opened Aug 4, 2025 by shirinyamani Draft
5 tasks
[GSPO]: Refactor _compute_loss
#3835 opened Aug 1, 2025 by pramodith
2 of 5 tasks
support GSPO-token
#3820 opened Jul 31, 2025 by hjh0119
Add vLLM server mode and VLM support to OnlineDPOTrainer
#3783 opened Jul 27, 2025 by vaelev
6 tasks done
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758 opened Jul 23, 2025 by almeidava93
2 of 5 tasks
Support dLLM in GRPO reference model creation
#3743 opened Jul 18, 2025 by xijia-tao
Add basic support for FSDP/Lora when using TRL/VLLM
#3735 opened Jul 14, 2025 by ojh31
5 tasks