
Pull requests: huggingface/trl


Pull requests list

👔 Apply doc-builder style
#3615 opened Jun 18, 2025 by qgallouedec
🏛️ Fix CI and Iterative SFT
#3614 opened Jun 18, 2025 by qgallouedec
📚 SFTTrainer support chat template kwargs
#3609 opened Jun 17, 2025 by qgallouedec
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602 opened Jun 16, 2025 by ioverho
2 of 5 tasks
[GRPO] Fix prompt truncation (max_prompt_length) with vLLM.
#3601 opened Jun 16, 2025 by LeonEricsson
2 of 5 tasks
📜 Add chat_template_source parameter to SFTConfig
#3599 opened Jun 16, 2025 by qgallouedec
5 tasks
fix bf16 fp16 config conflict issue
#3598 opened Jun 16, 2025 by yao-matrix
🧰 [SFT] Tool support
#3597 opened Jun 15, 2025 by qgallouedec
5 tasks
🤵‍♂️ SFT on assistant messages only
#3586 opened Jun 14, 2025 by qgallouedec
5 tasks
Fix: corrected fsdp in GRPO trainer
#3582 opened Jun 13, 2025 by tryumanshow
2 of 5 tasks
Check rewards shapes in RewardTrainer
#3577 opened Jun 13, 2025 by ioverho
4 tasks done
Chisquare regularized DPO
#3573 opened Jun 12, 2025 by asparius
Add entropy based filtering inside the GRPOTrainer.
#3563 opened Jun 10, 2025 by pramodith
4 of 5 tasks
Add vllm_gpu_memory_utilization recommendation script
#3554 opened Jun 9, 2025 by toslali-ibm
5 tasks
🥳 new rloo
#3533 opened Jun 3, 2025 by shirinyamani
5 tasks
Push KTAE impl
#3518 opened May 30, 2025 by SamComber
5 tasks
intuit
#3513 opened May 29, 2025 by shirinyamani
5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry
3 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching