generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add
generation_kwargs
as a property of GRPOConfig
to support additional generation arguments.
#3617
opened Jun 18, 2025 by
pramodith
Loading…
4 of 5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
[GRPO] Fix prompt truncation (
max_prompt_length
) with vLLM.
#3601
opened Jun 16, 2025 by
LeonEricsson
Loading…
2 of 5 tasks
📜 Add
chat_template_source
parameter to SFTConfig
#3599
opened Jun 16, 2025 by
qgallouedec
Loading…
5 tasks
🦘 Skip no-op ChatML conversion for datasets already in ChatML format
#3594
opened Jun 15, 2025 by
qgallouedec
Loading…
5 tasks
🔖 Fix: ensure user-provided
labels
are retained in self._signature_columns
#3589
opened Jun 14, 2025 by
sxndqc
Loading…
5 tasks
Add entropy based filtering inside the GRPOTrainer.
#3563
opened Jun 10, 2025 by
pramodith
Loading…
4 of 5 tasks
Add
vllm_gpu_memory_utilization
recommendation script
#3554
opened Jun 9, 2025 by
toslali-ibm
Loading…
5 tasks
🎀 New defaults:
gradient_checkpointing=True
#3510
opened May 29, 2025 by
qgallouedec
Loading…
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508
opened May 29, 2025 by
shaischaudhry
Loading…
3 of 5 tasks
[GRPO] Pad per minibatch instead of per generation batch
#3495
opened May 26, 2025 by
edbeeching
•
Draft
3 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-05-18.