generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
docs: Add RapidFire AI integration guide
#4340
opened Oct 26, 2025 by
kamran-rapidfireAI
•
Draft
5 tasks
[vllm] update comment about communication group host ip
#4337
opened Oct 24, 2025 by
kashif
Loading…
5 tasks
Update SFT QLoRA notebook with **14B** model on free Colab
#4336
opened Oct 24, 2025 by
sergiopaniego
Loading…
5 tasks
Added custom
prepare_model_for_kbit_training to save VRAM
#4335
opened Oct 24, 2025 by
sergiopaniego
Loading…
5 tasks
feat(trainer): add PAPOTrainer for preference-based optimization
#4334
opened Oct 24, 2025 by
SolarWindRider
Loading…
4 tasks done
Update Reducing Memory Consumption guide with more details
#4332
opened Oct 24, 2025 by
sergiopaniego
Loading…
5 tasks
Use explicit tiny-Qwen2ForCausalLM-2.5 model_id param in CI tests
#4331
opened Oct 23, 2025 by
albertvillanova
Loading…
Implement CI test workflow for experimental module
#4330
opened Oct 23, 2025 by
albertvillanova
Loading…
Move tests of experimental GRPO with replay buffer to tests/experimental
#4329
opened Oct 23, 2025 by
albertvillanova
Loading…
Use explicit tiny-Qwen2_5_VL model_id parameter in CI tests
#4325
opened Oct 23, 2025 by
albertvillanova
Loading…
refactor: simplify parameter freezing in modeling_base.py
#4305
opened Oct 20, 2025 by
Ki-Seki
Loading…
2 of 5 tasks
GRPO: ScaleRL -> Support casting LM Head to FP32
#4303
opened Oct 18, 2025 by
pramodith
Loading…
4 of 5 tasks
[SFT] Log mean token accuracy from Liger kernel
#4302
opened Oct 18, 2025 by
kashif
Loading…
5 tasks
feat: Add Multi-Token Prediction (MTP) support to SFTTrainer
#4290
opened Oct 15, 2025 by
KLGR123
Loading…
[SFT] add support for unified conversion logic for both images and videos
#4264
opened Oct 13, 2025 by
kashif
Loading…
Remove FSDP1 support: use FSDP2 exclusively
#4260
opened Oct 11, 2025 by
behroozazarkhalili
Loading…
Fix DPO Trainer Bug For Qwen2-VL (Issue 2660)
#4257
opened Oct 11, 2025 by
FabianSchuetze
Loading…
1 of 3 tasks
Update
max_length explanation for VLM trainers
#4220
opened Oct 7, 2025 by
sergiopaniego
Loading…
5 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.