Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ScaleRL: Add CISPO Loss
#4495 opened Nov 6, 2025 by pramodith Loading…
4 of 5 tasks
Buffer samples based on group level stds.
#4492 opened Nov 6, 2025 by pramodith Loading…
3 of 5 tasks
[DOCS] update and fix openenv
#4490 opened Nov 6, 2025 by burtenshaw Loading…
adding [SimPER](https://arxiv.org/abs/2502.00883)
#4486 opened Nov 6, 2025 by leeparkuky Loading…
2 of 5 tasks
Move GKDTrainer to experimental module
#4474 opened Nov 5, 2025 by behroozazarkhalili Loading…
7 tasks done
Add attention_mask to signature_columns
#4459 opened Nov 5, 2025 by shubhamjain0594 Loading…
5 tasks
fix: fix a little bug in GRPOTrainer
#4452 opened Nov 5, 2025 by SolarWindRider Loading…
Add kernels to Docker images
#4445 opened Nov 3, 2025 by ishitab02 Loading…
2 of 5 tasks
added 10 papers (+trainer cross-links) for #4407
#4441 opened Nov 3, 2025 by SSusantAchary Loading…
4 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.