-
Notifications
You must be signed in to change notification settings - Fork 513
Pull requests: allenai/open-instruct
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Offline Distillation via DistillKit (Part One - Compression Helpers for Logit Capture)
#1525
opened Mar 12, 2026 by
wolfecameron
Loading…
Add DeepSpeed universal checkpoint (UCP) support for GRPO
#1517
opened Mar 7, 2026 by
MohdElgaar
Loading…
Migrate to vLLM 0.16.0 native weight transfer API
#1515
opened Mar 6, 2026 by
finbarrtimbers
Loading…
Add SLR-Bench (Scalable Logical Reasoning) verifier and dataset support for RLVR
#1511
opened Mar 6, 2026 by
lukashelff
Loading…
Rename TIS ratio cap, add low bound and hard filter flag
#1503
opened Mar 2, 2026 by
finbarrtimbers
Loading…
Add AppWorld environment integration for GRPO
#1501
opened Feb 27, 2026 by
hamishivi
Loading…
3 tasks done
Fix dataset mixer split validation in combined datasets
#1494
opened Feb 24, 2026 by
MohdElgaar
Loading…
Add SWERLSandboxEnv for per-sample Docker tasks with submit-based evaluation
#1492
opened Feb 24, 2026 by
hamishivi
Loading…
4 tasks done
Remove vllm_num_engines from VLLMConfig; compute inline from cluster resources
#1482
opened Feb 19, 2026 by
finbarrtimbers
Loading…
Require checkpoint on Beaker restarts for DPO and GRPO training
codex
#1469
opened Feb 10, 2026 by
finbarrtimbers
Loading…
Add DPO OLMo-core support with MFU improvements
#1440
opened Jan 30, 2026 by
finbarrtimbers
Loading…
3 tasks
Significantly improves
dpo.py performance: ~40% MFU
#1430
opened Jan 27, 2026 by
finbarrtimbers
•
Draft
Previous Next
ProTip!
Follow long discussions with comments:>50.