-
Notifications
You must be signed in to change notification settings - Fork 358
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Featuren] Change default value of rollout health check
#1197
opened Dec 23, 2025 by
yitianlian
Loading…
update default paths for AMD script to support out-of-the-box execution
#1191
opened Dec 23, 2025 by
Vivicai1005
Loading…
[WIP] Implement RDMA P2P weight update using TransferEngine
#1164
opened Dec 20, 2025 by
JD-ETH
Loading…
[FEATURE] Add tool call support for multi-turn SFT with delta-based loss masking
#1159
opened Dec 20, 2025 by
Surya-Gunukula
Loading…
tau-bench: offline stub user + tool parsing fallback
#1158
opened Dec 19, 2025 by
Fengzdadi
Loading…
Add tau2-bench training cookbook and implementation
#1156
opened Dec 19, 2025 by
jbarnes850
Loading…
fix: fix 8B VLM true on policy issue
run-ci-short
#1155
opened Dec 19, 2025 by
nanjiangwill
Loading…
[WIP] Add TerminalBench eval delegate + quickstart
#1154
opened Dec 19, 2025 by
XinyuJiangCMU
•
Draft
5 tasks
[FSDP][1/n] Support LoRA training for FSDP backend.
#1140
opened Dec 17, 2025 by
GuanxingLu
Loading…
4 tasks
[On Policy Distillation] resolve log prob dimension mismatch in on-policy distillation with CP > 1
#1135
opened Dec 17, 2025 by
Yuchen-Cao
Loading…
[FSDP] Support Qwen3-30B-A3B and analyse memory usage
#1132
opened Dec 16, 2025 by
mingMelody
Loading…
3 tasks done
[on-policy distillation] update reward function to fix potential token mismatches
#1128
opened Dec 16, 2025 by
ahxt
Loading…
fix: resolve size mismatch error during training (fixes #1076)
#1085
opened Dec 11, 2025 by
fangzhensheng
Loading…
Update sglang patch: add update_weights_from_tensor to EagleWorkerV2
#1044
opened Dec 6, 2025 by
zhihengy
Loading…
[Draft] Update Megatron patch to work for Megatron v0.15.0
#1042
opened Dec 6, 2025 by
Birch-san
Loading…
feat: Support
list-of-dicts format for multimodal message content
#1037
opened Dec 5, 2025 by
ppraneth
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-23.