Pull requests: modelscope/ms-swift


Pull requests list

[infer] Support infer cache impl
#7150 opened Dec 22, 2025 by Jintao-Huang
[model] support olmoe
#7140 opened Dec 21, 2025 by qianhao0713
1 of 4 tasks
[model] support minimind
#7136 opened Dec 20, 2025 by PiggerZZM
2 of 4 tasks
Improve vLLM examples regarding vllm_engine_kwargs use
#7133 opened Dec 19, 2025 by 3manifold
1 task done
Support features of cut cross entropy, TiledMLP and activation_offload
#7129 opened Dec 19, 2025 by vx120
1 of 4 tasks
[megatron] support megatron fsdp
#7117 opened Dec 18, 2025 by Jintao-Huang
[template] support mimo-v2 template
#7095 opened Dec 17, 2025 by Jintao-Huang
[feat] support TiledMLP in Deepspeed and FSDP2
#7090 opened Dec 17, 2025 by kevssim
2 of 4 tasks
[bugfix] fix missing generate method for InternVL-2.5
#7019 opened Dec 12, 2025 by xwy-bit
1 of 4 tasks
[feat] Add Support Cut-Cross-Entropy (CCE)
#6971 opened Dec 9, 2025 by w1ida
[feat] support deepspeed elastic
#6955 opened Dec 8, 2025 by meichangsu1
2 of 4 tasks
[WIP] [v4] refactor model_type & template
#6944 opened Dec 8, 2025 by Jintao-Huang
add muon clip optimizer
#6662 opened Nov 19, 2025 by vx120
1 task
Add conditional distillation support for GKD trainer
#6542 opened Nov 11, 2025 by woshixiaobai2019
3 tasks
[WIP][Exp]Support ray dpo
#6395 opened Nov 1, 2025 by tastelikefeet
1 of 4 tasks
[megatron] update megatron_args default_val
#6252 opened Oct 22, 2025 by Jintao-Huang
feat: Enable for exporting unmerged HF Lora Adapter
#6225 opened Oct 20, 2025 by jason9693
1 of 4 tasks
[WIP] refactor template
#6085 opened Oct 11, 2025 by Jintao-Huang
update docs
#5691 opened Sep 6, 2025 by Jintao-Huang
[model] update minicpmv-4.5 video processor stale
#5679 opened Sep 5, 2025 by hjh0119
Bug fix: eval OOM due to deepcopy of torch model stale
#5607 opened Aug 29, 2025 by hellopahe
1 task done