-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Pull requests: modelscope/ms-swift
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[megatron] support the fake distributed process group
#8347
opened Mar 16, 2026 by
yangbofun
Loading…
1 of 4 tasks
[bugfix] Fix abnormal grad_norm under GRPO LoRA + DeepSpeed ZeRO-0 (fix #6815)
#8341
opened Mar 14, 2026 by
alphadl
Loading…
[bugfix] Web UI: load from checkpoint path by type instead of dirty exception check
#8340
opened Mar 14, 2026 by
alphadl
Loading…
[bugfix] fix load audio from local video path when USE_AUDIO_IN_VIDEO (fix #8332)
#8339
opened Mar 14, 2026 by
alphadl
Loading…
[bugfix] fix PeftModel save_pretrained selected_adapters when using custom adapter_name
#8337
opened Mar 14, 2026 by
alphadl
Loading…
feat(template): add MiniCPM-o-4.5 training template with audio support and fix image_bound bug
#8307
opened Mar 12, 2026 by
xxddccaa
Loading…
[WIP] Moe kernel for qwen3 omni in ascend
#8214
opened Mar 5, 2026 by
jiaqiw09
Loading…
1 of 4 tasks
[megatron]feat: Add routing replay support for Megatron-Swift GRPO
#8196
opened Mar 4, 2026 by
XianlongLi
Loading…
1 of 4 tasks
feat: support resume_from_checkpoint=True to auto-find last checkpoint
#8050
opened Feb 14, 2026 by
zhichenggeng
Loading…
1 of 4 tasks
[feat] support frames packing for minicpmv4_5 video processing
#8046
opened Feb 13, 2026 by
fanqiNO1
Loading…
2 of 4 tasks
Add QAT (Quantization-Aware Training) Support Callback
#8042
opened Feb 12, 2026 by
y2logic
Loading…
1 task done
[fix] Pass all fsdp_config values to accelerate via environment variables…
#7962
opened Feb 2, 2026 by
tzteyang
Loading…
1 of 4 tasks
[feat] Support ProFit: Extend DFT with Probability Threshold-based Token Filtering
#7921
opened Jan 28, 2026 by
maybefunctionname
Loading…
1 of 4 tasks
feat: add greedy packing, MiniCPM packing support, and dataset progress tracking
#7904
opened Jan 26, 2026 by
Lollipop
Loading…
fix(megatron): disable checkpointing when calculate KL
#7828
opened Jan 20, 2026 by
zzc0430
Loading…
1 of 4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.