Pull requests: modelscope/ms-swift
[dataset] support Geometry3K for SFT and GRPO
#6922
opened Dec 6, 2025 by
zsxm1998
1 of 4 tasks
feat: Add support for enabling and configuring msprobe via command-line and config.json
#6834
opened Dec 1, 2025 by
Vectorwh
2 of 4 tasks
Add conditional distillation support for GKD trainer
#6542
opened Nov 11, 2025 by
woshixiaobai2019
3 tasks
Add Tensor Input Support: Enable .pt file processing with <tensor> tags for latent representations
#6504
opened Nov 9, 2025 by
Marshall-mk
1 of 4 tasks
[Fix Bug] Enhance ProgressCallbackNew to initialize training bar with current step
#6415
opened Nov 3, 2025 by
YushunXiang
1 of 4 tasks
feat: Enable for exporting unmerged HF Lora Adapter
#6225
opened Oct 20, 2025 by
jason9693
1 of 4 tasks
bug fix: RuntimeError when training GRPO with LoRA and PtEngine
stale
#5645
opened Sep 3, 2025 by
chenjianhuii
1 of 4 tasks
Bug fix: eval OOM due to deepcopy of torch model
stale
#5607
opened Aug 29, 2025 by
hellopahe
1 task done
[init]support gptq grpo in colocate mode
stale
#5569
opened Aug 27, 2025 by
ItGirls
1 of 4 tasks
Allow flexibility for users to pass attention_mask in data_collator
#2234
opened Oct 13, 2024 by
YerongLi
3 tasks
# After inspecting the data, found that the code below filters out some valid data, e.g.: sure, here are some tools and …
#1931
opened Sep 4, 2024 by
KnightLancelot
4 tasks
Fix bug for less data than grad acc
#779
opened Apr 23, 2024 by
Firmament-cyou
Loading…
1 of 4 tasks