12 Star 215 Fork 114

GVPAscend/MindSpeed-RL

全共卡模式支持断点续训单独加载ref权重 可合并
ci-pipeline-passed
ascend-cla/yes
!259 liuxinyang 11
2025-04-28 19:45
特性use_remove_padding
ascend-cla/yes
!255 Nurxat 1
2025-04-27 22:32
dataset shuffle逻辑修复与变更 可合并
ci-pipeline-passed
ascend-cla/yes
stat/needs-squash
!251 LookAround 102
2025-04-27 15:00
增加CoC功能 可合并
ascend-cla/yes
!249 yuqi 1
2025-04-25 16:26
dev add profiler 可合并
ci-pipeline-passed
ascend-cla/yes
lgtm
!240 tangmengcheng 41
2025-04-22 15:20
新增partial rollout特性 存在冲突
ci-pipeline-failed
ascend-cla/yes
stat/needs-squash
!237 LookAround 11
2025-04-21 18:14
reward 存在冲突
ascend-cla/yes
!234 朱意霖 1
2025-04-21 15:31
!233 Nurxat 13
2025-04-21 09:27
feat: 支持td profiling 可合并
ci-pipeline-failed
ascend-cla/yes
stat/needs-squash
!231 sunjianping 61
2025-04-20 12:04
1
https://gitee.com/ascend/MindSpeed-RL.git
git@gitee.com:ascend/MindSpeed-RL.git
ascend
MindSpeed-RL
MindSpeed-RL

搜索帮助