Ascend/MindSpeed-RL

[resharding] support EP and EP_expend (has conflicts)
ascend-cla/yes
!216 zhoubeirong 2
2025-04-17 15:41
Support the asynchronous inference pipeline feature (has conflicts)
ci-pipeline-passed
ascend-cla/yes
!209 shishaoyu 11
2025-04-16 17:56
Add mixed reward and 7b R1 model training example (has conflicts)
ci-pipeline-passed
ascend-cla/yes
!206 李尚蔚 202
2025-04-15 23:57
add format_code_math reward (mergeable)
ci-pipeline-passed
ascend-cla/yes
!204 白超 22
2025-04-15 17:15
Add 32b R1 model training example (mergeable)
ascend-cla/yes
stat/needs-squash
!202 xiecheng 38
2025-04-15 15:49
[resharding] support vpp (mergeable)
ci-pipeline-passed
ascend-cla/yes
stat/needs-squash
!200 zhoubeirong 43
2025-04-15 13:37
Rejection sampling: add readme, plus run configs and scripts for qwen2.5-7B and qwen2.5-32B (mergeable)
ci-pipeline-passed
ascend-cla/yes
stat/needs-squash
!188 linqihong 48
2025-04-10 20:20
Rejection sampling: rejection sampling algorithm with UT and ST (mergeable)
ci-pipeline-passed
ascend-cla/yes
stat/needs-squash
!187 linqihong 38
2025-04-10 20:19
[Lightweight PR]: update README.md. (mergeable)
ascend-cla/no
!177 060119朱昕浩 2
2025-04-07 15:01
Feature: use_remove_padding (has conflicts)
ascend-cla/yes
stat/needs-squash
!174 Nurxat 2
2025-04-03 14:38
https://gitee.com/ascend/MindSpeed-RL.git
git@gitee.com:ascend/MindSpeed-RL.git