@xtchen982
Tianchen Xiong 暂无简介
DeepSpeed v0.9.2 NPU 适配插件
A heterogeneous hardware acceleration library focused on efficient KV cache transfer operators (H2D/D2H), designed for large model training and inference scenarios.