# 深度学习编译 **Repository Path**: Qwesh157/DeepLearningCompiling ## Basic Information - **Project Name**: 深度学习编译 - **Description**: 深度学习编译资料共享 - **Primary Language**: Unknown - **License**: LGPL-2.1 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 4 - **Created**: 2022-12-04 - **Last Updated**: 2022-12-04 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README ### 文档参考资料列表 :exclamation: :exclamation: :exclamation: 说明:文档参考资料列表整理自根目录下的各个章节文件夹,由于各章节参考资料存在交叉以及便于阅读整理,建议及时上传资料至相应文件夹并按 **如下格式更新此列表中的链接** 以及添加资料简要的介绍等内容 #### 01深度学习简介 - 1、[循环神经网络研究综述_杨丽](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E8%83%8C%E6%99%AF%E5%BE%AA%E7%8E%AF%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E6%9D%A8%E4%B8%BD.pdf) - 2、[深度学习相关研究综述_张军阳](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%9B%B8%E5%85%B3%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E5%BC%A0%E5%86%9B%E9%98%B3.pdf) - 3、[深度学习技术和平台发展综述_于佃海](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%8A%80%E6%9C%AF%E5%92%8C%E5%B9%B3%E5%8F%B0%E5%8F%91%E5%B1%95%E7%BB%BC%E8%BF%B0_%E4%BA%8E%E4%BD%83%E6%B5%B7.pdf) - 4、[图神经网络技术研究综述_李甜甜](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E5%9B%BE%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E6%8A%80%E6%9C%AF%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E6%9D%8E%E7%94%9C%E7%94%9C.pdf) - 5、[卷积神经网络研究综述_周飞燕](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E5%8D%B7%E7%A7%AF%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E5%91%A8%E9%A3%9E%E7%87%95.pdf) - 6、[神经网络软硬件协同加速关键技术_王佩琪](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E8%BD%AF%E7%A1%AC%E4%BB%B6%E5%8D%8F%E5%90%8C%E5%8A%A0%E9%80%9F%E5%85%B3%E9%94%AE%E6%8A%80%E6%9C%AF_%E7%8E%8B%E4%BD%A9%E7%90%AA.pdf) - 7、[计算机视觉代码示例](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%AE%A1%E7%AE%97%E6%9C%BA%E8%A7%86%E8%A7%89%E7%A4%BA%E4%BE%8B%E4%BB%A3%E7%A0%81.docx) 地址:https://www.jianshu.com/p/a46191daacfc - 8、[目标检测代码示例](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%9B%AE%E6%A0%87%E6%A3%80%E6%B5%8B%E7%A4%BA%E4%BE%8B%E4%BB%A3%E7%A0%81.zip) 地址:https://github.com/qfs1980398040/pytorch-Single-target-detection-Minions - 9、[自然语言处理代码示例](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%87%AA%E7%84%B6%E8%AF%AD%E8%A8%80%E5%A4%84%E7%90%86%E7%A4%BA%E4%BE%8B%E4%BB%A3%E7%A0%81.zip) 地址:https://github.com/graykode/nlp-tutorial - 10、[语音识别代码示例](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%AF%AD%E9%9F%B3%E8%AF%86%E5%88%AB%E7%A4%BA%E4%BE%8B%E4%BB%A3%E7%A0%81.zip) 地址:https://github.com/xxbb1234021/speech_recognition - 11、[王秉睿. 神经网络专用编程语言[D].中国科学技术大学,2019](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/Domain%20Specif...ural%20Networks_%E7%8E%8B%E7%A7%89%E7%9D%BF.pdf) - 12、[图神经网络前沿进展与应用_吴博](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E5%9B%BE%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%89%8D%E6%B2%BF%E8%BF%9B%E5%B1%95%E4%B8%8E%E5%BA%94%E7%94%A8_%E5%90%B4%E5%8D%9A.pdf) - 13、[深度学习在计算机视觉领域的应用进展_曾子力](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E5%9C%A8%E8%AE%A1%E7%AE%97%E6%9C%BA%E8%A7%86%E8%A7%89%E9%A2%86%E5%9F%9F%E7%9A%84%E5%BA%94%E7%94%A8%E8%BF%9B%E5%B1%95_%E6%9B%BE%E5%AD%90%E5%8A%9B.pdf) - 14、[面向深度学习硬件加速器的网络编译工具设计_严天炜](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E9%9D%A2%E5%90%91%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%8A%A0%E9%80%9F%E5%99%A8%E7%9A%84%E7%BD%91%E7%BB%9C%E7%BC%96%E8%AF%91%E5%B7%A5%E5%85%B7%E8%AE%BE%E8%AE%A1_%E4%B8%A5%E5%A4%A9%E7%82%9C.pdf) - 15、[TensorFlow内核剖析](https://github.com/horance-liu/tensorflow-internals/blob/master/tensorflow-internals.pdf) #### 02深度学习硬件平台 - 1、[背景_面向实时应用的深度学习研究综述_张政馗](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E9%9D%A2%E5%90%91%E5%AE%9E%E6%97%B6%E5%BA%94%E7%94%A8%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E5%BC%A0%E6%94%BF%E9%A6%97.pdf) - 2、[神经网络软硬件协同加速关键技术_王佩琪](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E8%BD%AF%E7%A1%AC%E4%BB%B6%E5%8D%8F%E5%90%8C%E5%8A%A0%E9%80%9F%E5%85%B3%E9%94%AE%E6%8A%80%E6%9C%AF_%E7%8E%8B%E4%BD%A9%E7%90%AA.pdf) - 3、[深度神经网络加速器体系结构概述_陈怡然](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E6%B7%B1%E5%BA%A6%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%8A%A0%E9%80%9F%E5%99%A8%E4%BD%93%E7%B3%BB%E7%BB%93%E6%9E%84%E6%A6%82%E8%BF%B0_%E9%99%88%E6%80%A1%E7%84%B6.pdf) - 4、[深度学习计算平台发展综述_郭乔进](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E8%AE%A1%E7%AE%97%E5%B9%B3%E5%8F%B0%E5%8F%91%E5%B1%95%E7%BB%BC%E8%BF%B0_%E9%83%AD%E4%B9%94%E8%BF%9B.pdf) - 5、[基于NVDLA的深度学习推断芯片研究_周高峰](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E5%9F%BA%E4%BA%8ENVDLA%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%8E%A8%E6%96%AD%E8%8A%AF%E7%89%87%E7%A0%94%E7%A9%B6_%E5%91%A8%E9%AB%98%E5%B3%B0.pdf) - 6、[基于FPGA的深度可分离卷积神经网络加速器设计研究_詹宏毅](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E5%9F%BA%E4%BA%8EFPGA%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%8F%AF%E5%88%86%E7%A6%BB%E5%8D%B7%E7%A7%AF%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%8A%A0%E9%80%9F%E5%99%A8%E8%AE%BE%E8%AE%A1%E7%A0%94%E7%A9%B6_%E8%A9%B9%E5%AE%8F%E6%AF%85.pdf) - 7、[人工智能加速体系结构综述_陈正博](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E4%BA%BA%E5%B7%A5%E6%99%BA%E8%83%BD%E5%8A%A0%E9%80%9F%E4%BD%93%E7%B3%BB%E7%BB%93%E6%9E%84%E7%BB%BC%E8%BF%B0_%E9%99%88%E6%AD%A3%E5%8D%9A.pdf) - 8、[基于FPGA的通用深度卷积神经网络加速器设计_管兆康](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E5%9F%BA%E4%BA%8EFPGA%E7%9A%84%E9%80%9A%E7%94%A8%E6%B7%B1%E5%BA%A6%E5%8D%B7%E7%A7%AF%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%8A%A0%E9%80%9F%E5%99%A8%E8%AE%BE%E8%AE%A1_%E7%AE%A1%E5%85%86%E5%BA%B7.pdf) - 9、[一种混合架构的高可靠性深度神经网络加速器_王乾龙](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_%E4%B8%80%E7%A7%8D%E6%B7%B7%E5%90%88%E6%9E%B6%E6%9E%84%E7%9A%84%E9%AB%98%E5%8F%AF%E9%9D%A0%E6%80%A7%E6%B7%B1%E5%BA%A6%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%8A%A0%E9%80%9F%E5%99%A8_%E7%8E%8B%E4%B9%BE%E9%BE%99.pdf) - 10、[FPGA加速器深度卷积神经网络优化计算方法_梁修壮](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_FPGA%E5%8A%A0%E9%80%9F%E5%99%A8%E6%B7%B1%E5%BA%A6%E5%8D%B7%E7%A7%AF%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E4%BC%98%E5%8C%96%E8%AE%A1%E7%AE%97%E6%96%B9%E6%B3%95_%E6%A2%81%E4%BF%AE%E5%A3%AE.pdf) - 11、[DNN加速器技术发展及航空计算系统应用展望_赵一煊](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6_DNN%E5%8A%A0%E9%80%9F%E5%99%A8%E6%8A%80%E6%9C%AF%E5%8F%91%E5%B1%95%E5%8F%8A%E8%88%AA%E7%A9%BA%E8%AE%A1%E7%AE%97%E7%B3%BB%E7%BB%9F%E5%BA%94%E7%94%A8%E5%B1%95%E6%9C%9B_%E8%B5%B5%E4%B8%80%E7%85%8A.pdf) - 12、[基于龙芯平台的深度学习算子库的构建与优化_尹宁](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E7%A1%AC%E4%BB%B6+%E6%A1%86%E6%9E%B6_%E5%9F%BA%E4%BA%8E%E9%BE%99%E8%8A%AF%E5%B9%B3%E5%8F%B0%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%97%E5%AD%90%E5%BA%93%E7%9A%84%E6%9E%84%E5%BB%BA%E4%B8%8E%E4%BC%98%E5%8C%96_%E5%B0%B9%E5%AE%81.pdf) - 13、[Designing Deep Learning Hardware Accelerator and efficiency evaluation](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/Designing%20Deep%20Learning%20Hardware%20Accelerator%20and%20efficiency%20evaluation.pdf) - 14、[2022中国人工智能芯片行业研究报告](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/2022%E4%B8%AD%E5%9B%BD%E4%BA%BA%E5%B7%A5%E6%99%BA%E8%83%BD%E8%8A%AF%E7%89%87%E8%A1%8C%E4%B8%9A%E7%A0%94%E7%A9%B6%E6%8A%A5%E5%91%8A.pdf) - 15、智能计算系统 [视频公开课](https://forum.cambricon.com/index.php?m=content&c=index&a=show&catid=154&id=232) [slide](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/%E7%AC%AC%E5%85%AD%E7%AB%A0%20%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E5%A4%84%E7%90%86%E5%99%A8%E5%8E%9F%E7%90%86.pdf) - 16、[深度神经网络 FPGA 设计进展、实现与展望_焦李成](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/%E6%B7%B1%E5%BA%A6%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%20FPGA%20%E8%AE%BE%E8%AE%A1%E8%BF%9B%E5%B1%95%E3%80%81%E5%AE%9E%E7%8E%B0%E4%B8%8E%E5%B1%95%E6%9C%9B_%E7%84%A6%E7%A4%BC%E6%88%90.pdf) - 17、[人工智能芯片技术白皮书(2018)](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/%E4%BA%BA%E5%B7%A5%E6%99%BA%E8%83%BD%E8%8A%AF%E7%89%87%E6%8A%80%E6%9C%AF%E7%99%BD%E7%9A%AE%E4%B9%A6%EF%BC%882018%EF%BC%89.pdf) - 18、[人工智能芯片研究报告(2018)](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/02%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%A1%AC%E4%BB%B6%E5%B9%B3%E5%8F%B0/aichip%E4%BA%BA%E5%B7%A5%E6%99%BA%E8%83%BD%E8%8A%AF%E7%89%87%E7%A0%94%E7%A9%B6%E6%8A%A5%E5%91%8A2018.pdf) - 19、[深度学习相关研究综述_张军阳:深度学习相关加速技术(2016)](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E8%83%8C%E6%99%AF_%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%9B%B8%E5%85%B3%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E5%BC%A0%E5%86%9B%E9%98%B3.pdf) #### 03深度学习编译系统设计 - 1、[Deep Learning Systems: Algorithms, Compilers, and Processors for Large-Scale Production](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/03%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E7%B3%BB%E7%BB%9F%E8%AE%BE%E8%AE%A1/Deep%20Learning%20Systems%20Algorithms%20Compilers%20and%20Processors%20for%20Large-Scale%20Production.pdf) - 2、[An_In-depth_Comparison_of_Compilers_for_Deep_Neural_Networks_on_Hardware](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/03%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E7%B3%BB%E7%BB%9F%E8%AE%BE%E8%AE%A1/An_In-depth_Comparison_of_Compilers_for_Deep_Neural_Networks_on_Hardware.pdf) - 3、[The Deep Learning Compiler: A Comprehensive Survey by Mingzhen Li et al., TPDS 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/The%20Deep%20Learning%20Compiler%20A%20Comprehensive%20Survey.pdf) - 4、[深度学习框架研究及初步实现_孙振](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E6%A1%86%E6%9E%B6_%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%A1%86%E6%9E%B6%E7%A0%94%E7%A9%B6%E5%8F%8A%E5%88%9D%E6%AD%A5%E5%AE%9E%E7%8E%B0_%E5%AD%99%E6%8C%AF.pdf) - 5、[智能遥感深度学习框架与模型设计_龚健雅](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E6%A1%86%E6%9E%B6_%E6%99%BA%E8%83%BD%E9%81%A5%E6%84%9F%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%A1%86%E6%9E%B6%E4%B8%8E%E6%A8%A1%E5%9E%8B%E8%AE%BE%E8%AE%A1_%E9%BE%9A%E5%81%A5%E9%9B%85.pdf) - 6、[新一代深度学习框架研究_于璠](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E6%96%B0%E4%B8%80%E4%BB%A3%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%A1%86%E6%9E%B6%E7%A0%94%E7%A9%B6_%E4%BA%8E%E7%92%A0.pdf) - 7、[基于TVM的移动端垃圾分类辅助识别系统_华林泉](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E5%9F%BA%E4%BA%8ETVM%E7%9A%84%E7%A7%BB%E5%8A%A8%E7%AB%AF%E5%9E%83%E5%9C%BE%E5%88%86%E7%B1%BB%E8%BE%85%E5%8A%A9%E8%AF%86%E5%88%AB%E7%B3%BB%E7%BB%9F_%E5%8D%8E%E6%9E%97%E6%B3%89.pdf) - 8、[基于CNN加速器的深度学习编译器设计与实现_张芳芳](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/%E5%9F%BA%E4%BA%8ECNN%E5%8A%A0%E9%80%9F%E5%99%A8%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E5%99%A8%E8%AE%BE%E8%AE%A1%E4%B8%8E%E5%AE%9E%E7%8E%B0_%E5%BC%A0%E8%8A%B3%E8%8A%B3.pdf) - 9、[AI 框架发展白皮书2022年](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/03%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E7%B3%BB%E7%BB%9F%E8%AE%BE%E8%AE%A1/AI%20%E6%A1%86%E6%9E%B6%E5%8F%91%E5%B1%95%E7%99%BD%E7%9A%AE%E4%B9%A62022%E5%B9%B4.pdf) - 10、[王秉睿,兰慧盈,陈云霁.深度学习编程框架[J].大数据,2018,4(04):56-63](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/01%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%80%E4%BB%8B/Programming%20f...ng%20algorithms_%E7%8E%8B%E7%A7%89%E7%9D%BF.pdf) - 11、[MLSys: The New Frontier of Machine Learning Systems](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/03%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E7%B3%BB%E7%BB%9F%E8%AE%BE%E8%AE%A1/MLSys%20The%20New%20Frontier%20of%20Machine%20Learning%20Systems.pdf) - 12、[Jittor: a novel deep learning framework with meta-operators and unified graph execution](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/03%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E7%B3%BB%E7%BB%9F%E8%AE%BE%E8%AE%A1/Jittor_%20a%20nov...aph%20execution_Shi-Min%20HU.pdf) #### 04编程接口 - 1、[FreeTensor: A Free-Form DSL with Holistic Optimizations for Irregular Tensor Programs by Shizhi Tang et al., PLDI 2022](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/04%E7%BC%96%E7%A8%8B%E6%8E%A5%E5%8F%A3/freetensor.pdf) #### 05类型系统与静态分析 - 1、[面向类型推导的Python类型标注分析_马洪跃](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/Python%20Type%20A...pe%20Inferrence_%E9%A9%AC%E6%B4%AA%E8%B7%83.pdf) - 2、[Python静态类型分析及其应用_董天聪](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/Static%20Type%20A...ng%20for%20Python_%E8%91%A3%E5%A4%A9%E8%81%AA.pdf) - 3、[浅谈Python中的可变与不可变数据类型_陈玲](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/Variable%20and%20...pes%20in%20Python_%E9%99%88%E7%8E%B2.pdf) - 4、[一种Python外部函数的静态类型推断方法_张昱](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/%E4%B8%80%E7%A7%8DPython%E5%A4%96%E9%83%A8%E5%87%BD%E6%95%B0%E7%9A%84%E9%9D%99%E6%80%81%E7%B1%BB%E5%9E%8B%E6%8E%A8%E6%96%AD%E6%96%B9%E6%B3%95%E5%8F%8A%E7%B3%BB%E7%BB%9F_%E5%BC%A0%E6%98%B1.pdf) - 5、[一种基于类型标注的Python程序类型推导方法_陈林](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/%E4%B8%80%E7%A7%8D%E5%9F%BA%E4%BA%8E%E7%B1%BB%E5%9E%8B%E6%A0%87%E6%B3%A8%E7%9A%84Python%E7%A8%8B%E5%BA%8F%E7%B1%BB%E5%9E%8B%E6%8E%A8%E5%AF%BC%E6%96%B9%E6%B3%95.pdf) - 6、[面向Python程序源代码的分析与编译优化研究_范浩杰](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/05%E7%B1%BB%E5%9E%8B%E7%B3%BB%E7%BB%9F%E4%B8%8E%E9%9D%99%E6%80%81%E5%88%86%E6%9E%90/%E9%9D%A2%E5%90%91Python%E7%A8%8B%E5%BA%8F%E6%BA%90%E4%BB%A3%E7%A0%81%E7%9A%84%E5%88%86%E6%9E%90%E4%B8%8E%E7%BC%96%E8%AF%91%E4%BC%98%E5%8C%96%E7%A0%94%E7%A9%B6.pdf) #### 06计算图生成 #### 07中间表示 - 1、[MLIR入门-张洪宾](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/MLIR%E5%85%A5%E9%97%A8-%E5%BC%A0%E6%B4%AA%E5%AE%BE.docx) - 2、[面向机器学习系统的张量中间表示_庄陈敏](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/A%20tensor%20inte...rning%20systems_%E5%BA%84%E6%AF%85%E6%95%8F.pdf) - 3、[An IR for ML Pipelines](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/An%20IR%20for%20ML%20Pipelines.pdf) - 4、[IR2013](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/IR2013.pdf) - 5、[Relay: A new ir for machine learning frameworks by Roesch J, Lyubomirsky S, Weber L, et al.2018](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/TVM_RelayIR.pdf) - 6、[TensorIR: An Abstraction for Automatic Tensorized Program Optimization by Siyuan Feng, Bohan Hou et al., arXiv 2022](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/TVM_TIR.pdf) - 7、[The Deep Learning Compiler: A Comprehensive Survey by Mingzhen Li et al., TPDS 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/The%20Deep%20Learning%20Compiler%20A%20Comprehensive%20Survey.pdf) - 8、[一种深度学习编译器的高层中间表示转换方法及相关装置](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/%E4%B8%80%E7%A7%8D%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E5%99%A8%E7%9A%84%E9%AB%98%E5%B1%82%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA%E8%BD%AC%E6%8D%A2%E6%96%B9%E6%B3%95%E5%8F%8A%E7%9B%B8%E5%85%B3%E8%A3%85%E7%BD%AE.pdf) - 9、[一种面向神经网络模型计算的中间表示方法和装置](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/%E4%B8%80%E7%A7%8D%E9%9D%A2%E5%90%91%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E6%A8%A1%E5%9E%8B%E8%AE%A1%E7%AE%97%E7%9A%84%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA%E6%96%B9%E6%B3%95%E5%92%8C%E8%A3%85%E7%BD%AE.pdf) - 10、[在编译器中构建基于图的中间表示的方法](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/%E5%9C%A8%E7%BC%96%E8%AF%91%E5%99%A8%E4%B8%AD%E6%9E%84%E5%BB%BA%E5%9F%BA%E4%BA%8E%E5%9B%BE%E7%9A%84%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA%E7%9A%84%E6%96%B9%E6%B3%95.pdf) - 11、[用于根据TensorFlow图构建编译器中间表示的方法和系统](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/%E7%94%A8%E4%BA%8E%E6%A0%B9%E6%8D%AETensorFlow%E5%9B%BE%E6%9E%84%E5%BB%BA%E7%BC%96%E8%AF%91%E5%99%A8%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA%E7%9A%84%E6%96%B9%E6%B3%95%E5%92%8C%E7%B3%BB%E7%BB%9F.pdf) - 12、[主流IR相关帖子](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/07%E4%B8%AD%E9%97%B4%E8%A1%A8%E7%A4%BA/%E5%8F%82%E8%80%83%E8%B5%84%E6%96%99/%E4%B8%BB%E6%B5%81IR%E7%9B%B8%E5%85%B3%E5%B8%96%E5%AD%90) #### 08自动微分 - 1、[Automatic Differentiation in Machine Learning: a Survey](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/Automatic%20Differentiation%20in%20Machine%20Learning%20a%20Survey.pdf) - 2、[A Brief Introduction to Automatic Differentiation for Machine Learning](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/A%20Brief%20Introduction%20to%20Automatic%20Differentiation%20for%20Machine%20Learning.pdf) - 3、[automatic differentiation in ml where we are and where we should begoing](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/automatic-differentiation-in-ml-where-we-are-and-where-we-should-be-going.pdf) - 4、[两种计算偏微分方程数值解的神经网络方法_韩祖良](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/%E4%B8%A4%E7%A7%8D%E8%AE%A1%E7%AE%97%E5%81%8F%E5%BE%AE%E5%88%86%E6%96%B9%E7%A8%8B%E6%95%B0%E5%80%BC%E8%A7%A3%E7%9A%84%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E6%96%B9%E6%B3%95_%E9%9F%A9%E7%A5%96%E8%89%AF.pdf) - 5、[微分万物:深度学习的启示](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/%E5%BE%AE%E5%88%86%E4%B8%87%E7%89%A9%EF%BC%9A%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%9A%84%E5%90%AF%E7%A4%BA_%E7%8E%8B%E7%A3%8A.pdf) - 6、[Some highlights on Source-to-Source Adjoint AD](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/08%E8%87%AA%E5%8A%A8%E5%BE%AE%E5%88%86/some_highlights_on_source_to_s.pdf) #### 09计算图优化 - 1、[AStitch: Enabling a New Multi-dimensional Optimization Space for Memory-Intensive ML Training and Inference on Modern SIMT Architectures by Zhen Zheng et al., ASPLOS 2022]( https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/Astitch.pdf) - 2、[Apollo: Automatic Partition-based Operator Fusion through Layer by Layer Optimization by Jie Zhao et al., MLSys 2022](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/MLSys-2022-apollo-automatic-partition-based-operator-fusion-through-layer-by-layer-optimization-Paper.pdf) - 3、[Cortex: A Compiler for Recursive Deep Learning Models by Pratik Fegade et al., MLSys 2021](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/MLSys21-cortex.pdf) - 4、[DISC: A Dynamic Shape Compiler for Machine Learning Workloads by Kai Zhu et al., EuroMLSys 2021](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/DISC.pdf) - 5、[GSO_基于图神经网络的深...学习计算图子图替换优化框架_苗旭鹏](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/GSO_%E5%9F%BA%E4%BA%8E%E5%9B%BE%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E7%9A%84%E6%B7%B1...%E5%AD%A6%E4%B9%A0%E8%AE%A1%E7%AE%97%E5%9B%BE%E5%AD%90%E5%9B%BE%E6%9B%BF%E6%8D%A2%E4%BC%98%E5%8C%96%E6%A1%86%E6%9E%B6_%E8%8B%97%E6%97%AD%E9%B9%8F.pdf) - 6、[Equality Saturation for Tensor Graph Superoptimization by Yichen Yang et al., MLSys 2021](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/Equality%20Saturation%20for%20Tensor%20Graph%20Superoptimization.pdf) - 7、[FusionStitching: Boosting Memory IntensiveComputations for Deep Learning Workloads by Zhen Zheng et al., arXiv 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/FusionStitching.pdf) - 8、[Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning by Woosuk Kwon et al., Neurips 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/NeurIPS-2020-nimble-lightweight-and-parallel-gpu-task-scheduling-for-deep-learning-Paper.pdf) - 9、[TensorFlow 设计白皮书](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/Tensorflow%E8%AE%BE%E8%AE%A1%E7%99%BD%E7%9A%AE%E4%B9%A6.pdf) - 10、[MindSpore技术白皮书](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/MindSpore_white_paperV1.1.pdf) - 11、[Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks by Lingxiao Ma et al., OSDI 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/osdi20-Rammer.pdf) - 12、[Relay: A High-Level Compiler for Deep Learning by Jared Roesch et al., arXiv 2019](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/Relay.pdf) - 13、[The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding by Pratik Fegade et al., MLSys 2022](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/MLSys-2022-the-cora-tensor-compiler-compilation-for-ragged-tensors-with-minimal-padding-Paper.pdf) - 14、[Roller: Fast and Efficient Tensor Compilation for Deep Learning by Hongyu Zhu et al., OSDI 2022](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/osdi22-Roller.pdf) - 15、[TensorFlow Eager论文](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/TensorFlow%20Eager%E8%AE%BA%E6%96%87.pdf) - 16、[TensorFlow 控制流](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/09%E8%AE%A1%E7%AE%97%E5%9B%BE%E4%BC%98%E5%8C%96/TensorFlow%E6%8E%A7%E5%88%B6%E6%B5%81.pdf) #### 10算子生成与优化 - 1、[TVM: An Automated End-to-End Optimizing Compiler for Deep Learning by Tianqi Chen et al., OSDI 2018](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/1.TVM2017.pdf) - 2、[Relay: A new ir for machine learning frameworks by Roesch J, Lyubomirsky S, Weber L, et al.2018](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/2.Relay18.pdf) - 3、[Ansor: Generating High-Performance Tensor Programs for Deep Learning by Lianmin Zheng et al., OSDI 2020](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/Ansor.pdf) - 4、[GSO_基于图神经网络的深度学习计算图子图替换优化框架_苗旭鹏](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/GSO_%E5%9F%BA%E4%BA%8E%E5%9B%BE%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E7%9A%84%E6%B7%B1...%E5%AD%A6%E4%B9%A0%E8%AE%A1%E7%AE%97%E5%9B%BE%E5%AD%90%E5%9B%BE%E6%9B%BF%E6%8D%A2%E4%BC%98%E5%8C%96%E6%A1%86%E6%9E%B6_%E8%8B%97%E6%97%AD%E9%B9%8F.pdf) - 5、[一种用于深度学习编译器中探索优化空间的加速方法_潘秋红](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/%E4%B8%80%E7%A7%8D%E7%94%A8%E4%BA%8E%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E5%99%A8%E4%B8%AD%E6%8E%A2%E7%B4%A2%E4%BC%98%E5%8C%96%E7%A9%BA%E9%97%B4%E7%9A%84%E5%8A%A0%E9%80%9F%E6%96%B9%E6%B3%95_%E6%BD%98%E7%A7%8B%E7%BA%A2.pdf) - 6、[一种面向深度学习编译器的高效算子优化方法_孟晓](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/%E4%B8%80%E7%A7%8D%E9%9D%A2%E5%90%91%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E5%99%A8%E7%9A%84%E9%AB%98%E6%95%88%E7%AE%97%E5%AD%90%E4%BC%98%E5%8C%96%E6%96%B9%E6%B3%95_%E5%AD%9F%E6%99%93.pdf) - 7、[基于ARM处理器的深度学习优化技术研究_罗纪杰](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/%E5%9F%BA%E4%BA%8EARM%E5%A4%84%E7%90%86%E5%99%A8%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E4%BC%98%E5%8C%96%E6%8A%80%E6%9C%AF%E7%A0%94%E7%A9%B6_%E7%BD%97%E7%BA%AA%E6%9D%B0.pdf) - 8、[面向航天异构平台的深度学习编译器加速技术优化_刘功晗](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/%E9%9D%A2%E5%90%91%E8%88%AA%E5%A4%A9%E5%BC%82%E6%9E%84%E5%B9%B3%E5%8F%B0%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BC%96%E8%AF%91%E5%99%A8%E5%8A%A0%E9%80%9F%E6%8A%80%E6%9C%AF%E4%BC%98%E5%8C%96_%E5%88%98%E5%8A%9F%E6%99%97.pdf) - 9、[The Deep Learning Compiler A Comprehensive Survey](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/%E7%BB%BC%E8%BF%B0/The%20Deep%20Learning%20Compiler%20A%20Comprehensive%20Survey.pdf) - 10、[xla_A LEARNED PERFORMANCE MODEL FOR TENSOR PROCESSING UNITS.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/xla/A%20LEARNED%20PERFORMANCE%20MODEL%20FOR%20TENSOR%20PROCESSING%20UNITS.pdf) - 11、[xla_Learned TPU Cost Model for XLA Tensor Programs.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/xla/Learned%20TPU%20Cost%20Model%20for%20XLA%20Tensor%20Programs.pdf) - 12、[Halide A Language and Compiler for Optimizing Parallelism.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/Halide%20A%20Language%20and%20Compiler%20for%20Optimizing%20Parallelism.pdf) - 13、[Tensor Comprehensions.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/Tensor%20Comprehensions.pdf) - 14、[GLOW.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/10%E7%AE%97%E5%AD%90%E7%94%9F%E6%88%90%E5%8F%8A%E4%BC%98%E5%8C%96/GLOW.pdf) #### 11内存优化 #### 12代码生成和执行 - 1、[swTVM Exploring the Automated Compilation for Deep Learning on Sunway Architecture.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/12%E4%BB%A3%E7%A0%81%E7%94%9F%E6%88%90%E5%92%8C%E6%89%A7%E8%A1%8C/swTVM%20Exploring%20the%20Automated%20Compilation%20for%20Deep%20Learning%20on%20Sunway%20Architecture.pdf) - 2、[一种基于TVM编译器的异构平台的部署方法及装置_吴金进.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/12%E4%BB%A3%E7%A0%81%E7%94%9F%E6%88%90%E5%92%8C%E6%89%A7%E8%A1%8C/%E4%B8%80%E7%A7%8D%E5%9F%BA%E4%BA%8ETVM%E7%BC%96%E8%AF%91%E5%99%A8%E7%9A%84%E5%BC%82%E6%9E%84%E5%B9%B3%E5%8F%B0%E7%9A%84%E9%83%A8%E7%BD%B2%E6%96%B9%E6%B3%95%E5%8F%8A%E8%A3%85%E7%BD%AE_%E5%90%B4%E9%87%91%E8%BF%9B.pdf) - 3、[基于申威处理器的深度学习算子自动优化系统及方法_杨广文.pdf](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/12%E4%BB%A3%E7%A0%81%E7%94%9F%E6%88%90%E5%92%8C%E6%89%A7%E8%A1%8C/%E5%9F%BA%E4%BA%8E%E7%94%B3%E5%A8%81%E5%A4%84%E7%90%86%E5%99%A8%E7%9A%84%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%AE%97%E5%AD%90%E8%87%AA%E5%8A%A8%E4%BC%98%E5%8C%96%E7%B3%BB%E7%BB%9F%E5%8F%8A%E6%96%B9%E6%B3%95_%E6%9D%A8%E5%B9%BF%E6%96%87.pdf) #### 13分布式训练 - 1、[分布式深度学习训练网络综述](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E8%AE%AD%E7%BB%83%E7%BD%91%E7%BB%9C%E7%BB%BC%E8%BF%B0_%E6%9C%B1%E6%B3%93%E7%9D%BF.pdf) - 2、[分布式机器学习算法收敛敏感性优化技术研究_范禹辰](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83_%E5%88%86%E5%B8%83%E5%BC%8F%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E7%AE%97%E6%B3%95%E6%94%B6%E6%95%9B%E6%95%8F%E6%84%9F%E6%80%A7%E4%BC%98%E5%8C%96%E6%8A%80%E6%9C%AF%E7%A0%94%E7%A9%B6_%E8%8C%83%E7%A6%B9%E8%BE%B0.pdf) - 3、[分布式机器学习集群系统性能优化_和新树](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83_%E5%88%86%E5%B8%83%E5%BC%8F%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E9%9B%86%E7%BE%A4%E7%B3%BB%E7%BB%9F%E6%80%A7%E8%83%BD%E4%BC%98%E5%8C%96_%E5%92%8C%E6%96%B0%E6%A0%91.pdf) - 4、[基于分布式机器学习的车辆定位方法研究_张恩龙](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83_%E5%9F%BA%E4%BA%8E%E5%88%86%E5%B8%83%E5%BC%8F%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E7%9A%84%E8%BD%A6%E8%BE%86%E5%AE%9A%E4%BD%8D%E6%96%B9%E6%B3%95%E7%A0%94%E7%A9%B6_%E5%BC%A0%E6%81%A9%E9%BE%99.pdf) - 5、[基于数据和模型混合传输的分布式机器学习框架_严佳媚](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83_%E5%9F%BA%E4%BA%8E%E6%95%B0%E6%8D%AE%E5%92%8C%E6%A8%A1%E5%9E%8B%E6%B7%B7%E5%90%88%E4%BC%A0%E8%BE%93%E7%9A%84%E5%88%86%E5%B8%83%E5%BC%8F%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E6%A1%86%E6%9E%B6_%E4%B8%A5%E4%BD%B3%E5%AA%9A.pdf) - 6、[分布式训练_面向分布式机器学习框架的通信优化技术研究_阳瑞](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83_%E9%9D%A2%E5%90%91%E5%88%86%E5%B8%83%E5%BC%8F%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E6%A1%86%E6%9E%B6%E7%9A%84%E9%80%9A%E4%BF%A1%E4%BC%98%E5%8C%96%E6%8A%80%E6%9C%AF%E7%A0%94%E7%A9%B6_%E9%98%B3%E7%91%9E.pdf) - 7、[深度学习分布式训练并行策略的研究_万知雨](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83%E5%B9%B6%E8%A1%8C%E7%AD%96%E7%95%A5%E7%9A%84%E7%A0%94%E7%A9%B6_%E4%B8%87%E7%9F%A5%E9%9B%A8.pdf) - 8、[深度学习并行分布式训练机制研究_姚琼杰](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E5%B9%B6%E8%A1%8C%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83%E6%9C%BA%E5%88%B6%E7%A0%94%E7%A9%B6_%E5%A7%9A%E7%90%BC%E6%9D%B0.pdf) - 9、[深度学习网络分布式训练方案研究与性能优化_张泽超](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E7%BD%91%E7%BB%9C%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83%E6%96%B9%E6%A1%88%E7%A0%94%E7%A9%B6%E4%B8%8E%E6%80%A7%E8%83%BD%E4%BC%98%E5%8C%96_%E5%BC%A0%E6%B3%BD%E8%B6%85.pdf) - 10、[深度神经网络并行化研究综述_朱虎明](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E6%B7%B1%E5%BA%A6%E7%A5%9E%E7%BB%8F%E7%BD%91%E7%BB%9C%E5%B9%B6%E8%A1%8C%E5%8C%96%E7%A0%94%E7%A9%B6%E7%BB%BC%E8%BF%B0_%E6%9C%B1%E8%99%8E%E6%98%8E.pdf) - 11、[针对深度学习模型的优化问题研究_郑书新](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E9%92%88%E5%AF%B9%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E6%A8%A1%E5%9E%8B%E7%9A%84%E4%BC%98%E5%8C%96%E9%97%AE%E9%A2%98%E7%A0%94%E7%A9%B6_%E9%83%91%E4%B9%A6%E6%96%B0.pdf) - 12、[自动并行_Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/1%E3%80%81Exploring%20Hidden%20Dimensions%20in%20Parallelizing%20Convolutional%20Neural%20Networks.pdf) - 13、[自动并行_BEYOND DATA AND MODEL PARALLELISM FOR DEEP NEURAL NETWORKS](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/2%E3%80%81BEYOND%20DATA%20AND%20MODEL%20PARALLELISM%20FOR%20DEEP%20NEURAL%20NETWORKS.pdf) - 14、[自动并行_Device Placement Optimization with Reinforcement Learning](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/3%E3%80%81Device%20Placement%20Optimization%20with%20Reinforcement%20Learning.pdf) - 15、[自动并行_OneFlow Redesign the Distributed Deep Learning Framework from Scratch](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/4%E3%80%81OneFlow%20Redesign%20the%20Distributed%20Deep%20Learning%20Framework%20from%20Scratch.pdf) - 16、[自动并行_Automap Towards Ergonomic Automated Parallelism for ML Models](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/5%E3%80%81Automap%20Towards%20Ergonomic%20Automated%20Parallelism%20for%20ML%20Models.pdf) - 17、[自动并行_Auto-MAP A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/6%E3%80%81Auto-MAP%20A%20DQN%20Framework%20for%20Exploring%20Distributed%20Execution%20Plans%20for%20DNN%20Workloads.pdf) - 18、[自动并行_GShard Scaling Giant Models with Conditional Computation and Automatic Sharding](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/7%E3%80%81GShard%20Scaling%20Giant%20Models%20with%20Conditional%20Computation%20and%20Automatic%20Sharding.pdf) - 19、[自动并行_GSPMD General and Scalable Parallelization for ML Computation Graphs](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/8%E3%80%81GSPMD%20General%20and%20Scalable%20Parallelization%20for%20ML%20Computation%20Graphs..pdf) - 20、[自动并行_TensorOpt Exploring the Tradeoffs in Distributed DNN Training with Auto-Parallelism](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/9%E3%80%81TensorOpt%20Exploring%20the%20Tradeoffs%20in%20Distributed%20DNN%20Training%20with%20Auto-Parallelism.pdf) - 21、[自动并行_贾志豪1、2 论文翻译](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/%E8%B4%BE%E5%BF%97%E8%B1%AA1%E3%80%812%20%E8%AE%BA%E6%96%87%E7%BF%BB%E8%AF%91.docx) - 22、[自动并行_论文简介](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/13%E5%88%86%E5%B8%83%E5%BC%8F%E8%AE%AD%E7%BB%83/%E8%87%AA%E5%8A%A8%E5%B9%B6%E8%A1%8C/%E8%AE%BA%E6%96%87%E7%AE%80%E4%BB%8B.docx) #### 14模型部署 - 1、[模型压缩_深度神经网络压缩与加速综述_纪荣嵘](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/模型压缩_深度神经网络压缩与加速综述_纪荣嵘.pdf) - 2、[模型压缩_深度神经网络模型压缩综述_李江昀](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/模型压缩_深度神经网络模型压缩综述_李江昀.pdf) - 3、[深度学习模型压缩与加速综述_高晗](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/深度学习模型压缩与加速综述_高晗.pdf) - 4、[深度神经网络压缩与加速综述_曾焕强](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/深度神经网络压缩与加速综述_曾焕强.pdf) - 5、[深度神经网络模型压缩综述_耿丽丽](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/深度神经网络模型压缩综述_耿丽丽.pdf) - 6、[深度网络模型压缩综述_雷杰](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/深度网络模型压缩综述_雷杰.pdf) - 7、[神经网络模型压缩方法综述_曹文龙](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/神经网络模型压缩方法综述_曹文龙.pdf) - 8、[面向智能移动端的深度学习模型压缩方法研究_秦晴](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/综述/面向智能移动端的深度学习模型压缩方法研究_秦晴.pdf) - 9、[基于在线量化的深度学习模型加速技术研究_秦阳](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/基于在线量化的深度学习模型加速技术研究_秦阳.pdf) - 10、[深度学习模型的权值交互量化算法研究_肖国麟](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/深度学习模型的权值交互量化算法研究_肖国麟.pdf) - 11、[知识蒸馏研究综述_黄震华](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/知识蒸馏/知识蒸馏研究综述_黄震华.pdf) - 12、[深度学习的轻量化神经网络结构研究综述_王军](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/轻量化卷积核/深度学习的轻量化神经网络结构研究综述_王军.pdf) - 13、[紧凑的神经网络模型设计研究综述_郎磊](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/轻量化卷积核/紧凑的神经网络模型设计研究综述_郎磊.pdf) - 14、[神经网络结构搜索方法综述_刘建伟](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/神经结构搜索/神经网络结构搜索方法综述_刘建伟.pdf) - 15、[神经结构搜索的研究进展综述_李航宇](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/神经结构搜索/神经结构搜索的研究进展综述_李航宇.pdf) - 16、[Neural Architecture Search A Survey](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/神经结构搜索/Neural%20Architecture%20Search%20A%20Survey.pdf) - 17、[NSGA-Net Neural Architecture Search using Multi-Objective Genetic Algorithm](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/神经结构搜索/NSGA-Net%20Neural%20Architecture%20Search%20using%20Multi-Objective%20Genetic%20Algorithm.pdf) - 18、[RENAS Reinforced Evolutionary Neural Architecture Search](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/神经结构搜索/RENAS%20Reinforced%20Evolutionary%20Neural%20Architecture%20Search%20.pdf) - 19、[面向多核众核平台的深度学习推理加速技术研究_朱科潜](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/推理加速/面向多核众核平台的深度学习推理加速技术研究_朱科潜.pdf) - 20、[推理_嵌入式设备可信运行环境机器学习服务的研究与实现_刘伟浩](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/模型安全/推理_嵌入式设备可信运行环境机器学习服务的研究与实现_刘伟浩.pdf) - 21、[模型安全_基于联邦学习的本地模型隐私保护研究_潘凯云](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/模型安全/模型安全_基于联邦学习的本地模型隐私保护研究_潘凯云.pdf) - 23、[Data-free Parameter Pruning for Deep Neural Networks 数据无关的非结构化剪枝,相似性度量,减去神经元](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/Data-free%20Parameter%20Pruning%20for%20Deep%20Neural%20Networks.pdf) - 24、[NIPS-2017-learning-to-prune-deep-neural-networks-via-layer-wise-optimal-brain-surgeon-Paper 非结构化,分层OBS](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/NIPS-2017-learning-to-prune-deep-neural-networks-via-layer-wise-optimal-brain-surgeon-Paper.pdf) - 25、[NIPS-2015-learning-both-weights-and-connections-for-efficient-neural-network-Paper 非结构化,迭代的剪枝](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/NIPS-2015-learning-both-weights-and-connections-for-efficient-neural-network-Paper.pdf) - 26、[NIPS-2016-dynamic-network-surgery-for-efficient-dnns-Paper 非结构化,可恢复误剪的连接](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/NIPS-2016-dynamic-network-surgery-for-efficient-dnns-Paper.pdf) - 27、[NeurIPS-2018-synaptic-strength-for-convolutional-neural-network-Paper 结构化,通过突触强度决定连接重要性](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/NeurIPS-2018-synaptic-strength-for-convolutional-neural-network-Paper.pdf) - 28、[SNIP SINGLE-SHOT NETWORK PRUNING BASED ON CONNECTION SENSITIVITY 非结构化,无需训练的剪枝](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/SNIP%20SINGLE-SHOT%20NETWORK%20PRUNING%20BASED%20ON%20CONNECTION%20SENSITIVITY.pdf) - 29、[Diversity Networks: Neural Network Compression Using Determinantal Point Processes 非结构化,对神经元剪枝,无需重训练](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/Diversity%20Networks:%20Neural%20Network%20Compression%20Using%20Determinantal%20Point%20Processes.pdf) - 30、[NIPS-2016-perforatedcnns-acceleration-through-elimination-of-redundant-convolutions-Paper 结构化剪枝,通过掩码矩阵,可加速推理](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/NIPS-2016-perforatedcnns-acceleration-through-elimination-of-redundant-convolutions-Paper.pdf) - 31、[Lebedev_Fast_ConvNets_Using_CVPR_2016_paper 结构化剪枝,组级别](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/Lebedev_Fast_ConvNets_Using_CVPR_2016_paper.pdf) - 32、[PRUNING FILTERS FOR EFFICIENT CONVNETS 结构化剪枝,filter级别,l1范数决定剪枝的过滤器](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/PRUNING%20FILTERS%20FOR%20EFFICIENT%20CONVNETS.pdf) - 33、[Yang_Designing_Energy-Efficient_Convolutional_CVPR_2017_paper 非结构化剪枝,一种基于能耗的剪枝方法](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/剪枝/Yang_Designing_Energy-Efficient_Convolutional_CVPR_2017_paper.pdf) - 34、[A Survey of Quantization Methods for Efficient Neural Network Inference 量化综述](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/A%20Survey%20of%20Quantization%20Methods%20for%20Efficient%20Neural%20Network%20Inference.pdf) - 35、[Dong_HAWQ_Hessian_AWare_Quantization_of_Neural_Networks_With_Mixed-Precision_ICCV_2019_paper 混合精度量化,基于Hessian矩阵,需重训练](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Dong_HAWQ_Hessian_AWare_Quantization_of_Neural_Networks_With_Mixed-Precision_ICCV_2019_paper.pdf) - 36、[MIXED-PRECISION NEURAL NETWORKS: A SURVEY 混合精度量化综述](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/MIXED-PRECISION%20NEURAL%20NETWORKS:%20A%20SURVEY.pdf) - 37、[Optimizing_the_Bit_Allocation_for_Compression_of_Weights_and_Activations_of_Deep_Neural_Networks 混合精度量化,量化权重与激活值,无需重训练,但需要数据](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Optimizing_the_Bit_Allocation_for_Compression_of_Weights_and_Activations_of_Deep_Neural_Networks.pdf) - 38、[Adaptive Quantization for Deep Neural Network 混合精度量化,权重,无需重训练,需要数据](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Adaptive%20Quantization%20for%20Deep%20Neural%20Network.pdf) - 39、[NeurIPS-2020-hawq-v2-hessian-aware-trace-weighted-quantization-of-neural-networks-Paper 混合精度量化,基于Hessian,权重与激活值共同量化,需要微调](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/NeurIPS-2020-hawq-v2-hessian-aware-trace-weighted-quantization-of-neural-networks-Paper.pdf) - 40、[HAWQ-V3: Dyadic Neural Network Quantization 混合精度量化,激活值与权重,考虑硬件指标](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/HAWQ-V3:%20Dyadic%20Neural%20Network%20Quantization.pdf) - 41、[SQUANT: ON-THE-FLY DATA-FREE QUANTIZATION VIA DIAGONAL HESSIAN APPROXIMATION 无数据的量化方法,对权重和激活值,效率很高 ](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/squant_on_the_fly_data_free_qu.pdf) - 42、[Efficient Execution of Quantized Deep Learning Models: A Compiler Approach TVM对预量化模型的处理方法](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Efficient%20Execution%20of%20Quantized%20Deep%20Learning%20Models:%20A%20Compiler%20Approach.pdf) - 43、[Nagel_Data-Free_Quantization_Through_Weight_Equalization_and_Bias_Correction_ICCV_2019_paper 无数据的量化方法,调整权重的值使得网络更易于量化,大量采用BN层的信息指导优化过程](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Nagel_Data-Free_Quantization_Through_Weight_Equalization_and_Bias_Correction_ICCV_2019_paper.pdf) - 44、[ZeroQ: A Novel Zero Shot Quantization Framework 无数据的权重混合精度量化方法,通过BN层信息生成数据,激活值采用固定精度量化](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/ZeroQ:%20A%20Novel%20Zero%20Shot%20Quantization%20Framework.pdf) - 45、[Data-Free_Network_Compression_via_Parametric_Non-uniform_Mixed_Precision_Quantization 无数据混合精度非均匀量化,通过位宽分配,优化非均匀网格](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Data-Free_Network_Compression_via_Parametric_Non-uniform_Mixed_Precision_Quantization.pdf) - 46、[Optimizing Information Theory Based Bitwise Bottlenecks for Efficient Mixed-Precision Activation Quantization 混合精度激活值量化,采用codebook,需要数据和重训练](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Optimizing%20Information%20Theory%20Based%20Bitwise%20Bottlenecks%20for%20Efficient%20Mixed-Precision%20Activation%20Quantization.pdf) - 47、[Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision 混合精度量化,对敏感通道采用多点量化,需要数据](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Post-training%20Quantization%20with%20Multiple%20Points:%20Mixed%20Precision%20without%20Mixed%20Precision.pdf) - 48、[Improving Neural Network Quantization without Retraining using Outlier Channel Splitting 量化时对权重和激活值中异常值的处理方法,将其进行分割两部分,保留异常值](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/14模型部署/量化/Improving%20Neural%20Network%20Quantization%20without%20Retraining%20using%20Outlier%20Channel%20Splitting.pdf) #### 会议论文集 - 1、ACT 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/ACT2022/ATC2022_list.pdf) [论文集](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/ACT2022/ATC2022.pdf) - 2、OSDI 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/OSDI2022/OSDI2022_list.pdf) [论文集](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/OSDI2022/OSDI2022.pdf) - 3、ASPLOS 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/ASPLOS2022/ASPLOS2022_list.pdf) [论文集1](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/ASPLOS2022/ASPLOS2022_1.pdf) [论文集2](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/ASPLOS2022/ASPLOS2022_2.pdf) - 4、PLDI 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/PLDI2022/PLDI2022_list.pdf) [论文集](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/PLDI2022/PLDI2022.pdf) - 5、POPL 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/POPL2022/POPL2022_list.pdf) [论文集](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/POPL2022/POPL2022.pdf) - 6、CGO 2022 [论文列表](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/CGO2022/CGO2022_list.pdf) [论文集](https://gitee.com/wanglei07/DeepLearningCompiling/raw/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/CGO2022/CGO2022.pdf) - 7、MLSys 2022 [论文列表/集](https://gitee.com/wanglei07/DeepLearningCompiling/blob/master/%E4%BC%9A%E8%AE%AE%E8%AE%BA%E6%96%87%E9%9B%86/MLSys2022/MLSys2022_list.md) - 8、PACT 2022 - 9、SC 2022