From f77253ae522e7b48f5cccb362d32d83a0cf95dfe Mon Sep 17 00:00:00 2001 From: yanhuiling Date: Fri, 21 Jan 2022 02:27:44 +0000 Subject: [PATCH 1/3] =?UTF-8?q?=E5=88=A0=E9=99=A4=E6=96=87=E4=BB=B6=20docs?= =?UTF-8?q?/en/RELEASENOTE/RELEASENOTE.md?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- docs/en/RELEASENOTE/RELEASENOTE.md | 411 ----------------------------- 1 file changed, 411 deletions(-) delete mode 100644 docs/en/RELEASENOTE/RELEASENOTE.md diff --git a/docs/en/RELEASENOTE/RELEASENOTE.md b/docs/en/RELEASENOTE/RELEASENOTE.md deleted file mode 100644 index 2b00b4a7739..00000000000 --- a/docs/en/RELEASENOTE/RELEASENOTE.md +++ /dev/null @@ -1,411 +0,0 @@ -# FrameworkPTAdapter 2.0.3 Release Notes -- [FrameworkPTAdapter 2.0.3](#frameworkptadapter-2-0-3md) - - [Before You Start](#before-you-startmd) - - [New Features](#new-featuresmd) - - [Modified Features](#modified-featuresmd) - - [Resolved Issues](#resolved-issuesmd) - - [Known Issues](#known-issuesmd) - - [Compatibility](#compatibilitymd) -- [FrameworkPTAdapter 2.0.2](#frameworkptadapter-2-0-2md) - - [Before You Start](#before-you-start-0md) - - [New Features](#new-features-1md) - - [Modified Features](#modified-features-2md) - - [Resolved Issues](#resolved-issues-3md) - - [Known Issues](#known-issues-4md) - - [Compatibility](#compatibility-5md) -

FrameworkPTAdapter 2.0.3

- -- **[Before You Start](#before-you-startmd)** - -- **[New Features](#new-featuresmd)** - -- **[Modified Features](#modified-featuresmd)** - -- **[Resolved Issues](#resolved-issuesmd)** - -- **[Known Issues](#known-issuesmd)** - -- **[Compatibility](#compatibilitymd)** - - -

Before You Start

- -This framework is modified based on the open-source PyTorch 1.5.0 developed by Facebook, inherits native PyTorch features, and uses NPUs for dynamic image training. Models are adapted by operator granularity, code can be reused, and current networks can be ported and used on NPUs with only device types or data types modified. - -PyTorch 1.8.1 is supported by this version and later, and this version inherits the features of PyTorch 1.5.0 and provides the same functions, except for the Profiling tool. In addition, it optimizes the backend operator adaptation. Currently, PyTorch 1.8.1 supports only the ResNet-50 network model. - -

New Features

- -**Table 1** Features supported by PyTorch - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Level-1 Feature

-

Level-2 Feature

-

Description

-

PyTorch features adapted to NPUs

-

PyTorch 1.8.1

-

Added PyTorch 1.8.1. Currently, only the ResNet-50 network is supported, including the training scenario for distributed data parallel (DDP).

-

Python 3.8

-

Supported compilation and use of Python 3.8.

-

Operator overflow/underflow detection tool

-

Supported IR-level operator overflow/underflow detection in the PyTorch framework. When an AI Core operator overflow/underflow occurs, the IR information is displayed.

-

OS compatibility

-

Supported UOS 20 1020e ARM.

-

Basic framework functions

-

Added the function of adapted operator development. For details, see the operator list.

-

Model training

-

CenterFace

-

-

-

PCBU

-

-

-

Net++

-

-

-

FCN8S

-

-

-

OSNetRetinaFace

-

-

-

PSPnet

-

-

-

EDSR

-

-

-

Tsm

-

-

-

pnasnet5large

-

-

-

Gaitset

-

-

-

fcn

-

-

-

Albert

-

-

-

AdvancedEast

-

-

-

ReidStrongBaseline

-

-

-

Fast-scnn

-

-

-

RDN

-

-

-

SRFlow

-

-

-

MGN

-

-

-

Roberta

-

-

-

RegNetY

-

-

-

VoVNet-39

-

-

-

RegNetX

-

-

-

RefineNet

-

-

-

RefineDet

-

-

-

AlignedReID

-

-

-

FaceBoxes

-

-

-
- -

Modified Features

- -N/A - -

Resolved Issues

- -N/A - -

Known Issues

- - - - - - - - - - - - - - - - - - - - - - - -

Known Issue

-

Description

-

Data type support

-

NPUs do not support the input or output of the inf/nan data of the float16 type.

-

Data format

-

Dimensions cannot be reduced when the format larger than 4D is used.

-

Restrictions on collective communication

-

-

The graphs executed on different devices in a training job must be the same.

-

Allocation at only 1, 2, 4, or 8 processors is supported.

-

Only the int8, int32, float16, and float32 data types are supported.

-

Apex function

-

In the current version, Apex is implemented mainly using Python, and the customized optimization of CUDA kernel in Apex is not supported.

-
- -

Compatibility

- -Atlas 800 \(model 9010\): CentOS 7.6, Ubuntu 18.04, BC-Linux 7.6, Debian 9.9, Debian 10, and openEuler 20.03 LTS. - -Atlas 800 \(model 9000\): CentOS 7.6, Euler 2.8, Kylin v10, BC-Linux 7.6, OpenEuler 20.03 LTS, and UOS 20 1020e. - -

FrameworkPTAdapter 2.0.2

- -- **[Before You Start](#before-you-start-0md)** - -- **[New Features](#new-features-1md)** - -- **[Modified Features](#modified-features-2md)** - -- **[Resolved Issues](#resolved-issues-3md)** - -- **[Known Issues](#known-issues-4md)** - -- **[Compatibility](#compatibility-5md)** - - -

Before You Start

- -This framework is modified based on the open-source PyTorch 1.5.0 primarily developed by Facebook, inherits native PyTorch features, and uses NPUs for dynamic image training. Models are adapted by operator granularity, code can be reused, and current networks can be ported and used on NPUs with only device types or data types modified. - -

New Features

- -**Table 1** Features supported by PyTorch - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Level-1 Feature

-

Level-2 Feature

-

Description

-

Adapted training models

-

YOLOv4

-

-

-

YOLOv3

-

-

-

DB

-

-

-

RFCN

-

-

-

CRNN

-

-

-

Densenset161

-

-

-

Densenset191

-

-

-

PyTorch features adapted to NPUs

-

Basic framework functions

-

Added the function of adapted operator development. For details, see the operator list.

-

Model Accuracy Analyzer

-

Added the Model Accuracy Analyzer and supported training accuracy demarcation.

-

Ascend 710 AI Processor

-

Supported the online inference on Ascend 710 AI Processors.

-

OS compatibility

-

Supported Ubuntu 18.04.5 and openEuler 20.03 LTS.

-
- -

Modified Features

- -N/A - -

Resolved Issues

- -N/A - -

Known Issues

- - - - - - - - - - - - - - - - - - - - - - - -

Known Issue

-

Description

-

Data type support

-

NPUs do not support the input or output of the inf/nan data of the float16 type.

-

Data format

-

Dimensions cannot be reduced when the format larger than 4D is used.

-

Restrictions on collective communication

-

-

The graphs executed on different devices in a training job must be the same.

-

Allocation at only 1, 2, 4, or 8 processors is supported.

-

Only the int8, int32, float16, and float32 data types are supported.

-

Apex function

-

In the current version, Apex is implemented mainly using Python, and the customized optimization CUDA kernel in Apex is not supported.

-
- -

Compatibility

- -Atlas 800 \(model 9010\): CentOS 7.6/Ubuntu 18.04/BC-Linux 7.6/Debian 9.9/Debian 10/openEuler 20.03 LTS - -Atlas 800 \(model 9000\): CentOS 7.6/EulerOS 2.8/Kylin v10/BC-Linux 7.6/openEuler 20.03 LTS - -- Gitee From 316cc29ee2cfc3dd41a2c21f6d34a26d70120e0a Mon Sep 17 00:00:00 2001 From: yanhuiling Date: Fri, 21 Jan 2022 02:27:55 +0000 Subject: [PATCH 2/3] Updated this file --- .../FrameworkPTAdapter 2.0.4 Release Notes.md | 491 ++++++++++++++++++ 1 file changed, 491 insertions(+) create mode 100644 docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md diff --git a/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md b/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md new file mode 100644 index 00000000000..a5e93015ca1 --- /dev/null +++ b/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md @@ -0,0 +1,491 @@ +# FrameworkPTAdapter 2.0.4 Release Notes + +- [FrameworkPTAdapter 2.0.4 Release Notes](#frameworkptadapter-204-release-notes) + - [FrameworkPTAdapter 2.0.4](#frameworkptadapter-204) + - [Before You Start](#before-you-start) + - [New Features](#new-features) + - [Modified Features](#modified-features) + - [Resolved Issues](#resolved-issues) + - [Known Issues](#known-issues) + - [Compatibility](#compatibility) + - [FrameworkPTAdapter 2.0.3](#frameworkptadapter-203) + - [Before You Start](#before-you-start-1) + - [New Features](#new-features-1) + - [Modified Features](#modified-features-1) + - [Resolved Issues](#resolved-issues-1) + - [Known Issues](#known-issues-1) + - [Compatibility](#compatibility-1) + - [FrameworkPTAdapter 2.0.2](#frameworkptadapter-202) + - [Before You Start](#before-you-start-2) + - [New Features](#new-features-2) + - [Modified Features](#modified-features-2) + - [Resolved Issues](#resolved-issues-2) + - [Known Issues](#known-issues-2) + - [Compatibility](#compatibility-2) + +## FrameworkPTAdapter 2.0.4 +### Before You Start + +This framework is modified based on the open-source PyTorch 1.5.0 and 1.8.1 developed by Facebook, inherits native PyTorch features, and uses NPUs for dynamic image training. Models are adapted by operator granularity, code can be reused, and current networks can be ported and used on NPUs with only device types or data types modified. + +PyTorch 1.8.1 inherits the features of PyTorch 1.5.0. Their functions are basically the same, but PyTorch 1.8.1 provides better development experience for backend operator adaptation. Currently, PyTorch 1.8.1 supports only the ResNet-50 network model. +### New Features + +**Table 1** Features supported by PyTorch + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

Level-1 Feature

+

Level-2 Feature

+

Description

+

PyTorch 1.5.0 features adapted to NPUs

+

Optimization of Model Accuracy Analyzer

+

Supported mapping between IR and TBE operators and enabled the NPU dump data to be loaded to the GPU side for comparison.

+

E2E prof

+

Integrated the profile data obtained by the native PyTorch Profiling tool and CANN prof tool to implement end-to-end model and operator performance analysis.

+

Basic framework functions

+

Added the function of adapted operator development. For details, see the operator list.

+

PyTorch 1.8.1 features adapted to NPUs

+

AMP

+

Supported the native training with automatic mixed precision (AMP) of PyTorch.

+

Profiling

+

Supported the native profiling function of PyTorch.

+

OS compatibility

+

OS compatibility

+

Supported Ubuntu 20.04 (x86 and ARM) and EulerOS 2.10 (ARM).

+

Python version compatibility

+

Supported compilation and use of Python 3.9 (only in PyTorch 1.8.1).

+
+ +### Modified Features + +N/A +### Resolved Issues + +N/A +### Known Issues + + + + + + + + + + + + + + + + + + + + + + +

Known Issue

+

Description

+

Data type support

+

NPUs do not support the input or output of the inf/nan data of the float16 type.

+

Data format

+

Dimensions cannot be reduced when the format larger than 4D is used.

+

Restrictions on collective communication

+

+

The graphs executed on different devices in a training job must be the same.

+

Allocation at only 1, 2, 4, or 8 processors is supported.

+

Only the int8, int32, float16, and float32 data types are supported.

+

Apex function

+

In the current version, Apex is implemented mainly using Python, and the customized optimization of CUDA kernel in Apex is not supported.

+
+### Compatibility + +Atlas 800 (model 9010): CentOS 7.6, Ubuntu 18.04/2.04, BC-Linux 7.6, Debian 9.9, Debian 10, openEuler 20.03 LTS + +Atlas 800 (model 9000): CentOS 7.6, Ubuntu 18.04/2.04, EulerOS 2.8/2.10, Kylin V10, BC-Linux 7.6, openEuler 20.03 LTS, UOS 20 1020e +## FrameworkPTAdapter 2.0.3 +### Before You Start + +This framework is modified based on the open-source PyTorch 1.5.0 developed by Facebook, inherits native PyTorch features, and uses NPUs for dynamic image training. Models are adapted by operator granularity, code can be reused, and current networks can be ported and used on NPUs with only device types or data types modified. + +PyTorch 1.8.1 is supported by this version and later, and this version inherits the features of PyTorch 1.5.0 and provides the same functions, except for the Profiling tool. In addition, it optimizes the backend operator adaptation. Currently, PyTorch 1.8.1 supports only the ResNet-50 network model. +### New Features + +**Table 1** Features supported by PyTorch + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

Level-1 Feature

+

Level-2 Feature

+

Description

+

PyTorch features adapted to NPUs

+

PyTorch 1.8.1

+

Added PyTorch 1.8.1. Currently, only the ResNet-50 network is supported, including the training scenario for distributed data parallel (DDP).

+

Python 3.8

+

Supported compilation and use of Python 3.8.

+

Operator overflow/underflow detection tool

+

Supported IR-level operator overflow/underflow detection in the PyTorch framework. When an AI Core operator overflow/underflow occurs, the IR information is displayed.

+

OS compatibility

+

Supported UOS 20 1020e ARM.

+

Basic framework functions

+

Added the function of adapted operator development. For details, see the operator list.

+

Model training

+

CenterFace

+

-

+

PCBU

+

-

+

Net++

+

-

+

FCN8S

+

-

+

OSNetRetinaFace

+

-

+

PSPnet

+

-

+

EDSR

+

-

+

Tsm

+

-

+

pnasnet5large

+

-

+

Gaitset

+

-

+

fcn

+

-

+

Albert

+

-

+

AdvancedEast

+

-

+

ReidStrongBaseline

+

-

+

Fast-scnn

+

-

+

RDN

+

-

+

SRFlow

+

-

+

MGN

+

-

+

Roberta

+

-

+

RegNetY

+

-

+

VoVNet-39

+

-

+

RegNetX

+

-

+

RefineNet

+

-

+

RefineDet

+

-

+

AlignedReID

+

-

+

FaceBoxes

+

-

+
+ +### Modified Features + +N/A +### Resolved Issues + +N/A +### Known Issues + + + + + + + + + + + + + + + + + + + + + + +

Known Issue

+

Description

+

Data type support

+

NPUs do not support the input or output of the inf/nan data of the float16 type.

+

Data format

+

Dimensions cannot be reduced when the format larger than 4D is used.

+

Restrictions on collective communication

+

+

The graphs executed on different devices in a training job must be the same.

+

Allocation at only 1, 2, 4, or 8 processors is supported.

+

Only the int8, int32, float16, and float32 data types are supported.

+

Apex function

+

In the current version, Apex is implemented mainly using Python, and the customized optimization of CUDA kernel in Apex is not supported.

+
+### Compatibility + +Atlas 800 (model 9010): CentOS 7.6, Ubuntu 18.04, BC-Linux 7.6, Debian 9.9, Debian 10, and openEuler 20.03 LTS. + +Atlas 800 (model 9000): CentOS 7.6, Euler 2.8, Kylin v10, BC-Linux 7.6, OpenEuler 20.03 LTS, and UOS 20 1020e. + +## FrameworkPTAdapter 2.0.2 +### Before You Start + +This framework is modified based on the open-source PyTorch 1.5.0 primarily developed by Facebook, inherits native PyTorch features, and uses NPUs for dynamic image training. Models are adapted by operator granularity, code can be reused, and current networks can be ported and used on NPUs with only device types or data types modified. +### New Features + +**Table 1** Features supported by PyTorch + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

Level-1 Feature

+

Level-2 Feature

+

Description

+

Adapted training models

+

YOLOv4

+

-

+

YOLOv3

+

-

+

DB

+

-

+

RFCN

+

-

+

CRNN

+

-

+

Densenset161

+

-

+

Densenset191

+

-

+

PyTorch features adapted to NPUs

+

Basic framework functions

+

Added the function of adapted operator development. For details, see the operator list.

+

Model Accuracy Analyzer

+

Added the Model Accuracy Analyzer and supported training accuracy demarcation.

+

Ascend 710 AI Processor

+

Supported the online inference on Ascend 710 AI Processors.

+

OS compatibility

+

Supported Ubuntu 18.04.5 and openEuler 20.03 LTS.

+
+ +### Modified Features + +N/A +### Resolved Issues + +N/A +### Known Issues + + + + + + + + + + + + + + + + + + + + + + +

Known Issue

+

Description

+

Data type support

+

NPUs do not support the input or output of the inf/nan data of the float16 type.

+

Data format

+

Dimensions cannot be reduced when the format larger than 4D is used.

+

Restrictions on collective communication

+

+

The graphs executed on different devices in a training job must be the same.

+

Allocation at only 1, 2, 4, or 8 processors is supported.

+

Only the int8, int32, float16, and float32 data types are supported.

+

Apex function

+

In the current version, Apex is implemented mainly using Python, and the customized optimization CUDA kernel in Apex is not supported.

+
+ +### Compatibility + +Atlas 800 (model 9010): CentOS 7.6/Ubuntu 18.04/BC-Linux 7.6/Debian 9.9/Debian 10/openEuler 20.03 LTS + +Atlas 800 (model 9000): CentOS 7.6/EulerOS 2.8/Kylin v10/BC-Linux 7.6/openEuler 20.03 LTS + -- Gitee From 287c6e6cc3fb81a002bfb38168d3eb562f4e81bb Mon Sep 17 00:00:00 2001 From: yanhuiling Date: Fri, 21 Jan 2022 02:30:39 +0000 Subject: [PATCH 3/3] update docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md. --- docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md b/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md index a5e93015ca1..6acb0da1c20 100644 --- a/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md +++ b/docs/en/RELEASENOTE/FrameworkPTAdapter 2.0.4 Release Notes.md @@ -127,6 +127,7 @@ N/A + ### Compatibility Atlas 800 (model 9010): CentOS 7.6, Ubuntu 18.04/2.04, BC-Linux 7.6, Debian 9.9, Debian 10, openEuler 20.03 LTS @@ -355,6 +356,7 @@ N/A + ### Compatibility Atlas 800 (model 9010): CentOS 7.6, Ubuntu 18.04, BC-Linux 7.6, Debian 9.9, Debian 10, and openEuler 20.03 LTS. -- Gitee