diff --git a/tutorials/experts/source_en/debug/dump.md b/tutorials/experts/source_en/debug/dump.md
index 79179bea4968f13aa600265e2f89c6b4a11f8c67..8840b2eec79ebdf11b04065fbad59b32eb71bafe 100644
--- a/tutorials/experts/source_en/debug/dump.md
+++ b/tutorials/experts/source_en/debug/dump.md
@@ -574,7 +574,7 @@ MindSpore provides debugging capabilities for large networks through asynchronou
  - `dump_mode`: 0: all operator data in the network dumped out; 1: dump kernels data in kernels list. When overflow detection is enabled, the setting of this field becomes invalid, and Dump only saves the data of the overflow node.
  - `path`: The absolute path to save Dump data. When the graph compilation level is O0, MindSpore will create a new subdirectory for each step in the path directory.
  - `net_name`: The customized net name: "ResNet50".
- - `iteration`: Specify the iterations to dump, type is string. Use "|" to separate the step data of different intervals to be saved. For example, "0 | 5-8 | 100-120" represents dump the data of the 1st, 6th to 9th, and 101st to 121st steps. If iteration set to "all", data of every iteration will be dumped. When overflow detection is enabled for PyNative mode, it must be set to "all".
+ - `iteration`: Specify the iterations to dump, type is string. Use "|" to separate the step data of different intervals to be saved. For example, "0 | 5-8 | 100-120" represents dump the data of the 1st, 6th to 9th, and 101st to 121st steps. If iteration set to "all", data of every iteration will be dumped. When overflow detection is enabled for PyNative mode, it must be set to "all". When ACL dump is enabled, sink size can only be set to 1.
  - `saved_data`: Specify what data is to be dumped, type is string. Use "tensor" to dump tensor data, use "statistic" to dump tensor statistics, use "full" to dump both tensor data and statistics. Default setting is "tensor". Asynchronous statistics dump is only supported when `file_format` is set to `npy`, using "statistic" or "full" when `file_format` is set to `bin` will result in exception.
  - `input_output`: When set to 0, it means to Dump the operator's input and output; when set to 1, it means to Dump the operator's input; setting it to 2 means to Dump the output of the operator.
  - `kernels`: This item can be configured in two formats:
@@ -795,3 +795,4 @@ Through the asynchronous Dump function, the data files generated by the operator
 - Complex64 and complex128 only support saving as npy files, not as statistics information.
 - The Print operator has an input parameter with type of string, which is not a data type supported by Dump. Therefore, when the Print operator is included in the script, there will be an error log, which will not affect the saving data of other types.
 - When ACL dump is enabled, lite exception dump is not supported by using set_context(ascend_config={"exception_dump": "2"), while full exception dump is supported by using set_context(ascend_config={"exception_dump": "1").
+- When ACL dump is enabled, sink size can only be set to 1. Users can set the sink size through model.train() or ms.data_sink(). An introduction to sink mode can be found at 。
diff --git a/tutorials/experts/source_zh_cn/debug/dump.md b/tutorials/experts/source_zh_cn/debug/dump.md
index cb1cf8a20d07017fc3fff24169c6e86000a8aefb..782b003b96bc1a8103354abb1b5f1c850e63f554 100644
--- a/tutorials/experts/source_zh_cn/debug/dump.md
+++ b/tutorials/experts/source_zh_cn/debug/dump.md
@@ -574,7 +574,7 @@ MindSpore通过异步Dump提供了Ascend平台上大型网络的调试能力。
  - `dump_mode`:设置成0,表示Dump出该网络中的所有算子数据;设置成1,表示Dump`"kernels"`里面指定的算子数据或算子类型数据。开启溢出检测时,此字段的设置失效,Dump只会保存溢出节点的数据。
  - `path`:Dump保存数据的绝对路径。在图编译等级为O0时,MindSpore会在path目录下新建每个step的子目录。
  - `net_name`:自定义的网络名称,例如:"ResNet50"。
- - `iteration`:指定需要Dump的迭代。类型为str,用“|”分离要保存的不同区间的step的数据。如"0|5-8|100-120"表示Dump第1个,第6个到第9个, 第101个到第121个step的数据。指定“all”,表示Dump所有迭代的数据。PyNative模式开启溢出检测时,必须设置为"all"。
+ - `iteration`:指定需要Dump的迭代。类型为str,用“|”分离要保存的不同区间的step的数据。如"0|5-8|100-120"表示Dump第1个,第6个到第9个, 第101个到第121个step的数据。指定“all”,表示Dump所有迭代的数据。PyNative模式开启溢出检测时,必须设置为"all"。注意,使能ACL dump时,sink size只能设置为1。
  - `saved_data`: 指定Dump的数据。类型为str,取值成"tensor",表示Dump出完整张量数据;取值成"statistic",表示只Dump张量的统计信息;取值"full"代表两种都要。异步Dump统计信息只有在`file_format`设置为`npy`时可以成功,若在`file_format`设置为`bin`时选"statistic"或"full"便会错误退出。默认取值为"tensor"。
  - `input_output`:设置成0,表示Dump出算子的输入和算子的输出;设置成1,表示Dump出算子的输入;设置成2,表示Dump出算子的输出。
  - `kernels`:该项可以配置两种格式:
@@ -802,3 +802,5 @@ Dump生成的原始数据文件也可以使用MindSpore Insight的数据解析
 - complex64和complex128仅支持保存为npy文件,不支持保存为统计值信息。
 - Print算子内部有一个输入参数为string类型,string类型不属于Dump支持的数据类型,所以在脚本中包含Print算子时,会有错误日志,这不会影响其它类型数据的保存。
 - 使能ACL dump时,不支持同时使用set_context(ascend_config={"exception_dump": "2")配置轻量异常dump; 支持同时使用set_context(ascend_config={"exception_dump": "1")配置全量异常dump。
+- 使能ACL dump时,sink size只能设置为1。用户通常可以使用model.train()或ms.data_sink()接口配置sink size。下沉模式配置可参考使用说明链接 。
+