[NPU] support npu profiler #31684

zhiqiu · 2021-03-17T06:45:51Z

PR types

New features

PR changes

APIs

Describe

[NPU] support npu profiler

function 1: paddle profiler
function 2: paddle timeline
function 3: ACL profiler
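
As a rough illustration of what a timeline view consumes, the sketch below converts a list of recorded host-side events into the Chrome Trace Event Format, which `chrome://tracing` can load. It is not the code from this PR: the `Event` class, the `to_chrome_trace` helper, the lane names, and the sample timings are all hypothetical and chosen only for the example.

```python
import json
from dataclasses import dataclass


@dataclass
class Event:
    """One recorded span. All fields are illustrative, not Paddle's schema."""
    name: str
    start_us: float
    end_us: float
    lane: str  # hypothetical grouping label, e.g. "CPU" or "NPU wait (host)"


def to_chrome_trace(events):
    """Build a Chrome Trace Event Format dict of complete ("X") events."""
    lane_ids = {}
    trace_events = []
    for ev in events:
        # Give each lane its own pid so the viewer draws it as a separate row.
        pid = lane_ids.setdefault(ev.lane, len(lane_ids))
        trace_events.append({
            "name": ev.name,
            "cat": ev.lane,
            "ph": "X",                      # complete event: start + duration
            "ts": ev.start_us,              # timestamps are in microseconds
            "dur": ev.end_us - ev.start_us,
            "pid": pid,
            "tid": 0,
        })
    return {"traceEvents": trace_events}


# Made-up sample data, only to exercise the converter.
events = [
    Event("matmul (launch)", 0.0, 120.0, "CPU"),
    Event("stream wait", 120.0, 450.0, "NPU wait (host)"),
]
trace = to_chrome_trace(events)
print(json.dumps(trace, indent=2))
```

Writing the printed JSON to a file and loading it in a trace viewer should show the two spans on separate rows.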

paddle-bot-old · 2021-03-17T06:46:49Z

Thanks for your contribution!
Please wait for the CI results first. See Paddle CI Manual for details.

pangyoki

Does paddle profiler show only CPU time and GPU time?
Is there no way to measure NPU kernel compute time?

liym27

LGTM

zhiqiu · 2021-04-01T05:43:31Z

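Does paddle profiler show only CPU time and GPU time?
Is there no way to measure NPU kernel compute time?

Yes. The NPU has no API comparable to CUDA's CUPTI, so the profiler cannot collect kernel times on its own.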
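
When per-kernel device timings are not available, one common fallback is to time things from the host: record how long the launch call takes and, separately, how long the host blocks waiting for the stream to finish. The sketch below illustrates that idea only. `FakeStream` is a hypothetical stand-in that simulates asynchronous work with a thread, and `record_event` is an illustrative helper, not Paddle's API or the implementation in this PR.

```python
import threading
import time
from contextlib import contextmanager

records = []  # (event name, elapsed seconds) pairs, in recording order


@contextmanager
def record_event(name):
    """Record the host-side wall time spent inside the with-block."""
    start = time.perf_counter()
    try:
        yield
    finally:
        records.append((name, time.perf_counter() - start))


class FakeStream:
    """Hypothetical async device stream; work is simulated with sleeping threads."""

    def __init__(self):
        self._pending = []

    def launch(self, seconds):
        # Returns immediately, like an asynchronous kernel launch.
        worker = threading.Thread(target=time.sleep, args=(seconds,))
        worker.start()
        self._pending.append(worker)

    def wait(self):
        # Blocks the host until all launched work has finished.
        for worker in self._pending:
            worker.join()
        self._pending.clear()


stream = FakeStream()
with record_event("launch"):
    stream.launch(0.05)   # the host only pays the launch overhead here
with record_event("wait"):
    stream.wait()         # most of the simulated device time shows up here

timings = dict(records)
for name, seconds in records:
    print(f"{name}: {seconds * 1e3:.2f} ms")
```

In this simulation the launch span is short and the wait span absorbs most of the work, which is why recording the wait separately is useful when the device itself cannot report kernel durations.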

* support npu profiler
* add python api
* fix bugs
* add wrapper for incomplete type
* update profile proto
* record npu wait
* add xpu placeholder

…to develop (#32294)

* [NPU] support GarbageCollector for npu (#31874)
  * support GarbageCollector for npu
  * fix typo
  * fix gather_grad
  * disable NPUDefaultStreamGarbageCollector on NPU
* [NPU] support npu for memcpy op (#31808)
  * support npu for memcpy op
  * add ut
  * fix ut
  * fix typo
* 【NPU】fix bug of using temp vector (#31963)
* fix bug when beta1_pow on cpu (#31995)
* [NPU] support npu profiler (#31684)
  * support npu profiler
  * add python api
  * fix bugs
  * add wrapper for incomplete type
  * update profile proto
  * record npu wait
  * add xpu placeholder
* fix adam (#32016)
* [NPU] enable async copy and add wait before sync operation (#31956)
  * enable async copy and add wait before sync operation
  * remove unneccessary wait
  * add FillNpuTensorWithConstant
  * refine
  * fix fill_constant
  * make TensorFromVector/TensorToVector sync
* [NPU] Support dataloader on npu place. (#31867)
* [NPU] Wait on NPUPlace (#32086)
* [NPU] fix cast op (#32121)
  * fix npu kernel of cast op to handle casting to same dtype
  * add comments
* [NPU] support cann 20.3 (#32044)
  * fix compile problem on cann 20.3
  * fix ut
  * fix test_mul
  * fix check_finite_and_scale
  * fix lookup_table_v2_grad
  * fix cmake
  * support print op
* [NPU] Support npu save load (#31893)
  * support save load for NPU
  * add save load npu unittest
  * support np.array transform in NPU
  * fix errors
  * delete dygraph in unittest
  * add Wait
  * fix unittest
  * fix review comment
  * fix unittest problem
  * fix little problem
* change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196)
  * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace
  * refine code
* fix NPUDeviceContext in all c++ unittest (#32198)
  * fix NPUDeviceContext in all c++ unittest
  * refine log

  Co-authored-by: pangyoki <pangyoki@126.com>
* [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994)
  * enable async copy and add wait before sync operation
  * remove unneccessary wait
  * add FillNpuTensorWithConstant
  * refine
  * fix fill_constant
  * change TensorFromVector to FillNpuTensorWithConstant
  * fix ignored api
  * delete extra unittest
  * fix little error
  * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu
  * change TensorCopySync to TensorCopy
  * delete useless Wait and add StreamWait
  * fix npu_stream error
  * fix check_finite_and_unscale_op_npu TensorCopy
  * only save stream wait
  * fix NPUDeviceContext in all c++ unittest
  * delete wait

  Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
* delete useless unittest file (#32206)
* Fix op test (#32231)
* fix conditional block (#32243)
* fix adam bug again (#32246)
* fix compile
* fix ut
* fix ut

Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
Co-authored-by: pangyoki <pangyoki@126.com>

zhiqiu force-pushed the dev/npu_profiler branch from c74251e to b2d8c8b on March 29, 2021 07:46

zhiqiu added 3 commits March 29, 2021 07:55

support npu profiler

7cc4d17

add python api

b5d92f9

fix bugs

0911405

zhiqiu force-pushed the dev/npu_profiler branch from b2d8c8b to 0911405 on March 29, 2021 08:04

zhiqiu added 4 commits March 29, 2021 09:25

add wrapper for incomplete type

87af7a0

update profile proto

2b5daa3

record npu wait

4028b41

add xpu placeholder

e02d0f8

pangyoki approved these changes Mar 30, 2021

liym27 approved these changes Mar 30, 2021

zhiqiu merged commit 6503ef5 into PaddlePaddle:ascendrc Apr 1, 2021

zhiqiu added a commit to zhiqiu/Paddle that referenced this pull request Apr 15, 2021

[NPU] support npu profiler (PaddlePaddle#31684)

2b4d669

* support npu profiler
* add python api
* fix bugs
* add wrapper for incomplete type
* update profile proto
* record npu wait
* add xpu placeholder
