
Update passes in quant2_int8_mkldnn_pass #38912

Merged (3 commits into PaddlePaddle:develop on Jan 27, 2022)

Conversation

@wozna (Contributor) commented Jan 12, 2022

PR types

Bug fixes

PR changes

Others

Describe

This PR updates the passes that are applied to the graph during quant2_int8_mkldnn_pass. It turned out that many passes had been added to CpuPassStrategy and to the EnableMkldnn pass list, but they were never added to the aforementioned Python script. I used the same order of passes as presented in paddle_pass_builder.cc.
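For context, quant2_int8_mkldnn_pass applies named IR passes to the graph one by one through Paddle's pass registry. Below is a minimal sketch of that pattern, assuming a simplified apply_pass helper (the real script's _apply_pass also attaches a scope and pass attributes); the subset of pass names is taken from paddle_pass_builder.cc, but the function names here are illustrative:

```python
# Sketch: applying named IR passes to an IrGraph in the order given by
# CpuPassStrategy in paddle_pass_builder.cc. Simplified for illustration;
# the upstream helper also wires a scope and per-pass attributes.
from paddle.fluid import core


def apply_pass(graph, pass_name):
    # IrGraph wraps a C++ ir::Graph; registered IR passes run on graph.graph.
    ir_pass = core.get_pass(pass_name)
    ir_pass.apply(graph.graph)
    return graph


def optimize_fp32_graph(graph):
    # A small subset of the FP32 optimization passes, in the
    # paddle_pass_builder.cc order.
    for name in [
        'simplify_with_basic_ops_pass',
        'conv_bn_fuse_pass',
        'conv_eltwiseadd_bn_fuse_pass',
        'conv_transpose_bn_fuse_pass',
    ]:
        graph = apply_pass(graph, name)
    return graph
```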

@paddle-bot-old commented

Thanks for your contribution!
Please wait for the CI result first. See the Paddle CI Manual for details.

@lidanqing-intel (Contributor) commented

@wozna @pmajchrzak Please verify that this PR does not cause an accuracy or performance drop for the int8 models in our existing daily CI. Thanks!

sfraczek previously approved these changes on Jan 13, 2022

@sfraczek (Contributor) left a comment


LGTM

@lidanqing-intel (Contributor) commented

@wozna Hi, could you upload, or ask Piotr to upload, a spreadsheet showing that this PR does not cause any accuracy or performance drop on the existing int8 models CI?

@wozna (Contributor, Author) commented Jan 19, 2022

I can confirm that the recent changes don't cause any accuracy or performance change in any model from our CI. Earlier I had a problem with scale_matmul_fuse_pass: I couldn't understand why this pass had to be in _quantize_fp32_graph rather than in _optimize_fp32_graph. It turned out that we use a scale operator to gather the scale for matmul, and when I moved scale_matmul_fuse_pass, 12 matmuls weren't quantized because the scale for their input was missing. That change slightly improved accuracy on Ernie, from 0.791165 to 0.8000, but caused a performance drop of 1.5%. That's why I returned to the previous order, but moved cpu_quantize_placement_pass after the scale_matmul and reshape_transpose_matmul/V2 passes.
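A rough sketch of the final ordering described above, reusing the apply_pass helper from the earlier sketch. The function name and the exact v2 pass name are illustrative; only the ordering (fuse passes first, placement pass after) reflects the comment:

```python
# Sketch of the ordering described above: the scale_matmul and
# reshape_transpose_matmul(_v2) fuses run first so matmul inputs already
# carry their scales; only then does cpu_quantize_placement_pass mark ops
# for quantization. Running placement earlier left 12 matmuls unquantized
# because their input scale was still held by a separate scale op.
def quantize_fp32_graph(graph):
    graph = apply_pass(graph, 'scale_matmul_fuse_pass')
    graph = apply_pass(graph, 'reshape_transpose_matmul_mkldnn_fuse_pass')
    graph = apply_pass(graph, 'reshape_transpose_matmul_v2_mkldnn_fuse_pass')
    graph = apply_pass(graph, 'cpu_quantize_placement_pass')
    return graph
```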

@wozna (Contributor, Author) commented Jan 19, 2022

@lidanqing-intel @sfraczek Could you please review again?

@sfraczek (Contributor) left a comment

LGTM.

@lidanqing-intel (Contributor) left a comment

LGTM

@lidanqing-intel (Contributor) commented

@baoachun Could you please review this PR? Thanks

@wozna wozna requested a review from baoachun January 24, 2022 11:02
@lidanqing-intel (Contributor) commented

@baoachun Hi, could you please approve this PR? Some models from the BML team still need the save_quant_model.py solution for now, so we still need to maintain save_quant_model.py. Thanks

@baoachun (Contributor) left a comment

LGTM

@wozna wozna closed this Jan 26, 2022
@wozna wozna reopened this Jan 26, 2022
@lidanqing-intel lidanqing-intel merged commit 0e235e5 into PaddlePaddle:develop Jan 27, 2022
@wozna wozna deleted the update_passes branch February 24, 2023 16:06