[Paddle-TRT] support new quant format from slim #46022

zhoutianzi666 · 2022-09-14T03:43:49Z

PR types

Others

PR changes

Others

Describe

support new quant format(add QDQ before every op) from PaddleSlim
put identity_scale_op_clean_pass after quant
- when pattern QDQ-> scale -> QDQ -> scale arises, identity_scale_op_clean_pass will make it to QDQ -> QDQ, bugs arises later in delete_quant_dequant_linear_op_pass.
- so we must put identity_scale_op_clean_pass after delete_quant_dequant_linear_op_pass

目前此pr支持新格式的量化（所有op前都插入QDQ）。
改动主要是：

防止Q/DQ中共享的Scale权重被删除。
matmul_v2 支持 matrix * vector ，picodet中有。
- 同时顺带支持了vec*vec

将"identity_scale_op_clean_pass"移动到量化pass之后，避免在量化时出现QDQ->QDQ这样的结构，这会触发delete_quant_dequant_linear_op_pass的bug。本质原因还是关系到pattern匹配策略的问题。同46178 pr是一个问题。

…into new_new_slim

zhangjun

LGTM

zhoutianzi666 force-pushed the new_new_slim branch from 015041c to 9938330 Compare September 14, 2022 05:02

support new quant format form slim

1e31760

zhoutianzi666 force-pushed the new_new_slim branch from 9938330 to 1e31760 Compare September 14, 2022 05:17

paddle-bot-old bot added the contributor External developers label Sep 14, 2022

zhoutianzi666 added 7 commits September 19, 2022 02:34

commit

673d219

commit

b307355

Merge branch 'develop' into new_new_slim

4c9a5cb

Merge branch 'develop' into new_new_slim

abe0fd9

Merge branch 'develop' into new_new_slim

7c91d54

fix bug in matmul_v2

e814ca8

fix bug in matmul_v2

4365597

qingqing01 requested review from wanghaoshuang, zhangjun, yghstill, Wangzheee and qingqing01 September 30, 2022 07:53

zhoutianzi666 and others added 4 commits October 8, 2022 03:04

add matmul_v2 unitest

97c8fa6

Merge branch 'PaddlePaddle:develop' into new_new_slim

5f7c1df

add matmul_v2 unitest

a9d3a10

Merge branch 'new_new_slim' of https://github.com/zhoutianzi666/Paddle …

5b77de6

…into new_new_slim

zhoutianzi666 changed the title ~~[Paddle-TRT] support new quant format form slim~~ [Paddle-TRT] support new quant format from slim Oct 8, 2022

zhoutianzi666 added 3 commits October 8, 2022 06:44

add matmul_v2 unitest

d6a0d2d

add matmul_v2 unitest

7d3cf7b

Merge branch 'develop' into new_new_slim

587aa7f

zhoutianzi666 force-pushed the new_new_slim branch 3 times, most recently from 2a3950f to b7bd70e Compare October 9, 2022 01:56

support mat*vec,vec*vec in matmul_v2

8b63a02

zhoutianzi666 force-pushed the new_new_slim branch from b7bd70e to 8b63a02 Compare October 9, 2022 02:02

clean code in test_trt_convert_matmul_v2.py

eadbadd

zhangjun reviewed Oct 9, 2022

View reviewed changes

zhangjun approved these changes Oct 9, 2022

View reviewed changes

jiweibo approved these changes Oct 10, 2022

View reviewed changes

jiweibo merged commit 7987a90 into PaddlePaddle:develop Oct 10, 2022

zhoutianzi666 added a commit to zhoutianzi666/Paddle that referenced this pull request Oct 13, 2022

[Paddle-TRT] support new quant format from slim (PaddlePaddle#46022)

a3db645

zhoutianzi666 mentioned this pull request Oct 13, 2022

[Paddle-TRT][Cherry-Pick] support new quant format from slim (#46022) #46979

Merged

qingqing01 pushed a commit that referenced this pull request Oct 14, 2022

[Paddle-TRT] support new quant format from slim (#46022) (#46979)

b8677c0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Paddle-TRT] support new quant format from slim #46022

[Paddle-TRT] support new quant format from slim #46022

zhoutianzi666 commented Sep 14, 2022 •

edited

Loading

zhangjun left a comment

[Paddle-TRT] support new quant format from slim #46022

[Paddle-TRT] support new quant format from slim #46022

Conversation

zhoutianzi666 commented Sep 14, 2022 • edited Loading

PR types

PR changes

Describe

zhangjun left a comment

Choose a reason for hiding this comment

zhoutianzi666 commented Sep 14, 2022 •

edited

Loading