Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support gpu mixed precision inference #40531

Merged
merged 6 commits into from
Mar 17, 2022
Merged

support gpu mixed precision inference #40531

merged 6 commits into from
Mar 17, 2022

Conversation

baoachun
Copy link
Contributor

PR types

Others

PR changes

Others

Describe

支持GPU混合精度推理

@paddle-bot-old
Copy link

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

/// \brief Turn on GPU fp16 precision.
///
///
void EnableUseGpuFp16();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Exp_EnableUseGpuFp16 接口名未来可能会更改为通用的EnableFp16

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已修改

///
/// \param op_list The operator type list.
///
void SetGpuFp16DisabledOp(std::unordered_set<std::string> op_list) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

是否把该接口参数放入Exp_EnableUseGpuFp16中比较合适?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已修改

"conv_elementwise_add_fuse_pass", //
#endif //
"transpose_flatten_concat_fuse_pass", //
"mixed_precision_configure_pass", //
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

能否和GpuPassStrategy默认构造用同一份pass列表呢?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

有些算子融合之后没有fp16的kernel,例如fc_fuse_pass

Copy link

@leiqing1 leiqing1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO:正式发板前,完善代码+补充文档。

Copy link
Contributor

@Superjomn Superjomn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Superjomn Superjomn merged commit 06fee99 into PaddlePaddle:develop Mar 17, 2022
JamesLim-sy pushed a commit to JamesLim-sy/Paddle that referenced this pull request Mar 17, 2022
liqitong-a pushed a commit to liqitong-a/Paddle that referenced this pull request Mar 17, 2022
@jiweibo jiweibo mentioned this pull request Jun 24, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants