update #9

Merged
merged 46 commits into from
Jun 1, 2021

Conversation

AnnaTrainingG
Owner

PR types

PR changes

Describe

jim19930609 and others added 30 commits May 26, 2021 15:54
Sigmoid: Out = Sigmoid(X)
SigmoidGrad: DX = DOut*(1-Out)*Out

[This Patch]
Out
DOut -> SigmoidGradGrad -> DOutNew
DDX                        DDOut

DDOut = (1-Out)*Out*DDX
DOutNew = (1-2*Out)*DOut*DDX
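The formulas in this commit message can be checked with a small scalar sketch. This is not the Paddle kernel: the function names below are hypothetical and plain Python floats stand in for tensors. It only illustrates that DDOut and DOutNew are the partial derivatives of DX with respect to DOut and Out, each scaled by DDX.

```python
import math


def sigmoid(x):
    # Out = Sigmoid(X)
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(out, dout):
    # DX = DOut * (1 - Out) * Out
    return dout * (1.0 - out) * out


def sigmoid_grad_grad(out, dout, ddx):
    # Hypothetical scalar stand-in for SigmoidGradGrad.
    # DDOut = (1 - Out) * Out * DDX      (d DX / d DOut, scaled by DDX)
    ddout = (1.0 - out) * out * ddx
    # DOutNew = (1 - 2*Out) * DOut * DDX (d DX / d Out, scaled by DDX)
    dout_new = (1.0 - 2.0 * out) * dout * ddx
    return dout_new, ddout


if __name__ == "__main__":
    out = sigmoid(0.3)
    dout, ddx, eps = 0.7, 1.5, 1e-6
    dout_new, ddout = sigmoid_grad_grad(out, dout, ddx)

    # Central finite differences of DX with respect to Out and DOut.
    fd_out = (sigmoid_grad(out + eps, dout) - sigmoid_grad(out - eps, dout)) / (2 * eps) * ddx
    fd_dout = (sigmoid_grad(out, dout + eps) - sigmoid_grad(out, dout - eps)) / (2 * eps) * ddx

    print(abs(dout_new - fd_out) < 1e-6, abs(ddout - fd_dout) < 1e-6)
```

Because DX is quadratic in Out and linear in DOut, the central differences agree with the analytic values up to floating-point rounding.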
* optimize OP's compilation time, test=develop

* add more op and run ci test, test=develop

* CUDA Kernel register in cc file, test=develop

* fix macros, test=develop

* fix undefined symbol error, test=develop

* fix compilation error and undefined symbol, test=develop

* fix compilation error on Windows, test=develop

* fix compilation error on Windows, test=develop
* refine ~npuOpRunner

* implement destructor and forbid copy

* use reference to avoid copy

* use const reference

* relax adam precision

* fix top_k
* modify matmul Op to complex template types

* remove complex64/128 header file
Change-Id: I2e82dfcee6a1d0512b94cebc32281123fa5bf597

* pretty print for datafeed error

Change-Id: I056a8b6f03608e96679a83846c97aed289cef7e6

* fix fleet dist infer ut
…y API error msg (#33003)

* Unify all external API error message mechanism and enhance third-party API error msg

* fix some comment

* fix some comment
* support ssd in PsCore

* remove log

* remove bz2

* default value

* code style

* parse table class

* code style

* add define
* modify kron OP to complex template types

* modify reshape, slice, trace, transpose OPs to complex template types

* modify to complex template types in eigen slice files

* change to complex template types for pad.cc and pad.cu

* format code style
* modify API nn.Bilinear's doc, test=develop

* speed up paddle.add paddle.nn.Linear, test=develop

* fix bug, test=develop
* run pre-commit

* use HOST or DEVICE instead of HOSTDEVICE in implementation of lgamma op

* add test for fp32

* add lgamma to op_threshold_white_list

* add cuda kernel for lgamma kernel

* modify numeric grad delta

* fix small English issue

* change LaunchElementwiseCudaKernel to LaunchSameDimsElementwiseCudaKernel
#33153)

* Add GLIBCXX_USE_CXX11_ABI flag

* fix typo

* fix typo
* modify to complex template types for fill_constant op

* modify to complex template types for py_layer, strided_slice and reduce_sum_op.part
* add more conditions

* dynamic shape

* ut

* correct conditions

* comment

* remove redundant op type

* remove redundant if
* fix uint8 check in assign

* fix assign unittests

* fix xpu test_assign
* support params groups, test=develop

* simplify updating opt attr

* update according to review
* improve group norm cpu precision and performance

* add unit test to group norm
wanghaoshuang and others added 16 commits May 31, 2021 19:56
* enhance error message for conv

* fix ci coverage
…33195)

* replace and remove complex64/128 types in custom OP and other files

* fix custom_tensor_test fail bug

* fix custom_conj_test fail bug

* fix dispatch_test_op build fail bug
* download_verify

* checksum
* Align download_filename with cached_filename
* fix syncbn
* fix spawn default nprocs get error

* polish error message
* add trt convert op: reshape

* reshape

* fix

* input: Shape,ShapeTensor

* Add some comments
* fix path error on Windows when precision switch is turned on

* fix error

* Update get_pr_ut.py

fix format error

* Update get_pr_ut.py
@AnnaTrainingG AnnaTrainingG merged commit b0cbcca into AnnaTrainingG:develop Jun 1, 2021
AnnaTrainingG pushed a commit that referenced this pull request Jun 9, 2022