Fix bug of reduce_sum op. #46045
Conversation
1. Change fluid header files to phi header files.
2. Delete useless code.
3. Fix code style problems.
@@ -29,10 +78,42 @@ void SumRawKernel(const Context& dev_ctx,
  if (out_dtype == DataType::UNDEFINED && out->dtype() != x.dtype()) {
    out_dtype = out->dtype();
  }
  phi::Reduce<T, kps::AddFunctor, kps::IdentityFunctor>(
  if (x.numel() > INT_MAX) {
Use `x.numel() >= std::numeric_limits<int32_t>::max()` here instead of comparing against the `INT_MAX` macro.
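The suggested guard can be sketched as a small standalone helper (the helper name is hypothetical, not from the PR): prefer `std::numeric_limits<int32_t>::max()` over the C macro, and dispatch to the 64-bit indexing path once the element count no longer fits in `int32_t`.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper sketching the reviewer's suggestion: switch to the
// 64-bit indexing path when numel no longer fits in int32_t.
bool NeedsInt64IndexPath(int64_t numel) {
  return numel >= static_cast<int64_t>(std::numeric_limits<int32_t>::max());
}
```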
// Calculate
eigen_out_tensor.device(*dev_ctx.eigen_device()) =
    eigen_x_tensor.sum(eigen_reduce_dim);
std::vector<int64_t> final_out_dim;
I prefer saving the original output dims first, like:
auto origin_out_dims = out->dims();
// some other codes...
out->Resize(origin_out_dims);
The code above is more readable.
// Resize Input Tensor
auto new_x = x;
int added_dims = EigenDimSize - x.dims().size();
std::vector<int64_t> new_dim(added_dims, 1);
Change `new_dim` to `new_x_dim`, which is more readable. Also change the type of `new_dim` from `std::vector<int64_t>` to `std::array<int64_t, EigenDimSize>`.
// Create Out Tensor
dev_ctx.Alloc<T>(out);
// Resize Out Tensor
std::vector<int64_t> new_reduced_dim(added_dims, 1);
Rename `new_reduced_dim` to `new_out_dim`. Use `std::array<int64_t, kReduceOutRank>` instead of `std::vector<int64_t>`.
new_dim.push_back(x.dims().at(i));
}
new_x.Resize(phi::make_ddim(new_dim));
auto eigen_x_tensor = EigenTensor<T, EigenDimSize>::From(x);
Change `x` to `new_x` here.
dev_ctx, x, reduce_all, dims.GetData(), keep_dim, out_dtype, out);
}
if (x.numel() > INT_MAX) {
#ifndef PADDLE_WITH_XPU_KP
It seems that `out_dtype` is not considered here. How about raising an exception directly if a valid `out_dtype` is provided?
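One way to act on this suggestion, sketched with a plain C++ exception (the real kernel would use Paddle's error-raising macros; the enum and function here are stand-ins, not the PHI types):

```cpp
#include <stdexcept>

// Stand-in for phi::DataType, just for this sketch.
enum class DataType { UNDEFINED, FLOAT32, FLOAT64 };

// Hypothetical guard: the large-tensor Eigen fallback does not perform an
// out_dtype cast, so reject a cast request instead of silently ignoring it.
void CheckOutDtypeSupported(DataType out_dtype, DataType x_dtype) {
  if (out_dtype != DataType::UNDEFINED && out_dtype != x_dtype) {
    throw std::runtime_error(
        "out_dtype casting is not supported on the large-tensor path");
  }
}
```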
dev_ctx, x, reduce_all, dims.GetData(), keep_dim, out_dtype, out);
}
if (x.numel() > std::numeric_limits<int32_t>::max()) {
#ifndef PADDLE_WITH_XPU_KP
Wrong macro position. It should be:
#ifndef PADDLE_WITH_XPU_KP
if (x.numel() > std::numeric_limits<int32_t>::max()) {
// Some other codes..
return;
}
#endif
// The original codes...
LGTM. Great work.
PR types
Bug fixes
PR changes
OPs
Describe
Fix a bug in the reduce_sum op: when `input.numel() > INT32_MAX`, its result is wrong. Use the Eigen tensor `sum` function to calculate the reduction in that case.