
[Pten]Refactor the Elementwise_add Kernel #37043

Merged Nov 12, 2021 (12 commits)

Conversation

@YuanRisheng (Contributor) commented Nov 8, 2021

PR types: Others

PR changes: OPs

Describe

Background

The elementwise family of kernels needs to be migrated to the new Tensor compute library, and elementwise_add is the first kernel of that family to be migrated. This PR migrates the elementwise_add compute logic together with the basic components that most elementwise kernels depend on. It touches a fairly large number of files and lays the foundation for the subsequent elementwise migrations.

Main migration steps

1. Migration of the basic elementwise components

[figure: diagram of the basic elementwise component migration]

2. Migration of the elementwise_add kernel

[figure: diagram of the elementwise_add kernel migration]

@paddle-bot-old (bot) commented Nov 8, 2021

Thanks for your contribution!
Please wait for the CI result first. See the Paddle CI Manual for details.

int axis = PackTensorsIntoVector<T>(ctx, &ins, &outs);
LaunchElementwiseCudaKernel<ElementwiseType::kBinary, T, T>(
cuda_ctx, ins, &outs, axis, AddFunctor<T>());
auto* x = ctx.Input<framework::LoDTensor>("X");
Contributor: Can this function be removed now? The kernel in paddle/fluid/operators/elementwise/elementwise_add_op.h below could be reused directly, by just registering an extra copy for CUDA at registration time.

Contributor Author: done

@@ -20,6 +20,13 @@ limitations under the License. */
#include "paddle/fluid/operators/math/blas.h"
#include "paddle/fluid/operators/math/math_function.h"

#include "paddle/fluid/framework/pten_utils.h"

// only can include the headers in paddle/top/api dirs
Contributor: The directory in this comment needs to be updated; it should now be paddle/pten/include.

Contributor Author: done

@@ -29,6 +29,11 @@ limitations under the License. */
#include "paddle/fluid/platform/gpu_info.h"
#include "paddle/fluid/platform/transform.h"

// only can include the headers in paddle/top/api dirs
Contributor: Same as above.

Contributor Author: done

(*post) *= x_dims[i];
}
pten::general::get_mid_dims(x_dims, y_dims, axis, pre, n, post,
is_run_common_broadcast);
}

inline int GetElementwiseIndex(const int *x_dims_array, const int max_dim,
Contributor: Is it necessary to keep this wrapper shell?

Contributor Author: done

}
std::vector<const pten::DenseTensor *> pt_inputs;
std::vector<pten::DenseTensor *> pt_outputs;
// *_tmp for cache DenseTensor
Contributor: Could a TODO comment be added explaining that this is written this way because DenseTensor does not yet support copy construction, and that it will be optimized later?

Contributor Author: done

@@ -3,3 +3,5 @@ cc_library(linalg_cpu SRCS linalg.cc DEPS dense_tensor kernel_context kernel_fac
cc_library(creation_cpu SRCS creation.cc DEPS dense_tensor kernel_context kernel_factory eigen_function)
cc_library(utils_cpu SRCS utils.cc DEPS dense_tensor kernel_context kernel_factory memory convert_utils)
cc_library(manipulation_cpu SRCS manipulation.cc DEPS dense_tensor kernel_context kernel_factory utils_cpu unary)
cc_library(nn_cpu SRCS nn.cc DEPS dense_tensor kernel_context kernel_factory blas eigen_function)
add_subdirectory(funcs)
Contributor: The original functions directory could be renamed to funcs; wouldn't it be better not to create a new funcs subdirectory under every directory?

Contributor Author: done

@@ -4,10 +4,12 @@ if(WITH_GPU)
nv_library(creation_cuda SRCS creation.cu DEPS eigen_function dense_tensor kernel_context kernel_factory)
nv_library(utils_cuda SRCS utils.cu DEPS dense_tensor kernel_context kernel_factory memory convert_utils)
nv_library(manipulation_cuda SRCS manipulation.cu DEPS dense_tensor kernel_context kernel_factory utils_cuda unary)
elseif(WITH_ROCM)
nv_library(nn_cuda SRCS nn.cu DEPS dense_tensor kernel_context kernel_factory eigen_function)
elseif(WITH_ROCM)
Contributor: The indentation here is a bit off.

Contributor Author: done

int64_t post_;
};

// #if defined(__NVCC__) || defined(__HIPCC__)
Contributor: What is going on with these commented-out lines?

Contributor Author: done

namespace general {

using DDim = paddle::framework::DDim;
using CPUContext = paddle::platform::CPUDeviceContext;
Contributor: This uses the CPU context; why is it placed under general?

Contributor Author: These are methods shared by the CPU and GPU code paths, which is why they live under general.


#pragma once

#include "paddle/pten/kernels/cuda/funcs/elementwise/elementwise_broadcast.cu.h"
Contributor: Same as above. Alternatively, how about adding cpu and cuda subdirectories under functions, rather than creating new subdirectories inside the kernel-level device directories? This part will need further adjustment later anyway. Also, if these files all live under the cuda directory, can the cu part of the file suffix be dropped?

Contributor Author: done

namespace paddle {
namespace experimental {

Tensor elementwise_add(const Tensor& x, const Tensor& y, int axis) {
Contributor: The API does not seem to have this axis parameter.

Contributor Author: done

@MingMingShangTian (Contributor) left a comment

LGTM

@chenwhql (Contributor) left a comment

LGTM

@@ -2020,7 +1856,8 @@ void FusedElemwiseAndActComputeWithBroadcast(
axis = (y_dim.size() == 0) ? x_dim.size() : axis;

int pre, n, post, is_run_common_broadcast;
get_mid_dims(x_dim, y_dim, axis, &pre, &n, &post, &is_run_common_broadcast);
pten::general::get_mid_dims(x_dim, y_dim, axis, &pre, &n, &post,
@chenwhql (Contributor) commented Nov 11, 2021

Function naming should be unified according to the code style, using CamelCase. The naming here was already nonstandard, so it can be changed in a follow-up PR.

@Avin0323 (Contributor) left a comment

LGTM for PR-CI-OP-benchmark

@chenwhql chenwhql merged commit c131034 into PaddlePaddle:develop Nov 12, 2021
@YuanRisheng YuanRisheng deleted the elementwise_add_refactor branch November 19, 2021 02:41