
Add graph apis #40809

Merged · 24 commits · Apr 2, 2022

Conversation

DesmonDay (Contributor) commented on Mar 22, 2022

PR types

New features

PR changes

APIs

Describe

Add graph_sample_neighbors API and graph_reindex API.
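As a rough illustration of what the reindex API does (a minimal pure-Python sketch of the semantics only, not the Paddle implementation; all names here are illustrative): given center nodes `x` and the flat list of their sampled neighbors, original node ids are mapped to contiguous local ids, with the center nodes numbered first.

```python
def graph_reindex(x, neighbors):
    """Map original node ids to contiguous local ids.

    x: list of center node ids; neighbors: flat list of their sampled
    neighbor ids. Centers are assigned local ids first, in order, then
    previously unseen neighbors in order of appearance.
    """
    node2local = {}
    for node in x:
        if node not in node2local:
            node2local[node] = len(node2local)
    reindexed = []
    for node in neighbors:
        if node not in node2local:
            node2local[node] = len(node2local)
        reindexed.append(node2local[node])
    # out_nodes[i] is the original id of local id i
    out_nodes = sorted(node2local, key=node2local.get)
    return reindexed, out_nodes

# Centers [100, 7]; their sampled neighbors flattened into one list.
reindexed, out_nodes = graph_reindex([100, 7], [7, 42, 100, 42])
```

Here `100` and `7` become local ids 0 and 1, the unseen neighbor `42` becomes 2, so `reindexed` is `[1, 2, 0, 2]` and `out_nodes` is `[100, 7, 42]`.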

// Note(daisiming): If using the buffer hashtable, the number of nodes
// of the input graph must be no larger than the maximum value of int32.
AddInput("HashTable_Value",
Reviewer comment:

Suggestion: do not use an underscore in the middle of the name.

"X": x,
"Neighbors": neighbors,
"Count": count,
"HashTable_Value": None,
Reviewer comment:

If the API does not allow passing these two values in, in what situation would these two Dispensable inputs ever be used?

should be the same with `x`.
count (Tensor): The neighbor count of the input nodes `x`. And the
data type should be int32.
value_buffer (Tensor|None): Value buffer for hashtable. The data type should
Reviewer comment:

Please unify the wording for value_buffer and index_buffer with the description of name: either say it can be None or say it is optional.

"""
if flag_buffer_hashtable:
if value_buffer is None or index_buffer is None:
raise ValueError(f"`value_buffer` and `index_buffer` should not"
Reviewer comment:

Has the actual increase in GPU memory from this buffer design been measured?

Author reply (DesmonDay):

We measured it in experiments. The extra GPU memory scales with the number of graph nodes, and since the buffer is filled with values in the int32 range, it does not actually cost much memory. The main concern is that when the number of graph nodes exceeds the int32 maximum, the buffer method may no longer be applicable, so users can also reindex in the non-buffer way.
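The trade-off being discussed can be sketched in pure Python (illustrative names; the real op works on GPU memory): instead of a hash map, the buffer variant preallocates one int32 slot per graph node to hold that node's local id, which is why memory scales with the node count and why node ids must fit in int32.

```python
def reindex_with_buffer(x, neighbors, num_graph_nodes):
    # One slot per graph node: this is the memory cost discussed above.
    # 0 means "not seen yet"; otherwise the slot stores (local id + 1).
    value_buffer = [0] * num_graph_nodes
    out_nodes = []

    def local_id(node):
        if value_buffer[node] == 0:
            out_nodes.append(node)
            value_buffer[node] = len(out_nodes)
        return value_buffer[node] - 1

    for node in x:                      # centers get the first local ids
        local_id(node)
    reindexed = [local_id(n) for n in neighbors]
    return reindexed, out_nodes
```

Lookup is a direct array index rather than a hash probe, at the cost of a buffer whose length is the total number of graph nodes.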

thrust::transform(
output_count, output_count + bs, output_count, MaxFunctor(sample_size));
}
int total_sample_num = thrust::reduce(output_count, output_count + bs);
Reviewer comment:

A question here: if sample_size < 0, the sample size seems hard to control and the result will have some randomness.

Author reply (DesmonDay):

If sample_size < 0, we sample all neighbors by default, so there is indeed some variability. The main point of this design is to support some PGL APIs that directly return all neighbors.
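The sample_size < 0 convention, together with the MaxFunctor clamp on the counts above, can be sketched in pure Python (CSC-style inputs; names are illustrative, not the Paddle API):

```python
import random

def sample_neighbors(row, colptr, nodes, sample_size):
    """CSC-style neighbor sampling: row holds neighbor ids, and
    row[colptr[i]:colptr[i + 1]] spans node i's neighbor list.
    sample_size < 0 means "return all neighbors"."""
    out, out_count = [], []
    for node in nodes:
        neigh = row[colptr[node]:colptr[node + 1]]
        if sample_size < 0 or len(neigh) <= sample_size:
            picked = list(neigh)                   # take everything
        else:
            picked = random.sample(neigh, sample_size)
        out.extend(picked)
        # clamp mirrors MaxFunctor: count = min(len(neigh), sample_size)
        out_count.append(len(picked))
    return out, out_count
```

With sample_size = -1 every neighbor is returned, so per-node counts vary with the degree of each node, which is the variability the review is pointing at.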

constexpr int TILE_SIZE = BLOCK_WARPS * 16;
const dim3 block(WARP_SIZE, BLOCK_WARPS);
const dim3 grid((bs + TILE_SIZE - 1) / TILE_SIZE);
SampleKernel<T,
Reviewer comment:

It looks like this regular sampler is the version without a hash table, not the buffer version.

Author reply (DesmonDay):

Yes, there are two separate sampling versions. Because fisher_yates sampling depends on a buffer as large as the number of edges and therefore costs more GPU memory, the original sampling method is kept as well.
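For reference, the Fisher–Yates variant trades memory for uniform sampling without replacement: a scratch buffer the size of the neighbor list is partially shuffled and its first k entries are taken. A pure-Python sketch of the technique (illustrative only, not the CUDA kernel):

```python
import random

def fisher_yates_sample(neighbors, k, rng=random):
    """Sample min(k, n) neighbors without replacement via a partial
    Fisher-Yates shuffle. The scratch buffer has one slot per edge,
    which is the extra memory cost mentioned above."""
    buf = list(neighbors)           # edge-sized scratch buffer
    n = len(buf)
    k = min(k, n)
    for i in range(k):
        j = rng.randrange(i, n)     # pick from the unshuffled tail
        buf[i], buf[j] = buf[j], buf[i]
    return buf[:k]
```

Only the first k positions need to be shuffled, but the whole neighbor list must be materialized in the buffer first, hence the memory trade-off against the plain sampler.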

}

template <typename T>
__global__ void FisherYatesSampleKernel(const uint64_t rand_seed,
Reviewer comment:

Under UVA mode this memory access pattern does not look very efficient; consider changing it to one-warp access in a follow-up.

Author reply (DesmonDay):

OK, I will try changing that in the next PR.

ReindexSrcOutput<T><<<grid, block, 0, dev_ctx.stream()>>>(
thrust::raw_pointer_cast(src_outputs), num_edges, hashtable_value);

ResetBufferHashTable<T, Context>(dev_ctx,
Reviewer comment:

I am curious why a dedicated kernel is needed for the reset. Why can't the memory region just be reset directly?
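One plausible rationale (my assumption, not confirmed in this thread), sketched in pure Python: a targeted reset clears only the hashtable slots the current batch actually touched, which is O(#unique nodes in the batch), whereas blindly zeroing the whole node-sized buffer would cost O(#graph nodes) on every call.

```python
def reset_buffer_hashtable(value_buffer, touched_nodes):
    # Clear only the slots written during this batch's reindex,
    # instead of zeroing the entire node-sized buffer each time.
    # (Hypothetical sketch of the idea behind ResetBufferHashTable.)
    for node in touched_nodes:
        value_buffer[node] = 0
```

On GPU this still needs a kernel launch, but one sized to the batch's unique nodes rather than to the whole graph.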

row_data, col_ptr_data, x_data, &output, &output_count, sample_size, bs);
out->Resize({static_cast<int>(output.size())});
T* out_data = dev_ctx.template Alloc<T>(out);
std::copy(output.begin(), output.end(), out_data);
Reviewer comment:

It looks like a direct copy is unnecessary here either; use ResetHolder or ShareDataWith instead, since std::shared_ptr<phi::Allocation> holder_ is a shared_ptr.

Author reply (DesmonDay):

Will fix all of these together in the next PR.

out_count->Resize({bs});
int* out_count_data = dev_ctx.template Alloc<int>(out_count);
std::copy(output_count.begin(), output_count.end(), out_count_data);
}
Reviewer comment:

Same as above.

std::copy(dst.begin(), dst.end(), reindex_dst_data);
out_nodes->Resize({static_cast<int>(unique_nodes.size())});
T* out_nodes_data = dev_ctx.template Alloc<T>(out_nodes);
std::copy(unique_nodes.begin(), unique_nodes.end(), out_nodes_data);
Reviewer comment:

Please check whether these copies are necessary.

out->set_dtype(row.dtype());
out_count->set_dims({-1});
out_count->set_dtype(DataType::INT32);
}
Reviewer comment:

Has static graph mode been tested?

Author reply (DesmonDay):

The unit tests include static graph tests, so it should be OK.

wawltor (Contributor) left a comment:

LGTM

@ZeyuChen ZeyuChen merged commit b0398c8 into PaddlePaddle:develop Apr 2, 2022