
[Feature] x_dot_x builtin kernel support #831

Merged: 50 commits into dmlc:master from classicsong:masked-mm, Sep 14, 2019

Conversation

@classicsong (Contributor) commented Sep 4, 2019

Description

Adding support for u_dot_v, u_dot_e, v_dot_e, v_dot_u, e_dot_u, and e_dot_v as builtin kernels.
The implementation is based on the current binary_reduce structure.
#659
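
For context, a minimal sketch of how these builtins are meant to be used (the graph, feature names, and shapes here are made up for illustration; API as in recent DGL versions):

```python
import torch
import dgl
import dgl.function as fn

# Toy graph: 3 nodes, 3 edges (hypothetical example data).
g = dgl.graph(([0, 1, 2], [1, 2, 0]))
g.ndata['h'] = torch.randn(3, 16)

# u_dot_v computes, for each edge, the dot product of the source ('u')
# and destination ('v') node features and stores it on the edge.
g.apply_edges(fn.u_dot_v('h', 'h', 'score'))
print(g.edata['score'].shape)  # torch.Size([3, 1]); dot reduces the feature dim to 1
```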

Tasks

  • Forward of [x]_add_[x], [x]_sub_[x], [x]_mul_[x], [x]_div_[x] still works
  • Backward of [x]_add_[x], [x]_sub_[x], [x]_mul_[x], [x]_div_[x] still works
  • Forward of [x]_dot_[x]
  • Backward of [x]_dot_[x] (see the gradient sketch after this list)
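
For the dot backward (the last two tasks), the gradients follow the standard product rule: for z = u·v, ∂z/∂u = v and ∂z/∂v = u. A quick autograd sanity check of that rule (illustrative only, independent of the DGL kernels):

```python
import torch

u = torch.randn(16, requires_grad=True)
v = torch.randn(16, requires_grad=True)
z = torch.dot(u, v)
z.backward()
# For z = u . v, the gradient w.r.t. u is v, and vice versa.
assert torch.allclose(u.grad, v) and torch.allclose(v.grad, u)
```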

Checklist

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature])
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented
  • To the best of my knowledge, examples are either not affected by this change
    or have been fixed to be compatible with this change
  • The related issue is referenced in this PR

Changes

yzh119 and others added 30 commits August 6, 2019 16:13
Add a note that the PinSage model example under
example/pytorch/recommendation only works with Python 3.6+,
as its dataset loader depends on the stanfordnlp package,
which works only with Python 3.6+.
…ide.

1. make dgl.nn.xxx frame agnostic
2. make test.backend include dgl.nn modules
3. modify test_edge_softmax of test/mxnet/test_nn.py and
    test/pytorch/test_nn.py to work on both CPU and GPU
1. clear all agnostic-related code in dgl.nn
2. make test_graph_conv agnostic to cpu/gpu
Add base control flow code.
classicsong and others added 2 commits September 4, 2019 21:35
TODO:
1. make sure x_add_x, x_sub_x, x_mul_x, x_div_x work
2. let x_dot_x work
3. make sure backward of x_add_x, x_sub_x, x_mul_x, x_div_x work
4. let x_dot_x backward work
@VoVAllen (Collaborator) commented Sep 4, 2019

The MXNet CI test may have some problems due to adapting to the new version. I'll try to fix this tomorrow.

Resolved review threads:
  • python/dgl/function/message.py
  • src/kernel/binary_reduce_common.h
  • src/kernel/cpu/binary_reduce_impl.h
@yzh119 (Member) commented Sep 5, 2019

@jermainewang, I've verified the correctness with STT. The GPU memory footprint is about the same, but the builtin function is 2x slower than my custom kernels (node-parallel strategy).

@classicsong classicsong changed the title [WIP][Feature] x_dot_x builtin kernel support [Feature] x_dot_x builtin kernel support Sep 6, 2019
@yzh119 (Member) left a review

I'm ok with this PR.

Resolved review thread: python/dgl/kernel.py
static DGLDEVICE DGLINLINE DType Call(DType *lhs, DType *rhs, int64_t len) {
  DType out = 0;
  // simple vector dot vector
#pragma unroll
  for (int64_t i = 0; i < len; ++i)
    out += lhs[i] * rhs[i];
  return out;
}
Review comment (Member):
There is already a #pragma unroll at the graph level in minigun. A nested #pragma unroll usually gives no benefit. Consider removing this one.
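
For intuition, the Call functor above is the per-edge reduction that x_dot_x performs; in Python terms it amounts to a plain dot product over the feature dimension (illustrative only; lhs and rhs stand in for one edge's feature vectors):

```python
import torch

lhs = torch.randn(16)
rhs = torch.randn(16)
out = torch.dot(lhs, rhs)  # same as (lhs * rhs).sum()
```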

@jermainewang (Member) left a review

LGTM. Thanks!

@yzh119 yzh119 merged commit 0a56d65 into dmlc:master Sep 14, 2019
@classicsong classicsong deleted the masked-mm branch November 25, 2019 07:03