[Performance][Feature] Implement edge excluding in EdgeDataLoader on GPU #3226

nv-dlasalle · 2021-08-06T19:46:40Z

Description

When the output device for the dataloader is a GPU, this enable making use of the cuda hash table implementation to find the set of edges to remove.

In my testing using /examples/pytorch/graphsage/train_sampling_unsupervised.py, this reduced the time to find the edges from 148ms using numpy's isin() method, to 2.0ms to copy the edge ids to the GPU and check them against the excluded edges (74x speedup). The majority of this is performing the CPU to GPU copy. In terms of just checking the edges, it takes just 200us.

Although I believe outside of the scope of this PR, the CPU side of things would benefit from a similar strategy.

Checklist

Please feel free to remove inapplicable items for your PR.

The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature]])
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the best of my knowledge, examples are either not affected by this change,
or have been fixed to be compatible with this change

Changes

Add Filter class to store the hashmap of edges to be excluded. This speeds up the process when there are multiple layers, as opposed to generating the same hashmap for each layer.

This also adds a unit test.

dgl-bot · 2021-08-06T19:47:40Z

To trigger regression tests:

@dgl-bot run [instance-type] [which tests] [compare-with-branch];
For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

BarclayII · 2021-08-10T01:22:32Z

@zheng-da How do you like it combined with #2971? Ideally I would like to see the entry point of CPU and GPU implementations unified

BarclayII

After some inspection I found the PR largely orthogonal to #2971 since it is just moving the edge ID exclusion postprocessing to C++/CUDA. Although if the latter was merged then we probably want to skip the edge ID exclusion postprocessing.

There's a potential conflict with #2971 though since the latter is moving the edge ID exclusion into sample_frontier.

BarclayII · 2021-08-12T02:34:52Z

python/dgl/utils/filter.py

+
+
+class Filter(object):
+    """Class used to either find either find the subset of ids that are in this


"either find either find". Also we usually capitalize "ids" as "IDs".

Fixed in
fe0bc66.

BarclayII · 2021-08-12T02:35:04Z

python/dgl/utils/filter.py

+
+class Filter(object):
+    """Class used to either find either find the subset of ids that are in this
+    filter, or th subset of ids that are not in this filter,


Fixed in
fe0bc66.

BarclayII · 2021-08-12T02:40:16Z

python/dgl/dataloading/dataloader.py

+        self._exclude_eids = None
+        self._filter = None
+
+        if device == F.cpu():


Maybe add a TODO for CPU? If the filters are implemented for CPU then we won't use _locate_eids_to_exclude right?

Added in
aa21891.

…GPU (#3226) * Update filter code * Add unit tests * Fixes * Switch to indices * Rename functions * Fix linting * Fix whitespace * Add doc * Fix heterograph * Change workspace allocation * Fix linting * Fix docs in filter.py * Add todo Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

nv-dlasalle added 6 commits August 5, 2021 16:29

Update filter code

03f335f

Add unit tests

7390e10

Fixes

21d39d1

Switch to indices

a7988e9

Rename functions

6348ceb

Fix linting

79ef115

nv-dlasalle requested a review from BarclayII August 6, 2021 19:46

nv-dlasalle added 5 commits August 6, 2021 17:40

Fix whitespace

2f6f0a0

Add doc

a27de65

Fix heterograph

5e4774b

Change workspace allocation

5679045

Fix linting

5717246

Merge branch 'master' into edge_dataloader

074b282

BarclayII approved these changes Aug 12, 2021

View reviewed changes

BarclayII reviewed Aug 12, 2021

View reviewed changes

BarclayII requested a review from zheng-da August 12, 2021 02:40

nv-dlasalle and others added 5 commits August 12, 2021 10:23

Fix docs in filter.py

fe0bc66

Add todo

aa21891

Merge branch 'master' into edge_dataloader

a02e9c9

Merge branch 'master' into edge_dataloader

de6d157

Merge branch 'master' into edge_dataloader

247ebec

zheng-da approved these changes Aug 19, 2021

View reviewed changes

zheng-da merged commit f634950 into dmlc:master Aug 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Performance][Feature] Implement edge excluding in EdgeDataLoader on GPU #3226

[Performance][Feature] Implement edge excluding in EdgeDataLoader on GPU #3226

nv-dlasalle commented Aug 6, 2021

dgl-bot commented Aug 6, 2021

BarclayII commented Aug 10, 2021

BarclayII left a comment •

edited

Loading

BarclayII Aug 12, 2021

nv-dlasalle Aug 12, 2021

BarclayII Aug 12, 2021

nv-dlasalle Aug 12, 2021

BarclayII Aug 12, 2021

nv-dlasalle Aug 12, 2021



		class Filter(object):
		"""Class used to either find either find the subset of ids that are in this

[Performance][Feature] Implement edge excluding in EdgeDataLoader on GPU #3226

[Performance][Feature] Implement edge excluding in EdgeDataLoader on GPU #3226

Conversation

nv-dlasalle commented Aug 6, 2021

Description

Checklist

Changes

dgl-bot commented Aug 6, 2021

BarclayII commented Aug 10, 2021

BarclayII left a comment • edited Loading

Choose a reason for hiding this comment

BarclayII Aug 12, 2021

Choose a reason for hiding this comment

nv-dlasalle Aug 12, 2021

Choose a reason for hiding this comment

BarclayII Aug 12, 2021

Choose a reason for hiding this comment

nv-dlasalle Aug 12, 2021

Choose a reason for hiding this comment

BarclayII Aug 12, 2021

Choose a reason for hiding this comment

nv-dlasalle Aug 12, 2021

Choose a reason for hiding this comment

BarclayII left a comment •

edited

Loading