[example] Create EEG-GCNN example. #3186

JOHNW02 · 2021-07-26T03:25:04Z

Description

This is an EEG-GCNN example using DGL library. The original code using pytorch_geometric can be found here.

Checklist

Please feel free to remove inapplicable items for your PR.

The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature]])
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the my best knowledge, examples are either not affected by this change,
or have been fixed to be compatible with this change
Related issue is referred in this PR
If the PR is for a new model/paper, I've updated the example index here.

Changes

dgl-bot · 2021-07-26T03:26:11Z

To trigger regression tests:

@dgl-bot run [instance-type] [which tests] [compare-with-branch];
For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

examples/pytorch/eeg-gcnn/.gitignore

examples/pytorch/eeg-gcnn/README.md

mufeili · 2021-07-27T05:40:19Z

examples/pytorch/eeg-gcnn/README.md

+This DGL example implements the EEG-GCNN model proposed in the paper [EEG-GCNN](https://arxiv.org/abs/2011.12107). The original code is [here](https://github.com/neerajwagh/eeg-gcnn).
+
+## All References
+- Paper can also be found on [PMLR](http://proceedings.mlr.press/v136/wagh20a.html).


Is this one the same as the ArXiv version? If so, we can probably remove it and just mention it in the citation entry at the end.

Hi Mufei,

This is a simplified version. The original code has precomputed patient indices. In this example, I removed the dependency on the indices.

I see, thanks. Then perhaps you can mention explicitly that this is a simplified version.

examples/pytorch/eeg-gcnn/README.md

mufeili · 2021-07-27T05:59:43Z

examples/pytorch/eeg-gcnn/README.md

+First, download the precomputed data, labels, indices and put them in the repo. <br>
+Then run 
+```python
+python main.py --num_feats --num_nodes --gpu_idx --num_epochs --exp_name --batch_size


There's no need to add --num_feats --num_nodes --gpu_idx --num_epochs --exp_name --batch_size as you have a default value for them.

mufeili · 2021-07-27T06:01:29Z

examples/pytorch/eeg-gcnn/main.py

+    num_feats = args.num_feats
+
+    # set up input and targets from files
+    memmap_x = f'norm_X'


I checked the preprocessed dataset here, which does not seem to have the file norm_X.

We haven't updated Figshare yet. We were dealing with a bug in the original code. So, some files need to be updated. We will do that next month, including norm_X. Also, norm_X is the normalized version of psd_featureds_data_X.

In that case, we will need to wait for the files to be updated before we merge the PR.

mufeili · 2021-07-27T06:06:39Z

Can you add an entry for this example in the indexing page here?

mufeili · 2021-07-27T06:11:10Z

examples/pytorch/eeg-gcnn/README.md

+|      DGL          | AUC         | Precision     | Recall       | F-1          | Bal. Accuracy |
+|-------------------|-------------|---------------|--------------|--------------|---------------|
+| Shallow EEG-GCNN  | 0.875(0.036)| 0.980(0.013)  | 0.735(0.055) | 0.839(0.035) | 0.811(0.035)  |
+| Deep EEG-GCNN     | 0.890(0.004)| 0.988(0.004)  | 0.723(0.035) | 0.834(0.022) | 0.829(0.005)  |


How can a user reproduce the numbers here using python main.py?

I don't think they can. We are planning to update the original repo with both pytorch_geometric and dgl implementations, and release the trainning code as well. This should be done sometime next month too. For this example, can I just point people to the original repo for reproducing the stats?

We expect a DGL example to be self-contained and able to reproduce the results reported in README. In that case, either we can wait till you get everything ready for this PR or we can simply close the PR and add a pointer to your own repo once you get everything ready there.

I see. Let me talk to Neeraj about this.

Hi @mufeili, maybe the following might clarify the design of the PR:

We built the PR with the following assumption: that people can go to https://github.com/neerajwagh/eeg-gcnn for reproducing published results on the original data while the DGL example here would enable others to bring their own datasets (X, y) and train our model using the data they care about. Compared to our original PyT implementation, here we have skipped: a) subject-level data splitting, b) subject-level aggregate evaluation, c) cross-validation. This makes the example a "simplified" version and therefore will not be able to exactly reproduce published results. Nevertheless, we can update main.py and the results table so that main.py can reproduce the new "simplified" table made from a simpler model evaluation scheme (as long as users pull X, y from FigShare). We can close the PR after that change while keeping a pointer to the PyT repo for the full/"complex" implementation. We'll mention this simplification in the README so users are aware of the difference.

We can fix the absence of norm_X in FigShare by adding normalization after X is read in the code. (instead of updating the uploaded FigShare files).

Let me know if that works for you.

Meanwhile, the updates we make to our original PyT repo will remain independent of this example and this example will remain self-contained.

Hi @neerajwagh, the proposal sounds good to me in general. To be more specific, this example does not need to fully reproduce the experiments reported in your paper and I'm good as long as a user can run the script following the instructions in the README file and get similar numbers reported there.

@mufeili Sounds good! We'll make changes accordingly then. Thanks.

mufeili · 2021-07-27T06:14:40Z

examples/pytorch/eeg-gcnn/README.md

+
+### Contact
+
+- Issues regarding non-reproducibility of results or support with the codebase should be emailed to _wei33@illinois.edu_


How about

emailed to John(wei33@illinois.edu)

You may also contact the authors:

Neeraj: nwagh2@illinois.edu / Website / Twitter / Google Scholar

Yoga: varatha2@illinois.edu / Website / Google Scholar

…rking

examples/pytorch/eeg-gcnn/README.md

examples/pytorch/eeg-gcnn/main.py

examples/pytorch/eeg-gcnn/shallow_EEGGraphConvNet.py

examples/pytorch/eeg-gcnn/deep_EEGGraphConvNet.py

examples/pytorch/eeg-gcnn/EEGGraphDataset.py

examples/pytorch/eeg-gcnn/main.py

examples/pytorch/eeg-gcnn/README.md

mufeili · 2021-08-10T08:45:34Z

Thank you for the great job! This PR is ready to merge once we hear back from Neeraj and resolve the last conversation.

mufeili

LGTM

* Create EEG-GCNN example. * Update README.md * Remove gitignore file. * Update README.md * change 'datas' to 'datasets'. * Change train.py to main.py * Added an entry in the indexing page. * State "simplified version"; change how to run. * Fix bug in contact * Remove paper link in reference. * Create working branch * Add normalization of x. * Update paper link and tags * Update paper link in readme * Update readme; add patient level indices * Update readme. Add comments to models * Update README.md * change to with; specify location for ch and el; move note * fix bug for note * Add args for models; clean code. * delete = in readme * Add reference for spec_coh_values

Create EEG-GCNN example.

991e027

mufeili self-requested a review July 27, 2021 02:34

mufeili reviewed Jul 27, 2021

View reviewed changes

examples/pytorch/eeg-gcnn/.gitignore Outdated Show resolved Hide resolved

mufeili reviewed Jul 27, 2021

View reviewed changes

examples/pytorch/eeg-gcnn/README.md Outdated Show resolved Hide resolved

mufeili reviewed Jul 27, 2021

View reviewed changes

examples/pytorch/eeg-gcnn/README.md Outdated Show resolved Hide resolved

mufeili reviewed Jul 27, 2021

View reviewed changes

examples/pytorch/eeg-gcnn/README.md Outdated Show resolved Hide resolved

JOHNW02 and others added 4 commits July 27, 2021 13:08

Update README.md

d4ebee3

Remove gitignore file.

b90b537

Update README.md

41915c9

change 'datas' to 'datasets'.

e1d0dd1

mufeili reviewed Jul 27, 2021

View reviewed changes

examples/pytorch/eeg-gcnn/README.md Outdated Show resolved Hide resolved

Change train.py to main.py

904d50f

mufeili reviewed Jul 27, 2021

View reviewed changes

JOHNW02 and others added 11 commits July 27, 2021 16:33

Added an entry in the indexing page.

a1d2d4d

State "simplified version"; change how to run.

a0fe820

Fix bug in contact

f2a6e92

Remove paper link in reference.

320194b

Create working branch

b76cc94

Add normalization of x.

6a28faa

Update paper link and tags

8f19917

Update paper link in readme

4480c89

Update readme; add patient level indices

a5a6e8c

Merge branch 'working' of https://hub.fastgit.org/JOHNW02/dgl into wo…

40f1fe4

…rking

Merge branch 'dmlc:master' into master

8127cf9