[performance] FeatGraph TVM kernels support #2136
Conversation
* change edge_ids behavior and C++ impl * fix unittests; remove utils.Index in edge_id * pass mx and th tests * pass tf test * add aten::Scatter_ * Add nonzero; impl CSRGetDataAndIndices/CSRSliceMatrix * CSRGetData and CSRGetDataAndIndices passed tests * CSRSliceMatrix basic tests * fix bug in empty slice * CUDA CSRHasDuplicate * has_node; has_edge_between * predecessors, successors * deprecate send/recv; fix send_and_recv * deprecate send/recv; fix send_and_recv * in_edges; out_edges; all_edges; apply_edges * in deg/out deg * subgraph/edge_subgraph * adj * in_subgraph/out_subgraph * sample neighbors * set/get_n/e_repr * wip: working on refactoring all idtypes * pass ndata/edata tests on gpu * fix * stash * workaround nonzero issue * stash * nx conversion * test_hetero_basics except update routines * test_update_routines * test_hetero_basics for pytorch * more fixes * WIP: flatten graph * wip: flatten * test_flatten * test_to_device * fix bug in to_homo * fix bug in CSRSliceMatrix * pass subgraph test * fix send_and_recv * fix filter * test_heterograph * passed all pytorch tests * fix mx unittest * fix pytorch test_nn * fix all unittests for PyTorch * passed all mxnet tests * lint * fix tf nn test * pass all tf tests * lint * lint * change deprecation * try fix compile * lint * update METIDS * fix utest * fix * fix utests * try debug * revert * small fix * fix utests
* upd * upd * upd * fix * upd * upd * upd * upd * upd * trigger * +1s
* mutation add_nodes and add_edges * Add support for remove_edges, remove_nodes, add_selfloop, remove_selfloop * Fix Co-authored-by: Ubuntu <[email protected]>
* add nodesy * All three * Fix * lint * Add some test case * Fix * Fix * Fix * Fix * Fix * Fix * fix * triger * Fix * fix Co-authored-by: Ubuntu <[email protected]>
update sparse, add cache for partitioned graphs, pass tests
Congratulations, and thanks for your contribution to this PR.
I've left some comments; please address these concerns.
I suggest preserving the old kernel code instead of deleting it. When users want to use TVM kernels, the gspmm call would be dispatched to gspmm_featgraph, and likewise for gsddmm.
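A minimal sketch of that dispatch idea, assuming a hypothetical opt-in flag and stubbed kernel bodies; none of the names below are DGL's actual API, they only illustrate how gspmm could route to gspmm_featgraph:

# Illustrative only: USE_TVM_KERNELS, _gspmm_builtin and the signatures here
# are placeholders, not real DGL symbols.
USE_TVM_KERNELS = False  # hypothetical user-facing switch for TVM kernels

def _gspmm_builtin(gidx, op, reduce_op, ufeat, efeat):
    # The existing built-in kernel would be invoked here.
    raise NotImplementedError

def gspmm_featgraph(gidx, op, reduce_op, ufeat, efeat):
    # The FeatGraph/TVM-compiled kernel would be invoked here.
    raise NotImplementedError

def gspmm(gidx, op, reduce_op, ufeat, efeat):
    """Dispatch g-SpMM to the FeatGraph kernel when the user opts in."""
    if USE_TVM_KERNELS:
        return gspmm_featgraph(gidx, op, reduce_op, ufeat, efeat)
    return _gspmm_builtin(gidx, op, reduce_op, ufeat, efeat)

# gsddmm would be dispatched to gsddmm_featgraph in the same way.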
# pass edge_mapping to tvm only when array packing will be used
use_idx = edge_shuffled and num_feat_partitions > 1 and not use_bcast and use_e
f_input = [indptr, indices]
key = (num_rows, num_cols, nnz, op, reduce_op, u_shp, e_shp, use_idx, \
Used for indexing, should find a more elegant way.
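For context, a key tuple like the one above typically serves to memoize compiled kernels; a hedged sketch of that pattern follows (the cache dict and helper name are hypothetical, not code from this PR):

# Hypothetical memoization helper; _COMPILED_KERNELS and get_or_compile are
# illustrative names, not symbols from python/dgl/sparse.py.
_COMPILED_KERNELS = {}

def get_or_compile(key, compile_fn):
    """Return the kernel cached under `key`, compiling it on the first miss."""
    if key not in _COMPILED_KERNELS:
        _COMPILED_KERNELS[key] = compile_fn()
    return _COMPILED_KERNELS[key]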
python/dgl/sparse.py
Outdated
# num_row_partitions, num_col_partitions = 2, 2
# num_feat_partitions = 2
if target == 'cuda' and (num_row_partitions > 1 or num_col_partitions > 1 or num_feat_partitions > 1):
    print('Partitioning not supported on GPU')
Replace this with logging.
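A sketch of what the suggested change could look like, using the standard-library logging module; the wrapper function and logger name are illustrative, not the actual code in this PR:

import logging

logger = logging.getLogger(__name__)

def _check_gpu_partitioning(target, num_row_partitions, num_col_partitions,
                            num_feat_partitions):
    # Warn through the logger, rather than print, when partitioning is
    # requested for a CUDA target.
    if target == 'cuda' and (num_row_partitions > 1 or num_col_partitions > 1
                             or num_feat_partitions > 1):
        logger.warning('Partitioning not supported on GPU')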
else:
    raise NotImplementedError
# check if used for sampling
generic_shape = nnz == 0 and num_rows == 0 and num_cols == 0
If the user would like to deal with a generic shape, do nnz, num_rows, and num_cols all need to be set to zero?
Right. We can use another flag, if you prefer.
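One possible shape of the "another flag" idea, sketched as a hypothetical explicit keyword argument instead of the all-zeros convention; the function and its return value are purely illustrative:

# Hypothetical alternative: an explicit generic_shape flag instead of
# encoding it as nnz == num_rows == num_cols == 0. Names are illustrative.
def get_kernel_spec(num_rows, num_cols, nnz, generic_shape=False):
    """Return (shape, is_generic) describing how the kernel should be built."""
    if generic_shape:
        # Symbolic shapes: one kernel reused across workloads (e.g. sampling).
        return None, True
    # Concrete shapes: kernel specialized to this exact workload.
    return (num_rows, num_cols, nnz), False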
fix some type bugs; eliminate reshape operator
make dimensions of the feature tensor have the same type; minor fixes
LGTM, @jermainewang I think this PR is ready to be merged.
Let's finish the remaining stuff in future PRs:
- support sampling.
- support fp16.
- support GE-SpMM.
Things to do before merging this PR.
After discussion, we decided to use an AOT solution (see #2367).
Description
This branch merges the FeatGraph TVM kernels and implements the functions necessary to use them. When complete, it should be able to compile optimal kernels for a specific workload just-in-time through TVM, performing partitioning when necessary.
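For readers unfamiliar with the just-in-time flow, here is a minimal, self-contained TVM example of compiling a kernel on demand; the toy elementwise kernel below is only an illustration, not one of the FeatGraph kernels added by this PR, and it assumes a TVM build exposing the te API (as was current when this PR was written):

import numpy as np
import tvm
from tvm import te

# Declare a tiny elementwise kernel symbolically ...
n = te.var('n')
a = te.placeholder((n,), name='a', dtype='float32')
b = te.placeholder((n,), name='b', dtype='float32')
c = te.compute((n,), lambda i: a[i] + b[i], name='c')

# ... and compile it just-in-time for a CPU target.
s = te.create_schedule(c.op)
kernel = tvm.build(s, [a, b, c], target='llvm')

x = tvm.nd.array(np.arange(4, dtype='float32'))
y = tvm.nd.array(np.ones(4, dtype='float32'))
z = tvm.nd.array(np.empty(4, dtype='float32'))
kernel(x, y, z)  # z now holds x + y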
Checklist