International Journal of Intelligent Systems

Volume 37, Issue 10 pp. 7502-7525

RESEARCH ARTICLE

Measuring and sampling: A metric-guided subgraph learning framework for graph neural network

Jiyang Bai,

Jiyang Bai

orcid.org/0000-0002-0621-8815

Department of Computer Science, Florida State University, Tallahassee, Florida, USA

Search for more papers by this author

Yuxiang Ren,

Corresponding Author

Yuxiang Ren

[email protected]

IFM Lab, Department of Computer Science, Florida State University, Tallahassee, Florida, USA

Correspondence Yuxiang Ren, Department of Computer Science, Florida State University, Tallahassee, FL 32306, USA.

Email: [email protected]

Search for more papers by this author

Jiawei Zhang,

Jiawei Zhang

IFM Lab, Department of Computer Science, University of California, Davis, Davis, California, USA

Search for more papers by this author

Jiyang Bai,

Jiyang Bai

orcid.org/0000-0002-0621-8815

Department of Computer Science, Florida State University, Tallahassee, Florida, USA

Search for more papers by this author

Yuxiang Ren,

Corresponding Author

Yuxiang Ren

[email protected]

IFM Lab, Department of Computer Science, Florida State University, Tallahassee, Florida, USA

Correspondence Yuxiang Ren, Department of Computer Science, Florida State University, Tallahassee, FL 32306, USA.

Email: [email protected]

Search for more papers by this author

Jiawei Zhang,

Jiawei Zhang

IFM Lab, Department of Computer Science, University of California, Davis, Davis, California, USA

Search for more papers by this author

First published: 28 April 2022

https://doi.org/10.1002/int.22891

Citations: 2

Jiyang Bai and Yuxiang Ren should be considered joint first author.

Share a link

Email
Wechat
Bluesky

Abstract

Graph neural networks (GNNs) have shown convincing performance in learning powerful node representations that preserve both node attributes and graph structural information. However, many GNNs encounter problems in effectiveness and efficiency when they are designed with a deeper network structure or handle large-sized graphs. Several sampling algorithms have been proposed for improving and accelerating the training of GNNs, yet they ignore understanding the source of GNNs performance gain. The measurement of information within graph data can help the sampling algorithms to keep high-value information while removing redundant information and even noise. In this paper, we propose a Metric-Guided (MeGuide) subgraph learning framework for GNNs. MeGuide employs two novel metrics: Feature Smoothness and Connection Failure Distance to guide the subgraph sampling and mini-batch based training. Feature Smoothness is designed for analyzing the feature of nodes to retain the most valuable information, while Connection Failure Distance can measure the structural information to control the size of subgraphs. We demonstrate the effectiveness and efficiency of MeGuide in training various GNNs on multiple data sets.

Open Research

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available at https://github.com/tkipf/gcn/tree/master/gcn/data, and https://github.com/GraphSAINT/GraphSAINT, reference number.^{14, 19, 43}

REFERENCES

1Zhang J. Social network fusion and mining: a survey. arXiv preprint arXiv:180409874. 2018.
Google Scholar
2Ren Y, Zhang J. HGAT: hierarchical graph attention network for fake news detection. arXiv. 2020;p. arXiv–2002.
Google Scholar
3Wang Q, Mao Z, Wang B, Guo L. Knowledge graph embedding: a survey of approaches and applications. IEEE Trans Knowl Data Eng. 2017; 29(12): 2724-2743.
10.1109/TKDE.2017.2754499
Web of Science® Google Scholar
4Bai J, Ren Y, Zhang J. Ripple walk training: a subgraph-based training framework for large and deep graph neural network. IJCNN; 2021.
Google Scholar
5Xu K, Hu W, Leskovec J, Jegelka S. How powerful are graph neural networks? ICLR; 2019.
Google Scholar
6Chiang WL, Liu X, Si S, Li Y, Bengio S, Hsieh CJ. Cluster-GCN: an efficient algorithm for training deep and large graph convolutional networks. KDD; 2019.
Google Scholar
7Zhao L, Akoglu L. PairNorm: tackling oversmoothing in GNNs. arXiv:190912223. 2019.
Google Scholar
8Li Q, Han Z, Wu XM. Deeper insights into graph convolutional networks for semi-supervised learning. AAAI; 2018.
Google Scholar
9Rong Y, Huang W, Xu T, Huang J. DropEdge: towards the very deep graph convolutional networks for node classification. arXiv:1907.10903; 2019.
Google Scholar
10Hamilton W, Ying Z, Leskovec J. Inductive representation learning on large graphs. NIPS; 2017.
Google Scholar
11Chen J, Ma T, Xiao C. Fastgcn: fast learning with graph convolutional networks via importance sampling. arXiv:180110247. 2018.
Google Scholar
12Chen J, Zhu J, Song L. Stochastic training of graph convolutional networks with variance reduction. ICML; 2018.
Google Scholar
13Zou D, Hu Z, Wang Y, Jiang S, Sun Y, Gu Q. Layer-dependent importance sampling for training deep and large graph convolutional networks. NeurIPS; 2019.
Google Scholar
14Zeng H, Zhou H, Srivastava A, Kannan R, Prasanna V. Graphsaint: graph sampling based inductive learning method. arXiv preprint arXiv:190704931. 2019.
Google Scholar
15Hou Y, Zhang J, Cheng J, et al. Measuring and improving the use of graph information in graph neural networks. International Conference on Learning Representations; 2019.
Google Scholar
16Bruna J, Zaremba W, Szlam A, LeCun Y. Spectral networks and locally connected networks on graphs. arXiv:13126203. 2013.
Google Scholar
17Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y. Graph attention networks. ICLR; 2018.
Google Scholar
18Monti F, Boscaini D, Masci J, Rodola E, Svoboda J, Bronstein MM. Geometric deep learning on graphs and manifolds using mixture model cnns. CVPR; 2017.
Google Scholar
19Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. ICLR; 2017.
Google Scholar
20Xinyi Z, Chen L. Capsule graph neural network. International Conference on Learning Representations; 2018.
Google Scholar
21Sun FY, Hoffmann J, Verma V, Tang J. Infograph: unsupervised and semi-supervised graph-level representation learning via mutual information maximization. arXiv preprint arXiv:190801000. 2019.
Google Scholar
22Ying Z, You J, Morris C, Ren X, Hamilton W, Leskovec J. Hierarchical graph representation learning with differentiable pooling. Proceedings on Advances in Neural Information Processing Systems; 2018: 4800-4810.
Google Scholar
23Ren Y, Bai J, Zhang J. Label contrastive coding based graph neural network for graph classification. Database Systems for Advanced Applications. Springer International Publishing; 2021: 123-140.
10.1007/978-3-030-73194-6_10
Google Scholar
24Zhou J, Cui G, Hu S, et al. Graph neural networks: a review of methods and applications. AI Open. 2020; 1: 57-81.
10.1016/j.aiopen.2021.01.001
Web of Science® Google Scholar
25Wu Z, Pan S, Chen F, Long G, Zhang C, Philip SY. A comprehensive survey on graph neural networks. IEEE Trans Neural Network Learn Syst. 2020; 32(1): 4-24.
10.1109/TNNLS.2020.2978386
Web of Science® Google Scholar
26Defferrard M, Bresson X, Vandergheynst P. Convolutional neural networks on graphs with fast localized spectral filtering. NIPS; 2016.
Google Scholar
27Levie R, Monti F, Bresson X, Bronstein MM. Cayleynets: graph convolutional neural networks with complex rational spectral filters. IEEE Transactions on Signal Processing. 2018.
Google Scholar
28Liao R, Zhao Z, Urtasun R, Zemel RS. Lanczosnet: multi-scale deep graph convolutional networks. arXiv:190101484. 2019.
Google Scholar
29Henaff M, Bruna J, LeCun Y. Deep convolutional networks on graph-structured data. arXiv:150605163. 2015.
Google Scholar
30Li R, Wang S, Zhu F, Huang J. Adaptive graph convolutional neural networks. AAAI; 2018.
Google Scholar
31Ying R, He R, Chen K, Eksombatchai P, Hamilton WL, Leskovec J. Graph convolutional neural networks for web-scale recommender systems. KDD; 2018.
Google Scholar
32Gao H, Wang Z, Ji S. Large-scale learnable graph convolutional networks. KDD; 2018.
Google Scholar
33Xu K, Li C, Tian Y, Sonobe T, Kawarabayashi Ki, Jegelka S. Representation learning on graphs with jumping knowledge networks. ICML; 2018.
Google Scholar
34Lee JB, Rossi RA, Kong X, Kim S, Koh E, Rao A. Graph convolutional networks with motif-based attention. CIKM; 2019.
Google Scholar
35Klicpera J, Bojchevski A, Günnemann S. Predict then propagate: graph neural networks meet personalized PageRank. 2019.
Google Scholar
36Haonan L, Huang SH, Ye T, Xiuyan G. Graph star net for generalized multi-task learning. arXiv:190612330. 2019.
Google Scholar
37Abu-El-Haija S, Perozzi B, Kapoor A. Mixhop: higher-order graph convolution architectures via sparsified neighborhood mixing. arXiv:190500067. 2019.
Google Scholar
38Chen M, Wei Z, Ding B, et al. Scalable graph neural networks via bidirectional propagation. arXiv preprint arXiv:201015421. 2020.
Google Scholar
39Zhang J, Meng L. GResNet: graph residual network for reviving deep GNNs from suspended animation. arXiv:1909.05729; 2019.
Google Scholar
40Kullback S, Leibler RA. On information and sufficiency. Annal Math Stat. 1951; 22(1): 79-86.
10.1214/aoms/1177729694
Web of Science® Google Scholar
41Zhang J, Meng L. GResNet: graph residual network for reviving deep GNNs from suspended animation. arXiv preprint arXiv:190905729. 2019.
Google Scholar
42Huang W, Rong Y, Xu T, Sun F, Huang J. Tackling over-smoothing for general graph convolutional networks. arXiv preprint arXiv:200809864. 2020.
Google Scholar
43Sen P, Namata G, Bilgic M, Getoor L, Galligher B, Eliassi-Rad T. Collective classification in network data. AI Magazine. 2008.
Google Scholar
44McAuley J, Leskovec J. Image labeling on a network: using social-network metadata for image classification. European Conference on Computer Vision. Springer; 2012: 828-841.
10.1007/978-3-642-33765-9_59
Google Scholar
45Kingma DP, Ba JL. ADAM: a method for stochastic optimization. ICLR; 2015.
Google Scholar

Citing Literature

All articles

Measuring and sampling: A metric-guided subgraph learning framework for graph neural network

Abstract

Open Research

DATA AVAILABILITY STATEMENT

REFERENCES

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Measuring and sampling: A metric-guided subgraph learning framework for graph neural network

Abstract

Open Research

DATA AVAILABILITY STATEMENT

REFERENCES

Citing Literature

References

Related

Information