https://github.com/Data-Centric-GraphML/awesome-papers

A collection of papers and resources about Data-centric Graph Machine Learning (DC-GML).
https://github.com/Data-Centric-GraphML/awesome-papers
List: awesome-papers
Last synced: 6 months ago
JSON representation
A collection of papers and resources about Data-centric Graph Machine Learning (DC-GML).
Host: GitHub
URL: https://github.com/Data-Centric-GraphML/awesome-papers
Owner: Data-Centric-GraphML
License: mit
Created: 2023-08-02T13:47:42.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2023-11-01T02:25:17.000Z (over 1 year ago)
Last Synced: 2024-05-20T13:02:32.139Z (about 1 year ago)
Homepage:
Size: 13 MB
Stars: 42
Watchers: 2
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

ultimate-awesome - awesome-papers - A collection of papers and resources about Data-centric Graph Machine Learning (DC-GML). (Other Lists / Julia Lists)
README

        # Awesome-Data-Centric-GraphML

A collection of papers and resources about Data-centric Graph Machine Learning (DC-GML).

We undertake a comprehensive **review** and provide a promising **outlook** for data-centric graph machine learning (DC-GML), and propose a systematic framework for DC-GML that encompasses all stages of the graph data lifecycle, including graph data collection, exploration, improvement, exploitation, and maintenance. More details can be found in our review & outlook work: https://arxiv.org/abs/2309.10979

```

@article{zheng2023towards,

  title={Towards Data-centric Graph Machine Learning: Review and Outlook},

  author={Zheng, Xin and Liu, Yixin and Bao, Zhifeng and Fang, Meng and Hu, Xia and Liew, Alan Wee-Chung and Pan, Shirui},

  journal={arXiv preprint arXiv:2309.10979},

  year={2023}}

```







## Updates

- **2023.11**

  - Invited to give a tutorial in Australia Database Conference (ADC), 2023. The tutorial slides can be found in the folder.

Table of Contents

=================

* [Awesome-Data-Centric-GraphML](#awesome-data-centric-graphml)

   * [How To Enhance Graph Data Availability and Quality?](#how-to-enhance-graph-data-availability-and-quality)

      * [Graph Structure Enhancement](#graph-structure-enhancement)

         * [Graph Structure Learning](#graph-structure-learning)

         * [Graph Sparsification](#graph-sparsification)

         * [Graph Diffusion](#graph-diffusion)

      * [Graph Feature Enhancement](#graph-feature-enhancement)

         * [Graph Feature Completion](#graph-feature-completion)

         * [Graph Feature Denoising](#graph-feature-denoising)

      * [Graph Label Enhancement](#graph-label-enhancement)

         * [Graph Pseudo-labeling](#graph-pseudo-labeling)

         * [Graph Label Denoising](#graph-label-denoising)

         * [Graph Class-imbalanced Sampling](#graph-class-imbalanced-sampling)

      * [Graph Size Enhancement](#graph-size-enhancement)

         * [Graph Size Reduction](#graph-size-reduction)

         * [Graph Data Augmentation](#graph-data-augmentation)

   * [How To Learn From Graph Data With Limited-availability and Low-quality?](#how-to-learn-from-graph-data-with-limited-availability-and-low-quality)

      * [Graph Self-supervised Learning](#graph-self-supervised-learning)

      * [Graph Semi-supervised Learning](#graph-semi-supervised-learning)

      * [Graph Active Learning](#graph-active-learning)

      * [Graph Transfer Learning](#graph-transfer-learning)

   * [How To Build Graph MLOps System: The Graph Data-centric View.](#how-to-build-graph-mlops-system-the-graph-data-centric-view)

      * [Graph Data Collection](#graph-data-collection)

      * [Graph Data Exploration](#graph-data-exploration)

      * [Graph Data Maintenance](#graph-data-maintenance)

      * [Graph MLOps](#graph-mlops)

## How To Enhance Graph Data Availability and Quality?

The answer to this question corresponds to **'Graph Data Improvement'** stage in DC-GML framework, incorporating four aspects of graph data characteristics, i.e., *Graph Structure Enhancement, Graph Feature Enhancement, Graph Label Enhancement, and Graph Size Enhancement*.

### Graph Structure Enhancement

#### Graph Structure Learning

- [KDD'2020-Pro-GNN] Graph structure learning for robust graph neural networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3394486.3403049)

- [ICML'2019-LDS] Learning discrete structures for graph

neural networks. [[paper]](http://proceedings.mlr.press/v97/franceschi19a/franceschi19a.pdf)

- [WWW'2021-GEN] Graph structure estimation neural networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449952?casa_token=WBEXRhs6I_YAAAAA:cVk3EONP8EwXVzuUSZp8Qp-gZOLEVJYHDLV-hXTJ-gh5v-I_LuTYfvxKlk_Y5rOUfFKYZW9ty5xKg1w)

- [CVPR'2019-GLCN] Semi-supervised learning with graph learning convolutional networks. [[paper]](http://openaccess.thecvf.com/content_CVPR_2019/papers/Jiang_Semi-Supervised_Learning_With_Graph_Learning-Convolutional_Networks_CVPR_2019_paper.pdf)

- [NIPS'2020-IDGL] Iterative deep graph learning for graph neural networks: Better and robust node embeddings. [[paper]](https://proceedings.neurips.cc/paper/2020/file/e05c7ba4e087beea9410929698dc41a6-Paper.pdf)

#### Graph Sparsification

- [AIS'2016] Graph sparsification approaches for laplacian smoothing. [[paper]](http://proceedings.mlr.press/v51/sadhanala16.pdf)

- [SIGMOD'2011] Local graph sparsification for scalable clustering. [[paper]](https://dl.acm.org/doi/pdf/10.1145/1989323.1989399?casa_token=_CYKRbVjXTcAAAAA:WGIgqzcngwWBBkw9of3Wbjrc8JES8cAw39VQKKkTlVzB_MA_IQoCDyTd7rLe_1609i0wVXIPt8O3dpY)

- [SICOMP'2011] Spectral sparsification of graphs.[[paper]](https://arxiv.org/pdf/0808.4134)

- [NIPS'2019] On differentially private graph sparsification and applications. [[paper]](https://proceedings.neurips.cc/paper/2019/file/e44e875c12109e4fa3716c05008048b2-Paper.pdf)

- [ICDM'2022-GraphSparsify] A generic graph sparsification framework using deep reinforcement learning. [[paper]](https://ieeexplore.ieee.org/iel7/10027565/10027596/10027736.pdf?casa_token=ST116wBmpbMAAAAA:jfGOFwHoX4AsOiSUQYYNV6BUuC_mUKaPG5aKVTvSzK0PkJulHXAYk-Is9MzRSLISUhHZnQlIaEDN)

#### Graph Diffusion

- [ICLR'2019-PPNP/APPNP] Predict then propagate: graph neural networks meet personalized pagerank. [[paper]](https://arxiv.org/pdf/1810.05997)

- [NIPS'2019-GDC] Diffusion improves graph learning. [[paper]](https://proceedings.neurips.cc/paper/2019/file/23c894276a2c5a16470e6a31f4618d73-Paper.pdf)

- [ICLR'2021] Adaptive universal generalized pagerank graph neural network. [[paper]](https://arxiv.org/pdf/2006.07988)

- [NIPS'2021-ADC] Adaptive diffusion in graph neural networks. [[paper]](https://proceedings.neurips.cc/paper/2021/file/c42af2fa7356818e0389593714f59b52-Paper.pdf)

### Graph Feature Enhancement

#### Graph Feature Completion

- [NN'2020-GINN] Missing data imputation with adversarially-trained graph convolutional networks. [[paper]](https://www.sciencedirect.com/science/article/pii/S0893608020302185?casa_token=I7BtgyazKl8AAAAA:edsPKQbKSp1fbciuDc76aYnSPSS6T03ESUb_i2KDUJ6o2cA5LVCS-K8SfkhgWCqi5Bqog9hihvU)

- [FGCS'2021-GCN_MF] Graph convolutional networks for graphs containing missing features. [[paper]](https://www.sciencedirect.com/science/article/pii/S0167739X20330405)

- [TPAMI'2020-SAT] Learning on attribute-missing graphs. [[paper]](https://ieeexplore.ieee.org/iel7/34/4359286/09229522.pdf?casa_token=TsPVGT_mK-AAAAAA:XJ-eH3DCQ5OnkXvj5UhHBb8IC8O_I3WyworEudS9D2c-E7usbAovpaNswysThFSCPWWq94Az9EU-)

- [WWW'2021-HGNN-AC] Heterogeneous graph neural network via attribute completion. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449914?casa_token=N3XrnbFD-VsAAAAA:aJnfUQvXnC-7T2INzKCmo3kLYGa1qJ1Z_EgTPH5gSRblwPFvlUCxg1sXgmnOcBSRLR-VuE6_Cv-DLM8)

- [IEEETransCybern'2022-Amer] Amer: A new attribute-missing network embedding approach. [[paper]](https://ieeexplore.ieee.org/iel7/6221036/6352949/09765782.pdf?casa_token=TNBxp_Z_qf8AAAAA:VPKCn0964kuE4tJZPTvUG1xAP2MEy_QvA0ym6ry8Cvu28BuFqUkG2P9xYGmXDx13TUvC_6NBu5V1)

- [arxiv'2021-SAGA] Siamese attribute-missing graph auto-encoder. [[paper]](https://arxiv.org/pdf/2112.04842)

#### Graph Feature Denoising

- [SPM'2013] The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. [[paper]](https://ieeexplore.ieee.org/iel7/79/6494646/06494675.pdf?casa_token=mD3rZmt2imAAAAAA:dtVEN95FqjgPHlO9pQ_c-xTtDMPuknx3pdjPVIkffAMEkoCRW-26vAq8gbjlXCh1Q02tEUbPHg)

- [GlobalSIP'2014] Signal denoising on graphs via graph filtering. [[paper]](https://ieeexplore.ieee.org/iel7/7010655/7032060/07032244.pdf?casa_token=EyFQK1dNZTUAAAAA:wzIP7ZxTWQQHcy4pfRmk3_JrDv39r8VyIXueZ_Y25VVeXHixoOP0mIMckQAn2prDBZO6EnfpFM8P)

- [IET-SP'2018] Graph polynomial filter for signal denoising. [[paper]](https://scholar.google.com/scholar?output=instlink&q=info:N5LbqcYw_6AJ:scholar.google.com/&hl=zh-CN&as_sdt=0,5&scillfp=17975566870893489622&oi=lle)

- [AIS'2015] Trend filtering on graphs. [[paper]](http://proceedings.mlr.press/v38/wang15d.pdf)

- [ICASSP'2020] Graph auto-encoder for graph signal denoising. [[paper]](https://ieeexplore.ieee.org/iel7/9040208/9052899/09053623.pdf?casa_token=wp6nKhBq4eQAAAAA:VbeeBWniW1nbGLsSSTDaOlUt8Co50jVEejxpZQziCDzx8NnIJ_0ZPGVdJNi5GG9pRKAaA-rENrOL)

- [TSP'2021] Graph unrolling networks: Interpretable neural networks for graph signal denoising. [[paper]](https://ieeexplore.ieee.org/iel7/78/4359509/09453145.pdf?casa_token=WBFBG92sg-wAAAAA:CFc6t1fhHyEQleL3nCQiBE5XNjTGFqS80VXxkHSDPEPzS7OEed6ydOV7M0ZN-yKlrkljWt-sXKh7)

- [TSP'2022] Untrained graph neural networks for denoising. [[paper]](https://ieeexplore.ieee.org/iel7/78/4359509/09959969.pdf?casa_token=79riV29MxvkAAAAA:kEUgypSxBhsj1TqD-3vTPxUY1JoHN5E6vTh2uFkuJrSTiA7bGt6qIorsfuHB3kxczHZGkyH8E0Um)

- [WWW'2023-MAGNET] Robust graph representation learning for local corruption recovery. [[paper]](https://yuguangwang.github.io/papers/L_p_graph_regularizer_ICML%20TAG%202022.pdf)

### Graph Label Enhancement

#### Graph Pseudo-labeling

- [AAAI'2018] Deeper insights into graph convolutional networks for semi-supervised learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/11604/11463)

- [AAAI'2020] Multi-stage self-supervised learning for graph convolutional networks on graphs with few labeled nodes. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/download/6048/5904)

- [CIKM'2021-IFC-GCN] Rectifying pseudo labels: Iterative feature clustering for graph representation learning. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3459637.3482469?casa_token=j-229ROWIHwAAAAA:ubON8KC_ifOtZWU2MqrFg3ZAiloxu2JGcOWWSdIZcKj-qlfgeRzL0xclkB8DF6lxs_Qcmj1lz9UFTOo)

- [arXiv'2019-DSGCN] Dynamic self-training framework for graph convolutional networks. [[paper]](https://openreview.net/pdf?id=SJgCEpVtvr)

- [WSDM'2022-RS-GNN] Towards robust graph neural networks for noisy graphs with sparse labels. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3488560.3498408)

- [DMKD'2023-InfoGNN] Informative pseudo-labeling for graph neural networks with few labels. [[paper]](https://link.springer.com/article/10.1007/s10618-022-00879-4)

#### Graph Label Denoising

- [WSDM'2023-CLNode] CLNode: Curriculum learning for node classification. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3539597.3570385?casa_token=cNDTalzkl6IAAAAA:0-d5yP4lt002LAiXl9dHRFQ7iARGawdtBSZ4rVR29UeBHabh8yC7YvjAJdAac3SKLutBKX_HrN1CNts)

- [arXiv'2019-D-GNN] Learning graph neural networks with noisy labels. [[paper]](https://arxiv.org/pdf/1905.01591)

- [CIKM'2021-IFC-GCN] Rectifying pseudo labels: Iterative feature clustering for graph representation learning. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3459637.3482469?casa_token=j-229ROWIHwAAAAA:ubON8KC_ifOtZWU2MqrFg3ZAiloxu2JGcOWWSdIZcKj-qlfgeRzL0xclkB8DF6lxs_Qcmj1lz9UFTOo)

- [KDD'2021-NRGNN] Nrgnn: Learning a label noise resistant graph neural network on sparsely and noisily labeled graphs. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3447548.3467364)

- [WSDM'2023-RTGNN] Robust training of graph neural networks via noise governance. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3539597.3570369?casa_token=9LQOlvbtC2AAAAAA:qmY_CT3ipOgGls6v5El4psDQ8tCMZpZfkRiSfBVAI32kXMTNkH2p9ZTQYLC2yUVoOjZJDEB5g27sEDo)

#### Graph Class-imbalanced Sampling

- [WSDM'2021-GraphSMOTE] Graphsmote: Imbalanced node classification on graphs with graph neural networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3437963.3441720)

- [KDD'2021-ImGAGN] Imgagn: Imbalanced network embedding via generative adversarial graph networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3447548.3467334?casa_token=4G06vl98C9IAAAAA:YgtmaAMJO5uyHdy3XRNwiw9MQ-SwbVMNSC0M1FkVpbiw0n1XGkGkICXjl3urOfXVUDjsbjiwFWYhxOk)

- [WWW'2021-PC-GNN] Pick and choose: a GNN-based imbalanced learning approach for fraud detection. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449989)

- [WWW'2021-GraphMixup] Mixup for node and graph classification. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449796)

- [ICLR'2021-GraphENS] GraphENS: Neighbor-aware ego network synthesis for class-imbalanced node classification. [[paper]](https://openreview.net/pdf?id=MXEl7i-iru)

- [arXiv'2023-GraphSR] GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification. [[paper]](https://arxiv.org/pdf/2302.12814)

- [NIPS'2021-ReNode] Topology-imbalance learning for semi-supervised node classification. [[paper]](https://proceedings.neurips.cc/paper/2021/file/fa7cdfad1a5aaf8370ebeda47a1ff1c3-Paper.pdf)

- [arXiv'2022-TopoImb] TopoImb: Toward topology-level imbalance in learning from graphs. [[paper]](https://arxiv.org/pdf/2212.08689)

- [IJCAI'2013-igBoost] Graph classification with imbalanced class distributions and noise. [[paper]](https://www.researchgate.net/profile/Shirui-Pan-3/publication/262204599_Graph_classification_with_imbalanced_class_distributions_and_noise/links/564bc59d08ae3374e5dddb5b/Graph-classification-with-imbalanced-class-distributions-and-noise.pdf)

- [CIKM'2022-G2GNN] Imbalanced graph classification via graph-of-graph neural networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3511808.3557356?casa_token=gzP3pWpfsWMAAAAA:BPdp0A0Mh3tWlXo6i4Mvd5kfVo0geTqqfH_hOyCpAki9krAMdq6fZKCScHffiQdfq9mZSPYBRR85uL4)

### Graph Size Enhancement

#### Graph Size Reduction

- [ICML'2009-Herding] Herding dynamical weights to learn. [[paper]](https://dl.acm.org/doi/pdf/10.1145/1553374.1553517?casa_token=mSkkhW5jA84AAAAA:31mF183Hzi4X1un4BsuRLuaFVId7Febx4PXXQbbZ0uquebekHdDhncU3QhaH0zxb6ItnWYdWehkfhFM)

- [CVPR'2017-ICARL] ICARL: Incremental classifier and representation learning. [[paper]](https://openaccess.thecvf.com/content_cvpr_2017/papers/Rebuffi_iCaRL_Incremental_Classifier_CVPR_2017_paper.pdf)

- [ICLR'2018-K-center] Active learning for convolutional neural networks: A core-set approach. [[paper]](https://arxiv.org/pdf/1708.00489.pdf)

- [ICAIS'2020-Coarsening] Graph coarsening with preserved spectral properties. [[paper]](http://proceedings.mlr.press/v108/jin20a/jin20a.pdf)

- [arXiv'2021] Graph domain adaptation: A generative view. [[paper]](https://arxiv.org/pdf/2106.07482)

- [ICLR'2021-GCond] Graph condensation for graph neural networks. [[paper]](https://arxiv.org/pdf/2110.07580)

- [KDD'2022-DosCond] Condensing graphs via one-step gradient matching. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539429?casa_token=gijVmZummzYAAAAA:nq1yIq5wN-sLT-qsLavBYyys3163tZNUB_3Hj7bOC6bkE4wP5BHZEcf-faFnk5djxCujlvTLQ3BSnlc)

- [NeurIPS-Workshop'2022] Faster hyperparameter search on graphs via calibrated dataset condensation. [[paper]](https://openreview.net/pdf?id=wcbgjg0X7LJ)

- [arXiv'2023-SFGC] Structure-free graph condensation: From large-scale graphs to condensed graph-free data. [[paper]](https://arxiv.org/pdf/2306.02664)

#### Graph Data Augmentation

- [ACM SIGKDD Explorations Newsletter'2022-Survey]  Data augmentation for deep graph learning: A survey. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3575637.3575646?casa_token=RTnILzVw3x4AAAAA:la46qhiVu6kPiJ1k4dopSZTYb0iA4oo2r8sIEXSGUUkoTW5byrlx-9VVf5-OOkoc0cDfcC-GOYpzc2k)

- [arXiv'2202-Survey] Graph data augmentation for graph machine learning: A survey. [[paper]](https://arxiv.org/pdf/2202.08871)

- [ICLR'2020-DropEdge] DropEdge: Towards deep graph convolutional networks on node classification. [[paper]](https://arxiv.org/pdf/1907.10903)

- [NeurIPS'2020-GRAND] Graph random neural networks for semi-supervised learning on graphs. [[paper]](https://proceedings.neurips.cc/paper/2020/file/fb4c835feb0a65cc39739320d7a51c02-Paper.pdf)

- [AAAI'2022-NASA] Regularizing graph neural networks via consistency-diversity graph augmentations. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/20307/20066)

- [KDD'2020-NodeAug] NodeAug: Semi-supervised node classification with data augmentation. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3394486.3403063?casa_token=5O6rVP5WUNcAAAAA:39vUmeVOiby_U-6UC3f4_vw5YEox2awfj22tgTeoZMa8f2IPeQ0w-2x23QC8V9_nwSO2F8Y9tFVWUbY)

- [AAAI'2021-GAUG] Data augmentation for graph neural networks. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/17315/17122)

- [AAAI'2021-GraphMix] Graphmix: Improved training of gnns for semi-supervised learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/17203/17010)

- [WWW'2021-GraphMixup] Mixup for node and graph classification. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449796)

- [WSDM'2021-GraphSMOTE] Graphsmote: Imbalanced node classification on graphs with graph neural networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3437963.3441720)

- [CVPR'2022-FLAG] Robust optimization as data augmentation for large-scale graphs. [[paper]](https://openaccess.thecvf.com/content/CVPR2022/papers/Kong_Robust_Optimization_As_Data_Augmentation_for_Large-Scale_Graphs_CVPR_2022_paper.pdf)

- [ICML'2022-G-Mixup] G-mixup: Graph data augmentation for graph classification. [[paper]](https://proceedings.mlr.press/v162/han22c/han22c.pdf)

- [ICML'2022-LAGNN] Local augmentation for graph neural networks. [[paper]](https://proceedings.mlr.press/v162/liu22s/liu22s.pdf)

## How To Learn From Graph Data With Limited-availability and Low-quality? 

The answer to this question corresponds to **'Graph Data Exploitation'** stage in DC-GML framework, incorporating four strategies to learn from graph data with low-quality and limited-availability, i.e., *Graph Self-supervised Learning, Graph Semi-supervised Learning, Graph Active Learning, and Graph Transfer Learning*.

### Graph Self-supervised Learning

- [TKDE'2022-Survey] Graph self-supervised learning: A survey. [[paper]](https://ieeexplore.ieee.org/iel7/69/10113816/09770382.pdf?casa_token=ydQeuh-OkpYAAAAA:hFnyLT6NKFztPKkO47dh8cdQ49fm5Sal8jmpoxdz5a4jBXqbL08A3miB1Z2XfGkrGChYovXRyb2W)

- [arXiv'2016-GAE] Variational graph auto-encoders. [[paper]](https://arxiv.org/pdf/1611.07308.pdf%5D)

- [CIKM'2017-MGAE] MGAE: Marginalized graph autoencoder for graph clustering. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3132847.3132967?casa_token=JNQueyr_GKgAAAAA:7xzVsWp65NPw-I2MqMxV-OkI21tM6chr7nfSHdcihKJGaEL5IL3OFYD17uEXSCI4a4JS4wTFAsSshwI)

- [IJCAI'2018-ARGA] Adversarially regularized graph autoencoder for graph embedding. [[paper]](https://arxiv.org/pdf/1802.04407)

- [ICLR'2019-DGI] Deep graph infomax. [[paper]](https://arxiv.org/pdf/1809.10341)

- [ICML'2020-MVGRL] Contrastive multi-view representation learning on graphs. [[paper]](http://proceedings.mlr.press/v119/hassani20a/hassani20a.pdf)

- [NeurIPS'2020-GraphCL] Graph contrastive learning with augmentations. [[paper]](https://proceedings.neurips.cc/paper_files/paper/2020/file/3fe230348e9a12c13120749e3f9fa4cd-Paper.pdf)

- [arXiv'2020-PairwiseDistance/NodeProperty] Self-supervised learning on graphs: Deep insights and new direction. [[paper]](https://arxiv.org/pdf/2006.10141)

- [NeurIPS'2020-GROVER] Self-supervised graph transformer on large-scale molecular data. [[paper]](https://proceedings.neurips.cc/paper/2020/file/94aef38441efa3380a3bed3faf1f9d5d-Paper.pdf)

- [WWW'2020-GMI] Graph representation learning via graphical mutual information maximization. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3366423.3380112?casa_token=8_d4PlRfYmIAAAAA:uYIQuMXnOVCRWjFlpm3Q2TKpnR-wrMBM_-MRjnS7A_8F8oEqKR5IAK0mXX7bDcP3o9L3_hk_-3MWjn4)

- [ICML'2020] When does self-supervision help graph convolutional networks? [[paper]](http://proceedings.mlr.press/v119/you20a/you20a.pdf)

- [WWW'2021-GCA] Graph contrastive learning with adaptive augmentation. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449802?casa_token=J5uSGyBfZzgAAAAA:3ppijn_2zTmgQmUeVpULosGAFxH8EBDZQoupLyb_JMTUBdELSQ27XCKREpudR4rnwn1ZPGEwNkmm2mI)

- [ICML'2021-JOAO] Graph contrastive learning automated. [[paper]](http://proceedings.mlr.press/v139/you21a/you21a.pdf)

- [NeurIPS'2021-AD-GCL] Adversarial graph augmentation to improve graph contrastive learning. [[paper]](https://proceedings.neurips.cc/paper/2021/file/854f1fb6f65734d9e49f708d6cd84ad6-Paper.pdf)

- [KDD'2022-GraphMAE] GraphMAE: Self-supervised masked graph autoencoders. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539321)

- [Information Sciences'2022-S2GRL] A new self-supervised task on graphs: Geodesic distance prediction. [[paper]](https://www.sciencedirect.com/science/article/pii/S0020025522006375?casa_token=I1kGX3li2DYAAAAA:JoWLxh9juXpnsr9g1PY2J56GT5rFe2EcVEbLxVVkN0uiWHQ8rJuyS2UQQZ9Eg5a98RavqlGGUus)

- [ICLR'2022-AutoSSL] Automated self-supervised learning for graphs. [[paper]](https://arxiv.org/pdf/2106.05470)

### Graph Semi-supervised Learning

- [ICML'2003] Semi-supervised learning using gaussian fields and harmonic functions. [[paper]](https://cdn.aaai.org/ICML/2003/ICML03-118.pdf)

- [NeurIPS'2003] Learning with local and global consistency. [[paper]](https://proceedings.neurips.cc/paper/2003/file/87682805257e619d49b8e0dfdc14affa-Paper.pdf)

- [ICML'2005] Learning from labeled and unlabeled data on a directed graph. [[paper]](https://dl.acm.org/doi/pdf/10.1145/1102351.1102482?casa_token=nvP_3IHEcZgAAAAA:rWVcWavK2RWe4SBAlQTZlW0oaAJV91Y-d6QX3K161YKOyaSYHdJk_Bz1482do0ybB16auyTsFNMbUO8)

- [AAAI'2018] Deeper insights into graph convolutional networks for semi-supervised learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/11604/11463)

- [KDD'2020-NodeAug] NodeAug: Semi-supervised node classification with data augmentation. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3394486.3403063?casa_token=5O6rVP5WUNcAAAAA:39vUmeVOiby_U-6UC3f4_vw5YEox2awfj22tgTeoZMa8f2IPeQ0w-2x23QC8V9_nwSO2F8Y9tFVWUbY)

- [NeurIPS'2020-GRAND] Graph random neural networks for semi-supervised learning on graphs. [[paper]](https://proceedings.neurips.cc/paper/2020/file/fb4c835feb0a65cc39739320d7a51c02-Paper.pdf)

- [AAAI'2020-M3S] Multi-stage self-supervised learning for graph convolutional networks on graphs with few labeled nodes. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/download/6048/5904)

- [WSDM'2021-SimP-GCN] Node similarity preserving graph convolutional networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3437963.3441735)

- [ACM-TIS'2021-GCN-LPA] Combining graph convolutional neural networks and label propagation. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3490478?casa_token=PmUrlevC108AAAAA:KDwh2PRvk-k_1XmmD0KtZhlIyG7GUUzw1pYJe4fCHVKh6JL0Tl6IFRdg0DXpKunxc4-XYBysIUEdU0E)

- [AAAI'2021-CG3] Contrastive and generative graph convolutional networks for graph-based semi-supervised learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/17206/17013)

- [NeurIPS'2021-GCPN] Contrastive graph poisson networks: Semi-supervised learning with extremely limited labels. [[paper]](https://proceedings.neurips.cc/paper/2021/file/31c0b36aef265d9221af80872ceb62f9-Paper.pdf)

- [AAAI'2022-Meta-PN] Meta propagation networks for graph few-shot semi-supervised learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/20605/20364)

- [World Wide Web'2022-CycProp] Cyclic label propagation for graph semi-supervised learning. [[paper]](https://idp.springer.com/authorize/casa?redirect_uri=https://link.springer.com/article/10.1007/s11280-021-00906-2&casa_token=y5doXTXXcdQAAAAA:FBx5NfHPBmJEjeSSjdTtJ-RfdEq57KqFn-fNqyQP-364LUhUxBEKGd_GHqQ2p9Hme8P1OnVuBWCFK42aBUA)

### Graph Active Learning

- [arXiv'2017-AGE] Active learning for graph embedding. [[paper]](https://arxiv.org/pdf/1705.05085)

- [IJCAI'2018-ANRMAB] Active discriminative network representation learning. [[paper]](https://opus.lib.uts.edu.au/bitstream/10453/131527/1/Active%20discriminative%20network%20representation%20learning.pdf)

- [IJCAI'2019-ActiveHNE] ActiveHNE: active heterogeneous network embedding. [[paper]](https://arxiv.org/pdf/1905.05659)

- [arXiv'2019-FeatProp] Active learning for graph neural networks via node feature propagation. [[paper]](https://arxiv.org/pdf/1910.07567)

- [WWW'2020-ATNE] Active domain transfer on network embedding. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3366423.3380024?casa_token=FecoeNTSq2QAAAAA:BzlicC0uYb8kyV8FgxwwGWg5V3G9DRb8cnATqEG_TURzQUKCIZYcF83JuQ0Jl14Y-EUXIygS0UAaDyI)

- [KDD'2020-ASGN] ASGN: An active semi-supervised graph neural network for molecular property prediction. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3394486.3403117?casa_token=QSPH_GxeKq8AAAAA:GCbb1_8dboX0xxf94atEz2iBRtxb_RtuLWOaHYZW9hbfkojnPwhUN0U2LSMOxfYoAiw1BkMXoKsyjgI)

- [NeurIPS'2020-GPA] Graph policy network for transferable active learning on graphs. [[paper]](https://proceedings.neurips.cc/paper/2020/file/73740ea85c4ec25f00f9acbd859f861d-Paper.pdf)

- [ACML'2020-MetAL] Metal: Active semi-supervised learning on graphs via meta-learning. [[paper]](http://proceedings.mlr.press/v129/madhawa20a/madhawa20a.pdf)

- [TNNLS'2020-SEAL] Seal: Semisupervised adversarial active learning on attributed graphs. [[paper]](https://ieeexplore.ieee.org/iel7/5962385/6104215/09158558.pdf?casa_token=E_ZvrdYJhakAAAAA:0gkVyZk9eCLyBj7U04Z1_HBmA8LBZtZ4KtdwxUdY4Nl6wXCGB8i4ZRxyi0rEShdk3swoEVwijf8X)

- [VLDB Endowment'2021-GRAIN] GRAIN: improving data efficiency of graph neural networks via diversified in fluence maximization. [[paper]](https://arxiv.org/pdf/2108.00219)

- [NeurIPS'2021-RIM] RIM: Reliable influence-based active learning on graphs. [[paper]](https://proceedings.neurips.cc/paper/2021/file/eb86d510361fc23b59f18c1bc9802cc6-Paper.pdf)

- [WWW'2021-Attent] Attent: Active attributed network alignment. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449886?casa_token=IfxcTs5UUWgAAAAA:pJ-lyCKpCf1Jc3Kr7GG-JVWqYPH9jDYJjEwMa-3W-woQYcop5H3Zx264KGy3vCJ2EJU_WqAk2EH-RQI)

- [ICMD'2021-ALG] ALG: Fast and accurate active learning framework for graph convolutional networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3448016.3457325?casa_token=mvig94UQ6ngAAAAA:vU5pYuZ6WohLXhjiaCsuthFrGhPwqH4lYRDRLYjtXeiPb1_rvscz04dBZIZhGg4gX3mon6rFz1Xs6hs)

- [WWW'2022-ALLIE] ALLIE: Active learning on large-scale imbalanced graphs. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3485447.3512229)

- [AAAI'2022-BIGENE] Batch active learning with graph neural networks via multi-agent deep reinforcement learning. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/20897/20656)

- [ICLR'2022-IGP] Information Gain Propagation: A new way to graph active learning with soft labels. [[paper]](https://arxiv.org/pdf/2203.01093)

- [KDD'2022-JuryGCN] JuryGCN: quantifying jackknife uncertainty on graph convolutional networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539286?casa_token=pMOGKpsGGzwAAAAA:-hnHQLGfe7DXEX9DtkVcKC-t_RnRICOjtK9uXDAc_4GULkfy_l0AgGveJw9fGntKT3WivfcfeaKmwhk)

### Graph Transfer Learning

- [IJCAI'2019-DANE] DANE: domain adaptive network embedding. [[paper]](https://arxiv.org/pdf/1906.00684)

- [WWW'2020-UDA-GCN] Unsupervised domain adaptive graph convolutional networks. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3366423.3380219?casa_token=fswpPMwhRoUAAAAA:YGi0Ah4-jcilsyQn0FT-NG4XVwS3iTk8WXp30RiyP2rbeS7qdFiLEMF-3AQ1rFYHtJMNp-1L_QDCvOU)

- [AAAI'2020-ACDNE] Adversarial deep network embedding for cross-network node classification. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/5692/5548)

- [ICDM'2020-OpenWGL] Openwgl: Open-world graph learning. [[paper]](https://ieeexplore.ieee.org/iel7/9338245/9338248/09338284.pdf?casa_token=R6FDgOWers4AAAAA:aBaB05Ecwql1oUk3CarNCI1TSGDb_9cVfjBft-TH1v4hvnoEhwWcEiEu2cw0VKQCgXxAeldEFDiH)

- [ICML'2020-PGL] Progressive graph learning for open-set domain adaptation. [[paper]](http://proceedings.mlr.press/v119/luo20b/luo20b.pdf)

- [NeurIPS'2021-SRGNN] Shift-robust gnns: Overcoming the limitations of localized graph training data. [[paper]](https://proceedings.neurips.cc/paper_files/paper/2021/file/eb55e369affa90f77dd7dc9e2cd33b16-Paper.pdf)

- [arXiv'2021-SOGA] Source free unsupervised graph domain adaptation. [[paper]](https://arxiv.org/pdf/2112.00955)

- [arXiv'2021] Graph domain adaptation: A generative view. [[paper]](https://arxiv.org/pdf/2106.07482)

- [NeurIPS-Workshop'2022-SRNC] Shift-robust node classification via graph clustering co-training. [[paper]](https://openreview.net/pdf?id=CXm7uzRlvxf)

## How To Build Graph MLOps System: The Graph Data-centric View.

The answer to this question corresponds to three stages of **'Graph Data Collection, Graph Data Exploration, and Graph Data Maintenance'** in DC-GML framework. Along with **Graph Data Improvement and Graph Data Exploitation**, we build a graph MLOps from the graph data-centric view.

### Graph Data Collection

- Amazon Mechanical Turk: https://www.mturk.com/

- [SIGIR-Workshop'2011] Semi-supervised consensus labeling for crowdsourcing. [[paper]](https://www.ischool.utexas.edu/~ml/papers/tang-cir11.pdf)

- [Cloud Computing'2021] Knowledge graphs meet crowdsourcing: a brief survey. [[paper]](https://link.springer.com/chapter/10.1007/978-3-030-69992-5_1)

- [Journal of Classification'1997] Estimation and prediction for stochastic blockmodels for graphs with latent block structure. [[paper]](https://idp.springer.com/authorize/casa?redirect_uri=https://link.springer.com/content/pdf/10.1007/s003579900004.pdf&casa_token=HmfwBFmsgzwAAAAA:m_PjqmuzWqzS3EWaOvjQD6BN-oNlQ-N3Wiie67xOvRqhaBVsKcmgreLA8w17PJU3B2QRzIeNSM8o4N3UhQ8)

- Probabilistic graphical models: principles and techniques. [[book]](https://list01.bio.ens.psl.eu/wws/d_read/machine_learning/BayesianNetworks/koller.pdf)

- [NeurIPS'2019] Gnnexplainer: Generating explanations for graph neural networks. [[paper]](https://proceedings.neurips.cc/paper_files/paper/2019/file/d80b7040b773199015de6d3b4293c8ff-Paper.pdf)

- [KDD'2014] Focused clustering and outlier detection in large attributed graphs. [[paper]](https://dl.acm.org/doi/pdf/10.1145/2623330.2623682?casa_token=bFPgGTyUJrwAAAAA:UxQuO_EC7-SHHC3CPnBlhi04YtzUhiwQ2FSS29OMhdIBz3Wq0WVoSDgNtwM06cKifzqFOH2bj_DJc_g)

- [Journal of Machine Learning Research'2023] Graph clustering with graph neural networks. [[paper]](https://www.jmlr.org/papers/volume24/20-998/20-998.pdf)

- [WWW'2021] Pathfinder discovery networks for neural message passing. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3442381.3449882?casa_token=OeRAWAYACGkAAAAA:KcUEIrMuvK8IwYpfQeXKoSWnL2ibQYXzke4bL_-bEBljWWIojJMNy_sPtEtBmIl2I942D4R19UpsHn8)

- [arXiv'2020] Benchmarking graph neural networks. [[paper]](https://www.jmlr.org/papers/volume24/22-0567/22-0567.pdf)

- [arXiv'2022] Synthetic graph generation to benchmark graph learning. [[paper]](https://arxiv.org/pdf/2204.01376)

- [KDD'2022] Graphworld: Fake graphs bring real insights for gnns. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539203)

### Graph Data Exploration

- NetworkX: https://networkx.org/

- igraph: https://igraph.org/

- Neo4j: https://neo4j.com/

- [ECML-PKDD'2021] Graphsvx: Shapley value explanations for graph neural networks. [[paper]](https://arxiv.org/pdf/2104.10482)

### Graph Data Maintenance

- [arXiv'2022-TrustworthyGNN-Survey] Trustworthy graph neural networks: Aspects, methods and trends. [[paper]](https://arxiv.org/pdf/2205.07424)

- [IEEE Network'2010] Privacy and security for online social networks: challenges and opportunities. [[paper]](https://ieeexplore.ieee.org/iel5/65/5510907/05510913.pdf?casa_token=--aFnZdnJPMAAAAA:GY_7DMBORUJ5xD6NmiEDLdSQyCv7BTjuFZy3RY3LGF53EIiTYf7BB_0oKbfGOQPjq4OkD7drRZNs)

- [SIGMOD'2008] Towards identity anonymization on graphs. [[paper]](https://dl.acm.org/doi/pdf/10.1145/1376616.1376629?casa_token=-rPE_pQvmwAAAAAA:equ17IAF6blOiwP0ke0qTz8l4pRE9hYUvjG8ONxpO-OFyV3yAWSkIAuEw-yBOlsC89hWmcoCiP7U9qo)

- [Multimedia Tools and Applications'2018] Privacy preservation based on clustering perturbation algorithm for social network. [[paper]](https://idp.springer.com/authorize/casa?redirect_uri=https://link.springer.com/article/10.1007/s11042-017-5502-3&casa_token=ExB8rAhIa0gAAAAA:T0OZ9Se-h1YcuxwT_PifyHTa_sfvE9x884x0T5mAcoGikV8gn_WBIG-EZ9wulZSxSFqa4a0H98hJW1JUXIk)

- [EDBT/ICDT Workshops'2015] Privacy-Integrated Graph Clustering Through Differential Privacy. [[paper]](https://www.academia.edu/download/77577505/EDBTICDT-WS2015-complete.pdf#page=252)

- [Information Sciences'2020] PGAS: Privacy-preserving graph encryption for accurate constrained shortest distance queries. [[paper]](https://www.sciencedirect.com/science/article/pii/S0020025519306991?casa_token=tpKpvhkxlcYAAAAA:FWf7bFBr-nQ90BMOgKaeZGdMk7bA6BdbONNtfCEde5uNb8FvM8Wqlpq1zKlzZLcH4vZrd53onoE)

- [KDD'2022] Federatedscope-gnn: Towards a unified, comprehensive and efficient package for federated graph learning. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539112?casa_token=lLfb0irOvVcAAAAA:peP_OcLHTBrw8vOr-vdfk6ZtcBCzDpxPW_MpyQm9joQg1awXqso9xQYCCdwu0ioNqOFbhgv4dOmSoTw)

- [AAAI'2023] Federated learning on Non-IID graphs via structural knowledge sharing. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/download/26187/25959)

- [IEEE Communications Magazine'1994] Access control: principle and practice. [[paper]](https://ieeexplore.ieee.org/iel1/35/7577/00312842.pdf?casa_token=ue7Oix7c3DwAAAAA:xvf-oW5BVRFKhutUkmBaEc2MqiMSN0yW9yuY7WzHrblUkaZ9fKZziY9BwOzlATnLgFPBHvtZ0FF7)

- [Global Summit on Computer and Information Technology'2014] Implementation of elliptic curve digital signature algorithm (ECDSA). [[paper]](https://www.researchgate.net/profile/Abdessalem-Abidi/publication/286584272_Implementation_of_elliptic_curve_digital_signature_algorithm_ECDSA/links/570b712e08ae8883a1fe0f70/Implementation-of-elliptic-curve-digital-signature-algorithm-ECDSA.pdf)

- [Computers and Security'2021] Threat detection and investigation with system-level provenance graphs: a survey. [[paper]](https://www.sciencedirect.com/science/article/pii/S0167404821001061?casa_token=2AkT5nELWYMAAAAA:nczh2dVDq2nc9mSMJaOo9VjPod7g4BTqHYV-pzzC0UA_eZjRpEhvL1wDaAVRJ7uxwqAn8bnTszQ)

### Graph MLOps

- Kubeflow: https://github.com/kubeflow/kubeflow

- Amazon SageMaker: https://aws.amazon.com/sagemaker/

- Amazon Neptune: https://neptune.ai/product

- GraphStorm: https://github.com/awslabs/graphstorm/wiki

- Real-time Fraud Detection with Graph Neural Network on DGL: https://github.com/awslabs/realtime-fraud-detection-with-gnn-on-dgl
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Data-Centric-GraphML/awesome-papers

Awesome Lists containing this project

README