This repository contains the source code and datasets for the paper "Balanced Multi-Relational Graph Clustering", accepted by the 32nd ACM International Conference on Multimedia (MM 2024).
Paper Link: https://arxiv.org/abs/2407.16863
All the datasets and the trained model parameters can be downloaded from datasets link.
Place the 'data' and 'best' folders from the downloaded files into the BMGC directory.
This code requires the following:
- Python==3.9.16
- PyTorch==1.13.1
- DGL==0.9.1
- Numpy==1.24.2
- Scipy==1.10.1
- Scikit-learn==1.2.1
- Munkres==1.1.4
- kmeans-pytorch==0.3
- PyTorch_Geometric==2.2.0
python main.py
or python large.py
(to run the MAG dataset)
MAG is a large-scale citation network, constituting the largest dataset in multi-relation graph clustering thus far. MAG is a subset extracted from OGBN-MAG, consisting of the four largest classes. Its multi-relational structures are constructed from the heterogeneous graph structures of OGBN-MAG using different meta-paths. MAG contains 113,919 paper nodes with graphs generated by two meta-paths (paper-author-paper and paper-cite-paper). Each paper is associated with a 128-dimensional word2vec feature vector. The processed MAG can be downloaded from the above "datasets link".
If the MAG dataset contributes to your research, we would sincerely appreciate it if you could cite our paper. Thank you!
@inproceedings{shen2024balanced,
title={Balanced Multi-Relational Graph Clustering},
author={Shen, Zhixiang and He, Haolan and Kang, Zhao},
booktitle={Proceedings of the 32nd ACM International Conference on Multimedia},
pages={4120--4128},
year={2024}
}