Log in

Metapath and attribute-based academic collaborator recommendation in heterogeneous academic networks

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Academic collaboration is fundamental to the advancement of scientific research. However, with the growing number of publications and researchers, it becomes increasingly challenging to identify suitable collaborators. Academic collaborator recommendation is a promising solution to this problem. Traditional recommendation methods based on collaborative filtering suffer serious data sparsity. In recent years, network topology-based methods have shown good recommendation performance while alleviating the data sparsity issue to some extent by exploiting the relationships between nodes and their attributes. Nevertheless, these methods are typically based on homogeneous collaboration networks that consist only of scholar nodes and collaboration relationships, leading to suboptimal performance. In reality, collaboration involves many different types of nodes and relations that accumulate multiplex information. To address this issue, we construct a heterogeneous academic information network comprising four types of nodes: scholars, papers, organizations, and publication venues. An academic collaborator recommendation model is designed to capture multi-type attribute features and network topology features of nodes through metapaths based on the network. Specifically, the attribute features of nodes are embedded by a node type-aware embedding method. The topology features are then extracted through the node type-aware aggregation and metapath instance aggregation procedure. After that, we utilize a metapath aggregation method to gather different types of metapaths, each representing a factor that affects collaboration. Thus, the topology information and attribute information are preserved, while encompassing multi-type factors of collaboration. Finally, we compute the vector similarity to determine collaborators. Through rigorous experimentation on a large-scale interdisciplinary academic dataset, we found that the proposed model exhibits outstanding performance in practical applications. Unlike traditional approaches confined to homogeneous collaboration networks, our model delves deeper by mining and leveraging diverse node attributes and multiple collaboration influencing factors. This approach significantly enhances the accuracy and effectiveness of collaborator recommendations. Ultimately, we aspire to contribute to a more efficient and accessible platform that simplifies the search for suitable collaborators.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. https://www.aminer.cn/open/article?id=55af4228dabfae1ce3ed1253.

References

  • Abramo, G., D’Angelo, C. A., & Di Costa, F. (2009). Research collaboration and productivity: Is there correlation? Higher Education, 57, 155–171.

    Article  Google Scholar 

  • Abramo, G., D’Angelo, C. A., & Di Costa, F. (2012). Identifying interdisciplinarity through the disciplinary classification of coauthors of scientific publications. Journal of the American Society for Information Science and Technology, 63(11), 2206–2222.

    Article  Google Scholar 

  • Ahn, S. J., & Kim, M. (2021). Variational graph normalized autoencoders. In Proceedings of the 30th ACM international conference on information & knowledge management (pp. 2827–2831).

  • Amini, B., Ibrahim, R., Othman, M. S., & Selamat, A. (2014). Capturing scholar’s knowledge from heterogeneous resources for profiling in recommender systems. Expert Systems with Applications, 41(17), 7945–7957.

    Article  Google Scholar 

  • Bornmann, L., & Leydesdorff, L. (2015). Topical connections between the institutions within an organisation (institutional co-authorships, direct citation links and co-citations). Scientometrics, 102, 455–463.

    Article  Google Scholar 

  • Cen, Y., Zou, X., Zhang, J., Yang, H., Zhou, J., & Tang, J. (2019). Representation learning for attributed multiplex heterogeneous network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining (pp. 1358–1368).

  • Chuan, P. M., Son, L. H., Ali, M., Khang, T. D., Huong, L. T., & Dey, N. (2018). Link prediction in co-authorship networks based on hybrid content similarity metric. Applied Intelligence, 48, 2470–2486.

    Article  Google Scholar 

  • Diederik, P. K., & Jimmy Lei, B. (2014). Adam: A method for stochastic optimization. International Conference on Learning Representations, 2014, 1–10.

    Google Scholar 

  • Dong, Y., Chawla, N. V., & Swami, A. (2017). Metapath2vec: Scalable representation learning for heterogeneous networks. In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 135–144).

  • Du, O., & Li, Y. (2022). Academic collaborator recommendation based on attributed network embedding. Journal of Data and Information Science, 7(1), 37–56.

    Article  Google Scholar 

  • Fu, X., & King, I. (2024). MECCH: Metapath context convolution-based heterogeneous graph neural networks. Neural Networks, 170, 266–275.

    Article  Google Scholar 

  • Fu, X., Zhang, J., Meng, Z., & King, I. (2020). Magnn: Metapath aggregated graph neural network for heterogeneous graph embedding. In Proceedings of the web conference 2020 (pp. 2331–2341).

  • Guan, M., Cai, X., Shang, J., Hao, F., Liu, D., Jiao, X., & Ni, W. (2023). HMSG: Heterogeneous graph neural network based on metapath SubGraph learning. Knowledge-Based Systems, 279, 110930.

    Article  Google Scholar 

  • He, C., Wu, J., & Zhang, Q. (2022). Proximity-aware research leadership recommendation in research collaboration via deep neural networks. Journal of the Association for Information Science and Technology, 73(1), 70–89.

    Article  Google Scholar 

  • Hong, R., He, Y., Wu, L., Ge, Y., & Wu, X. (2021). Deep Attributed network embedding by preserving structure and attribute information. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51(3), 1434–1445.

    Article  Google Scholar 

  • Huang, X., Song, Q., Li, Y., & Hu, X. (2019). Graph recurrent networks with attributed random walks. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining (pp. 732–740).

  • Lee, D. H., Brusilovsky, P., & Schleyer, T. (2011). Recommending collaborators using social features and MeSH terms. Proceedings of the American Society for Information Science and Technology, 48(1), 1–10.

    Google Scholar 

  • Liao, L., He, X., Zhang, H., & Chua, T. S. (2018). Attributed social network embedding. IEEE Transactions on Knowledge and Data Engineering, 30(12), 2257–2270.

    Article  Google Scholar 

  • Liu, Z., **e, X., & Chen, L. (2018). Context-aware academic collaborator recommendation. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining (pp. 1870–1879).

  • Liu, X., Wu, K., Liu, B., & Qian, R. (2023). HNERec: Scientific collaborator recommendation model based on heterogeneous network embedding. Information Processing & Management, 60(2), 103253.

    Article  Google Scholar 

  • Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. Preprint retrieved from https://arxiv.org/abs/1301.3781

  • Mo, Y., Peng, L., Xu, J., Shi, X., & Zhu, X. (2022). Simple unsupervised graph representation learning. In Proceedings of the AAAI conference on artificial intelligence (Vol. 36, No. 7, pp. 7797–7805).

  • Shen, Y., Li, H., Li, D., Zheng, J., & Wang, W. (2022). ANGraph: Attribute-interactive neighborhood-aggregative graph representation learning. Neural Computing and Applications, 34(20), 17937–17949.

    Article  Google Scholar 

  • Shi, C., Hu, B., Zhao, W. X., & Philip, S. Y. (2018). Heterogeneous information network embedding for recommendation. IEEE Transactions on Knowledge and Data Engineering, 31(2), 357–370.

    Article  Google Scholar 

  • Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 10.

    Google Scholar 

  • Wang, W., Yu, S., Bekele, T. M., Kong, X., & **a, F. (2017). Scientific collaboration patterns vary with scholars’ academic ages. Scientometrics, 112, 329–343.

    Article  Google Scholar 

  • Wang, W., Liu, J., Yang, Z., Kong, X., & **a, F. (2019a). Sustainable collaborator recommendation based on conference closure. IEEE Transactions on Computational Social Systems, 6(2), 311–322.

    Article  Google Scholar 

  • Wang, X., Ji, H., Shi, C., Wang, B., Ye, Y., Cui, P., & Yu, P. S. (2019b). Heterogeneous graph attention network. In The world wide web conference (pp. 2022–2032).

  • Wang, Y., Duan, Z., Liao, B., Wu, F., & Zhuang, Y. (2019c). Heterogeneous attributed network embedding with graph convolutional networks. In Proceedings of the AAAI conference on artificial intelligence (Vol. 33, No. 01, pp. 10061–10062).

  • Wang, W., Liu, J., Tang, T., Tuarob, S., **a, F., Gong, Z., & King, I. (2020). Attributed collaboration network embedding for academic relationship mining. ACM Transactions on the Web (TWEB), 15(1), 1–20.

    Google Scholar 

  • Wang, W., Liu, J., Tang, T., Tuarob, S., **a, F., Gong, Z., & King, I. (2021). Attributed collaboration network embedding for academic relationship mining. ACM Transactions on the Web, 15(1), 1–20.

    Article  Google Scholar 

  • West, J. D., Jacquet, J., King, M. M., Correll, S. J., & Bergstrom, C. T. (2013). The role of gender in scholarly authorship. PLoS ONE, 8(7), e66212.

    Article  Google Scholar 

  • **, X., Wei, J., Guo, Y., & Duan, W. (2022). Academic collaborations: A recommender framework spanning research interests and network topology. Scientometrics, 127(11), 6787–6808.

    Article  Google Scholar 

  • Yang, H., Pan, S., Zhang, P., Chen, L., Lian, D., & Zhang, C. (2018). Binarized attributed network embedding. In 2018 IEEE international conference on data mining (ICDM) (pp. 1476–1481). IEEE.

  • Yang, C., Liu, T., Chen, X., Bian, Y., & Liu, Y. (2020). HNRWalker: Recommending academic collaborators with dynamic transition probabilities in heterogeneous networks. Scientometrics, 123, 429–449.

    Article  Google Scholar 

  • Yang, Y., Guan, Z., Li, J., Zhao, W., Cui, J., & Wang, Q. (2021). Interpretable and efficient heterogeneous graph convolutional network. IEEE Transactions on Knowledge and Data Engineering. https://doi.org/10.1109/TKDE.2021.3101356

    Article  Google Scholar 

  • Yu, L., Sun, L., Du, B., Liu, C., Lv, W., & **ong, H. (2022). Heterogeneous graph representation learning with relation awareness. IEEE Transactions on Knowledge and Data Engineering. https://doi.org/10.1109/TKDE.2022.3160208

    Article  Google Scholar 

  • Zhang, H., Qiu, L., Yi, L., & Song, Y. (2018a). Scalable multiplex network embedding. In IJCAI (Vol. 18, pp. 3082–3088).

  • Zhang, J., Shi, X., **e, J., Ma, H., King, I., & Yeung, D. Y. (2018b). Gaan: Gated attention networks for learning on large and spatiotemporal graphs. Preprint retrieved from https://arxiv.org/abs/1803.07294

  • Zhang, C., Wu, X., Yan, W., Wang, L., & Zhang, L. (2019). Attribute-aware graph recurrent networks for scholarly friend recommendation based on internet of scholars in scholarly big data. IEEE Transactions on Industrial Informatics, 16(4), 2707–2715.

    Article  Google Scholar 

  • Zhao, D., & Qin, H. (2023). Collaborator recommendation based on multiple information graphs. In 2023 IEEE 6th information technology, networking, electronic and automation control conference (ITNEC) (Vol. 6, pp. 1125–1128). IEEE.

  • Zhou, X., Liang, W., Kevin, I., Wang, K., Huang, R., & **, Q. (2018). Academic influence aware and multidimensional network analysis for research collaboration navigation based on scholarly big data. IEEE Transactions on Emerging Topics in Computing, 9(1), 246–257.

    Article  Google Scholar 

Download references

Funding

Funding was provided Natural Science Foundation of Shannxi Province, China (Grant No. 2023-JC-YB-625).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hui Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, H., Hu, Y. Metapath and attribute-based academic collaborator recommendation in heterogeneous academic networks. Scientometrics (2024). https://doi.org/10.1007/s11192-024-05043-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11192-024-05043-x

Keywords

Navigation