Abstract
Bipartite graphs are naturally used to model relationships between two types of entities, such as people-location, user-post, and investor-stock. When modeling real-world applications like disease outbreaks, edges are often enriched with temporal information, leading to temporal bipartite graphs. While reachability has been extensively studied on (temporal) unipartite graphs, it remains largely unexplored on temporal bipartite graphs. To fill this research gap, we study the reachability problem on temporal bipartite graphs in this paper. Specifically, a vertex u reaches a vertex w in a temporal bipartite graph G if u and w are connected through a series of consecutive wedges with time constraints. To efficiently answer if a vertex can reach the other vertex, we propose an index-based method by adapting the idea of 2-hop labeling. Effective optimization strategies and parallelization techniques are devised to accelerate the index construction process. To better support real-life scenarios, we further show how the index is leveraged to efficiently answer other types of queries, e.g., single-source reachability and earliest-arrival path queries. In addition, we propose an efficient method to handle incremental maintenance of the index structure. Extensive experiments on 16 real-world graphs demonstrate the effectiveness and efficiency of our proposed techniques.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig12_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig13_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig14_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig15_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00778-024-00854-z/MediaObjects/778_2024_854_Fig16_HTML.png)
Similar content being viewed by others
Notes
There are also some other vertex ordering schemes, e.g., betweenness-based scheme and significant path-based scheme. Please see [32] for more details.
References
Coronavirus Research Center: https://coronavirus.jhu.edu/
KONECT: http://konect.cc/
https://covid.cdc.gov/covid-data-tracker/#trends_dailytrendscases
Wolfram Mathworld: https://mathworld.wolfram.com/RandomNumber.html
Abraham, I., Delling, D., Goldberg, A.V., Werneck, R.F.F.: Hierarchical hub labelings for shortest paths. In European Symposium on Algorithms, pages 24–35, (2012)
Ahmed, H., Zhang, Y., Zafar, M.S., Sheikh, N., Tai, Z.: Node embedding over attributed bipartite graphs. In International Conference on Knowledge Science, Engineering and Management, pages 202–210. Springer, (2020)
Akiba, T., Iwata, Y., Yoshida, Y.: Fast exact shortest-path distance queries on large networks by pruned landmark labeling. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pages 349–360, (2013)
Barabasi, A.-L.: The origin of bursts and heavy tails in human dynamics. Nature 435(7039), 207–211 (2005)
Beamer, S., Asanovic, K., Patterson, D.: Direction-optimizing breadth-first search. In SC’12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, pages 1–10. IEEE, (2012)
Bramandia, R., Choi, B., Ng, W.K.: Incremental maintenance of 2-hop labeling of large graphs. IEEE Trans. Knowl. Data Eng. 22(5), 682–698 (2010)
Bui-Xuan, B., Ferreira, A., Jarry, A.: Computing shortest, fastest, and foremost journeys in dynamic networks. Int. J. Found. Comput. Sci. 14(2), 267–285 (2003)
Casteigts, A., Himmel, A., Molter, H., Zschoche, P.: The computational complexity of finding temporal paths under waiting time constraints. CoRR, abs/1909.06437, (2019)
Chen, L., Gupta, A., Kurul, M.E.: Stack-based algorithms for pattern matching on dags. In Proceedings of the VLDB Endowment, pages 493–504, (2005)
Chen, X., Wang, K., Lin, X., Zhang, W., Qin, L., Zhang, Y.: Efficiently answering reachability and path queries on temporal bipartite graphs. Proc. VLDB Endow. 14(10), 1845–1858 (2021)
Chen, Y., Chen, Y.: Decomposing dags into spanning trees: A new way to compress transitive closures. In 2011 IEEE 27th International Conference on Data Engineering, pages 1007–1018, (2011)
Cohen, E., Halperin, E., Kaplan, H., Zwick, U.: Reachability and distance queries via 2-hop labels. SIAM J. Comput. (2002). https://doi.org/10.1137/S0097539702403098
Eubank, S., Guclu, H., Kumar, V.A., Marathe, M.V., Srinivasan, A., Toroczkai, Z., Wang, N.: Modelling disease outbreaks in realistic urban social networks. Nature 429(6988), 180–184 (2004)
Huang, S., Fu, A.W.-C., Liu, R.: Minimum spanning trees in temporal graphs. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD ’15, page 419–430, New York, NY, USA, (2015). Association for Computing Machinery
Jagadish, H.V.: A compression technique to materialize transitive closure. ACM Trans. Database Syst. (TODS) 15(4), 558–598 (1990)
Jiang, M., Fu, A.W.-C., Wong, R.C.-W., Xu, Y.: Hop doubling label indexing for point-to-point distance querying on scale-free networks. Proceedings of the VLDB Endowment 7(12), 1203–1214 (2014)
Jiang, Z.-Q., Zhou, W.-X.: Complex stock trading network among investors. Phys. A 389(21), 4929–4941 (2010)
**, R., **ang, Y., Ruan, N., Fuhry, D.: 3-hop: a high-compression indexing scheme for reachability query. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pages 813–826, (2009)
Kasukawa, T., Sugimoto, M., Hida, A., Minami, Y., Mori, M., Honma, S., Honma, K.-I., Mishima, K., Soga, T., Ueda, H.R.: Human blood metabolite timetable indicates internal body time. Proc. Natl. Acad. Sci. 109(37), 15036–15041 (2012)
Kempe, D., Kleinberg, J. M., Kumar, A.: Connectivity and inference problems for temporal networks. In Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing, pages 504–513, (2000)
Kleinberg, J.: Bursty and hierarchical structure in streams. Data Min. Knowl. Disc. 7(4), 373–397 (2003)
Kleinberg, J.: Cascading behavior in networks: algorithmic and economic issues. Algorithmic game theory 24, 613–632 (2007)
Ley, M.: The dblp computer science bibliography: Evolution, research issues, perspectives. In International symposium on string processing and information retrieval, pages 1–10. Springer, (2002)
Li, W., Qiao, M., Qin, L., Zhang, Y., Chang, L., Lin, X.: Scaling distance labeling on small-world networks. In Proceedings of the 2019 International Conference on Management of Data, pages 1060–1077, (2019)
Li, Y., Fang, J., Zeng, Y., Maag, B., Tong, Y., Zhang, L.: Two-sided online bipartite matching in spatial data: experiments and analysis. GeoInformatica 24(1), 175–198 (2020)
Li, Y., Lou, Z., Shi, Y., Han, J.: Temporal motifs in heterogeneous information networks. In MLG Workshop@ KDD, (2018)
Li, Y., U, L.H., Yiu, M.L., Kou, N.M.: An experimental study on hub labeling based shortest path algorithms. Proc.VLDB Endow. 11(4), 445–457 (2017)
Malik, H.A.M., Mahesar, A.W., Abid, F., Waqas, A., Wahiddin, M.R.: Two-mode network modeling and analysis of dengue epidemic behavior in gombak, malaysia. Appl. Math. Model. 43, 207–220 (2017)
O’Connor, C.M., Adams, J.U., Fairman, J.: Essentials of cell biology. Cambridge, MA: NPG Education 1, 54 (2010)
Pavlopoulos, G.A., Kontou, P.I., Pavlopoulou, A., Bouyioukos, C., Markou, E., Bagos, P.G.: Bipartite graphs in systems biology and medicine: a survey of methods and applications. GigaScience 7(4), giy014 (2018)
Pikies, T., Kubale, M.: Chromatic cost coloring of weighted bipartite graphs. Appl. Math. Comput. 375, 125073 (2020)
Sariyüce, A. E., Pinar, A.: Peeling bipartite networks for dense subgraph discovery. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, pages 504–512, (2018)
Semertzidis, K., Pitoura, E., Lillis, K.: Timereach: Historical reachability queries on evolving graphs. In Proceedings of the 18th International Conference on Extending Database Technology, EDBT, pages 121–132, (2015)
Seufert, S., Anand, A., Bedathur, S. J, Weikum, G.: FERRARI: flexible and efficient reachability range assignment for graph indexing. In 2013 IEEE 29th International Conference on Data Engineering (ICDE), pages 1009–1020, (2013)
Simon, K.: An improved algorithm for transitive closure on acyclic digraphs. Theoret. Comput. Sci. 58, 325–346 (1988)
Smart, A.G., Amaral, L.A., Ottino, J.M.: Cascading failure and robustness in metabolic networks. Proc. Natl. Acad. Sci. 105(36), 13223–13228 (2008)
Sun, X.Q., Shen, H.W., Cheng, X.Q.: Trading network predicts stock price. Sci. Rep. 4(1), 3711–3711 (2014)
Tong, Y., Zeng, Y., Ding, B., Wang, L., Chen, L.: Two-sided online micro-task assignment in spatial crowdsourcing. IEEE Trans. Knowl. Data Eng. 33(5), 2295–2309 (2019)
Trißl, S., Leser, U.: Fast and practical indexing and querying of very large graphs. In Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pages 845–856, (2007)
Valstar, L. D., Fletcher, G. H., Yoshida, Y.: Landmark indexing for evaluation of label-constrained reachability queries. In Proceedings of the 2017 ACM International Conference on Management of Data, pages 345–358, (2017)
van Schaik, S. J., de Moor, O.: A memory efficient reachability data structure through bit vector compression. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, pages 913–924, (2011)
Wang, H., He, H., Yang, J., Yu, P.S., Yu, J.X.: Dual labeling: Answering graph reachability queries in constant time. In 2006 IEEE 22nd International Conference on Data Engineering (ICDE), page 75, (2006)
Wang, J., de Vries, A.P., Reinders, M.J.T.: Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 501–508, (2006)
Wang, S. Lin, W. Yang, Y., **ao, X., Zhou, S.: Efficient route planning on public transportation networks: A labelling approach. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pages 967–982, (2015)
Wei, H., Yu, J.X., Lu, C., **, R.: Reachability querying: an independent permutation labeling approach. Proceedings of the VLDB Endowment 7(12), 1191–1202 (2014)
Wei, H., Yu, J.X., Lu, C., **, R.: Reachability querying: an independent permutation labeling approach. VLDB J. 27(1), 1–26 (2018)
Wen, D., Huang, Y., Zhang, Y., Qin, L., Zhang, W., Lin, X.: Efficiently answering span-reachability queries in large temporal graphs. In 2020 IEEE 36th International Conference on Data Engineering (ICDE), pages 1153–1164, (2020)
Wen, D., Yang, B., Zhang, Y., Qin, L., Cheng, D., Zhang, W.: Span-reachability querying in large temporal graphs. VLDB J. 31(4), 629–647 (2022)
Wu, H., Cheng, J., Lu, Y., Ke, Y., Huang, Y., Yan, D., Wu, H.: Core decomposition in large temporal graphs. In 2015 IEEE International Conference on Big Data (Big Data), pages 649–658, (2015)
Wu, H., Huang, Y., Cheng, J., Li, J., Ke, Y.: Reachability and time-based path queries in temporal graphs. In 2016 IEEE 32nd International Conference on Data Engineering (ICDE), pages 145–156, (2016)
Wu, H., Zhao, Y., Cheng, J., Yan, D.: Efficient processing of growing temporal graphs. In: Candan, S., Chen, L., Pedersen, T.B., Chang, L., Hua, W. (eds.) Database Syst. Adv. Appl., pp. 387–403. Springer, Cham (2017)
Wu, T., Yu, S., Liao, W., Chang, C.: Temporal bipartite projection and link prediction for online social networks. In 2014 IEEE International Conference on Big Data (Big Data), pages 52–59, (2014)
**e, N., Zhou, W., Shen, C., Li, T., Chen, S., Wei, J.: City disaster susceptibility comparisons using weighted bipartite graphs. International Journal of Next-Generation Computing, 9(1), (2018)
Yan, H., Jiang, Y., Liu, G.: Telecomm fraud detection via attributed bipartite network. In 2018 15th International Conference on Service Systems and Service Management (ICSSSM), pages 1–6. IEEE, (2018)
Yildirim, H., Chaoji, V., Zaki, M.J.: GRAIL: scalable reachability index for large graphs. Proc. VLDB Endow. 3(1), 276–284 (2010)
Yildirim, H., Chaoji, V., Zaki, M.J.: DAGGER: A scalable index for reachability queries in large dynamic graphs. CoRR, abs/1301.0977, (2013)
Yu, J.X., Cheng, J.: Graph reachability queries: A survey. In Managing and Mining Graph Data, pages 181–215. (2010)
Zhang, T., Gao, Y., Chen, L., Guo, W., Pu, S., Zheng, B., Jensen, C.S.: Efficient distributed reachability querying of massive temporal graphs. VLDB J. 28(6), 871–896 (2019)
Zignani, M.: Human mobility model based on time-varying bipartite graph. In 2011 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks, pages 1–4, (2011)
Acknowledgements
Kai Wang is supported by NSFC 62302294, Wenjie Zhang is supported by ARC DP230101445 and ARC FT210100303.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, K., Cai, M., Chen, X. et al. Efficient algorithms for reachability and path queries on temporal bipartite graphs. The VLDB Journal (2024). https://doi.org/10.1007/s00778-024-00854-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00778-024-00854-z