Abstract
Temporal point process data has been widely observed in many applications including finance, health, and infrastructures, so that it has become an important topic in data analytics domain. Generally, a point process only records occurrence of a type of event as 1 or 0. To interpret the temporal point process, it is important to estimate the intensity of the occurrence of events, which is challenging especially when the intensity is dynamic over time, for example non-homogeneous Poisson process (NHPP) which is exactly what we will analyse in this paper. We performed a joint task to determine which two NHPP sequences are in the same group and to estimate the intensity resides in that group. Distance dependent Chinese Restaurant Process (ddCRP) provides a prior to cluster data points within a Bayesian nonparametric framework, alleviating the required knowledge to set the number of clusters which is sensitive in clustering problems. However, the distance in previous studies of ddCRP is designed for data points, in this paper such distance is measured by dynamic time war** (DTW) due to its wide application in ordinary time series (e.g. observed values are in \(\mathcal {R}\)). The empirical study using synthetic and real-world datasets shows promising outcome compared with the alternative techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Adams, R.P., Murray, I., MacKay, D.J.: Tractable nonparametric Bayesian inference in poisson processes with gaussian process intensities. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 9–16. ACM (2009)
Aghabozorgi, S., Shirkhorshidi, A.S., Wah, T.Y.: Time-series clustering-a decade review. Inform. Syst. 53, 16–38 (2015)
Blei, D.M., Frazier, P.I.: Distance dependent chinese restaurant processes. J. Mach. Learn. Res. 12(Aug), 2461–2488 (2011)
Blei, D.M., Griffiths, T.L., Jordan, M.I.: The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies. J. ACM (JACM) 57(2), 7 (2010)
Cuturi, M., Blondel, M.: Soft-DTW: a differentiable loss function for time-series. In: Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 894–903. JMLR.org (2017)
Geler, Z., Kurbalija, V., Radovanović, M., Ivanović, M.: Impact of the Sakoe-Chiba band on the DTW time series distance measure for kNN classification. In: Buchmann, R., Kifor, C.V., Yu, J. (eds.) KSEM 2014. LNCS (LNAI), vol. 8793, pp. 105–114. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-12096-6_10
Griffiths, T.L., Jordan, M.I., Tenenbaum, J.B., Blei, D.M.: Hierarchical topic models and the nested Chinese restaurant process. In: Advances in Neural Information Processing Systems, pp. 17–24 (2004)
Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, Amsterdam (2011)
Hardiman, S.J., Bercot, N., Bouchaud, J.P.: Critical reflexivity in financial markets: a hawkes process analysis. Eur. Phys. J. B 86(10), 442 (2013)
Hrycej, T.: Gibbs sampling in bayesian networks. Artif. Intell. 46(3), 351–363 (1990)
Kottas, A.: Dirichlet process mixtures of beta distributions, with applications to density and intensity estimation. In: Workshop on Learning with Nonparametric Bayesian Methods, 23rd International Conference on Machine Learning (ICML) (2006)
Liao, T.W.: Clustering of time series data–a survey. Pattern Recogn. 38(11), 1857–1874 (2005)
Mueen, A., Keogh, E.: Extracting optimal performance from dynamic time war**. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2129–2130. ACM (2016)
Niekum, S.: A brief introduction to Bayesian nonparametric methods for clustering and time series analysis. Technical report CMU-RI-TR-15-02, Robotics Institute, Carnegie Mellon University (2015)
Petitjean, F., Forestier, G., Webb, G.I., Nicholson, A.E., Chen, Y., Keogh, E.: Faster and more accurate classification of time series by exploiting a novel dynamic time war** averaging algorithm. Knowl. Inf. Syst. 47(1), 1–26 (2016)
Rakthanmanon, T., et al.: Searching and mining trillions of time series subsequences under dynamic time war**. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 262–270. ACM (2012)
Robert, C., Casella, G.: Monte Carlo Statistical Methods. Springer, Heidelberg (2013)
Wolff, R.W.: Poisson arrivals see time averages. Oper. Res. 30(2), 223–231 (1982)
Zhang, B., Zhang, L., Guo, T., Wang, Y., Chen, F.: Simultaneous urban region function discovery and popularity estimation via an infinite urbanization process model. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2692–2700. ACM (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chang, Y. et al. (2019). Exploring Latent Structure Similarity for Bayesian Nonparameteric Model with Mixture of NHPP Sequence. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11954. Springer, Cham. https://doi.org/10.1007/978-3-030-36711-4_36
Download citation
DOI: https://doi.org/10.1007/978-3-030-36711-4_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36710-7
Online ISBN: 978-3-030-36711-4
eBook Packages: Computer ScienceComputer Science (R0)