Model-Based Clustering Methods for Time Series

Bock, Hans-Hermann

doi:10.1007/978-3-319-01264-3_1

Hans-Hermann Bock²²

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

962 Accesses
1 Citations

Abstract

This paper considers the problem of clustering n observed time series \(\mathbf{x}_{k} =\{\ x_{k}(t)\ \vert \ t \in \mathcal{T}\}\), k = 1, …, n, with time points t in a suitable time range \(\mathcal{T}\), into a suitable number m of clusters \(C_{1},\ldots,C_{m} \subset \{ 1,\ldots,n\}\) each one comprising time series with a ‘similar’ structure. Classical approaches might typically proceed by first computing a dissimilarity matrix and then applying a traditional, possibly hierarchical clustering method. In contrast, here we will present a brief survey about various approaches that start by defining probabilistic clustering models for the time series, i.e., with class-specific distribution models, and then determine a suitable (hopefully optimum) clustering by statistical tools like maximum likelihood and optimization algorithms. In particular, we will consider models with class-specific Gaussian processes and Markov chains.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

TSX-Means: An Optimal K Search Approach for Time Series Clustering

Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic

Article 09 October 2018

Multiscale and Multivariate Time Series Clustering: A New Approach

References

Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716–723
Article MathSciNet MATH Google Scholar
Banfield JD, Raftery AE (1993) Model-based Gaussian and non-Gaussian clustering. Biometrics 49(3):803–821
Article MathSciNet MATH Google Scholar
Biernacki C, Govaert G (1997) Using the classification likelihood to choose the number of clusters. Comput Sci Stat 29(2):451–457
Google Scholar
Biernacki C, Celeux G, Govaert G (2000) Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans Pattern Anal Mach Intell 22(7):719–725
Article Google Scholar
Bouveyron C, Jacques J (2011) Model-based clustering of time series in group-specific functional subspaces. Adv Data Anal Classif 5:281–300
Article MathSciNet MATH Google Scholar
Chiou JM, Li PL (2007) Functional clustering and identifying substructures of longitudinal data. J R Stat Soc B (Stat Methodol) 69(4):679–699
Article MathSciNet Google Scholar
Chouakria AD, Nagabhushan PN (2007) Adaptive dissimilarity index for measuring time series proximity. Adv Data Anal Classif 1:5–21
Article MathSciNet MATH Google Scholar
Claeskens G, Hjort NL (2003) “The focused information criterion” (with discussion). J Am Stat Assoc 98:879–899
Article MathSciNet MATH Google Scholar
Claeskens G, Hjort NL (2008) Model selection and model averaging. Cambridge University Press, Cambridge/New York
Book MATH Google Scholar
De la Cruz-Mesía R, Quintana FA, Marshall G (2008) Model-based clustering for longitudinal data. Comput Stat Data Anal 52(3):1441–1457
Article MATH Google Scholar
Delaigle A, Hall P (2010) Defining probability density for a distribution of random functions. Ann Stat 38:1171–1193
Article MathSciNet MATH Google Scholar
Delaigle A, Hall P, Bathia N (2012) Componentwise classification and clustering of functional data. Biometrika 99(2):299–313
Article MathSciNet MATH Google Scholar
Ferraty F, Vieu P (2010) Nonparametric functional data analysis: theory and practice. Springer, New York
Google Scholar
Ferrazzi F, Magni P, Bellazzi R (2005) Random walk models for Bayesian clustering of gene expression profiles. Appl Bioinf 4:263–276
Article Google Scholar
Frühwirth-Schnatter S (2006) Finite mixture and Markov switching models. Springer series in statistics. Springer, New York
MATH Google Scholar
Frühwirth-Schnatter S (2011) Panel data analysis: a survey on model-based clustering of time series. Adv Data Anal Classif 5(4):251–280
Article MathSciNet MATH Google Scholar
Horenko I (2010) Finite element approach to clustering of multidimensional time series. SIAM J Sci Comput 32(1):62–83
Article MathSciNet MATH Google Scholar
Jacques J, Preda C (2012) Functional data clustering using density approximation. In: Journées de Statistique de la SFdS, Université Libre de Bruxelles, pp 21–25
Google Scholar
Kalpakis K, Gada D, Puttagunta V (2001) Distance measures for effective clustering of ARIMA time-series. In: Proceedings IEEE international conference on data mining, San Jose, pp 273–280
Google Scholar
Liao TW (2005) Clustering of time series data – a survey. Pattern Recognit 38(11):1857–1874
Article MATH Google Scholar
McNicholas PD, Murphy TB (2010) Model-based clustering of longitudinal data. Can J Stat 38(1):153–168
MathSciNet MATH Google Scholar
Pamminger C, Frühwirth-Schnatter S (2010) Model-based clustering of time series. Bayesian Anal 5:345–368
Article MathSciNet Google Scholar
Peng J, Müller HG (2008) Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions. Ann Appl Stat 2:1056–1077
Article MathSciNet MATH Google Scholar
Ramsay J, Silverman BW (2005) Functional data analysis, 2nd edn. Springer series in statistics, Springer, New York
Google Scholar
Samé A, Chamroukhi F, Govaert G, Aknin P (2011) Model-based clustering and segmentation of time series with changes in regime. Adv Data Anal Classif 5(4):301–321
Article MathSciNet MATH Google Scholar
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461–464
Article MATH Google Scholar
Sebastiani P, Ramoni M, Cohen P, Warwick J, Davis J (1999) Discovering dynamics using Bayesian clustering. In: Hand D, Kok J, Berthold M (eds) Advances in intelligent data analysis. Lecture notes in computer science, vol 1642. Springer, Berlin, pp 199–209
Chapter Google Scholar
Song X, Jermaine C, Ranka S, Gums J (2008) A Bayesian mixture model with linear regression mixing proportions. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’08, Las Vegas. ACM, New York, pp 659–667
Google Scholar
Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A (2002) Bayesian measures of model complexity and fit. J R Stat Soc 64(4):583–639
Article MathSciNet MATH Google Scholar
Vilar JA, Pértega S (2004) Discriminant and cluster analysis for Gaussian stationary processes: local linear fitting approach. J Nonparametr Stat 16:443–462
Article MathSciNet MATH Google Scholar
Wasserman L (2000) Bayesian model selection and model averaging. J Math Psychol 44(1):92–107
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Statistics, RWTH Aachen University, Aachen, Germany
Hans-Hermann Bock

Authors

Hans-Hermann Bock
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hans-Hermann Bock .

Editor information

Editors and Affiliations

Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Wolfgang Gaul
Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Andreas Geyer-Schulz
The Institute of Statistical Mathematics, Tokyo, Japan
Yasumasa Baba
Graduate School of Management and Information Systems, Tama University, Tokyo, Japan
Akinori Okada

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bock, HH. (2014). Model-Based Clustering Methods for Time Series. In: Gaul, W., Geyer-Schulz, A., Baba, Y., Okada, A. (eds) German-Japanese Interchange of Data Analysis Results. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-01264-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-01264-3_1
Published: 10 October 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01263-6
Online ISBN: 978-3-319-01264-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Model-Based Clustering Methods for Time Series

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

TSX-Means: An Optimal K Search Approach for Time Series Clustering

Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic

Multiscale and Multivariate Time Series Clustering: A New Approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Model-Based Clustering Methods for Time Series

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

TSX-Means: An Optimal K Search Approach for Time Series Clustering

Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic

Multiscale and Multivariate Time Series Clustering: A New Approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation