Clustering Financial Time Series by Dependency

Alonso, Andrés M.; Gamboa, Carolina; Peña, Daniel

doi:10.1007/978-3-031-30164-3_1

Andrés M. Alonso²²,
Carolina Gamboa²³ &
Daniel Peña²²

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

Included in the following conference series:

Scientific Meeting of the Classification and Data Analysis Group of the Italian Statistical Society

514 Accesses

Abstract

In this paper, we propose a procedure for clustering financial time series by dependency on their volatilities. Our procedure is based on the generalized cross correlation between the estimated volatilities of a time series. Monte Carlo experiments are carried out to analyze the improvements obtained by clustering using the squared residuals instead of the levels of the series. Our procedure was able to recover the original clustering structures in all cases in our Monte Carlo study. Finally, the methodology is applied to a set of financial data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alonso, A. M., & Peña, D. (2019). Clustering time series by linear dependency. Statistics and Computing, 29, 655–676. https://doi.org/10.1007/s11222-018-9830-6.
Alonso, A. M., D’Urso, P., Gamboa, C., & Guerrero, V. (2021). Cophenetic-based fuzzy clustering of time series by linear dependency. International Journal of Approximate Reasoning, 137, 114–136. https://doi.org/10.1016/j.ijar.2021.07.006.
Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31, 307–327. https://doi.org/10.1016/0304-4076(86)90063-1.
Caiado, J., Crato, N., & Peña, D. (2006). A periodogram-based metric for time series classification. Computational Statistics & Data Analysis, 50, 2668–2684. https://doi.org/10.1016/j.csda.2005.04.012.
Caiado, J., Maharaj, E. A., & D’Urso, P. (2015). Time-series clustering. In: Handbook of cluster analysis (pp. 262–285). Chapman and Hall/CRC. https://doi.org/10.1201/9780429058264.
Díaz, S. P., & Vilar, J. A. (2010). Comparing several parametric and nonparametric approaches to time series clustering: A simulation study. Journal of Classification, 27, 333–362. https://doi.org/10.1007/s00357-010-9064-6.
D’Urso, P., Cappelli, C., Di Lallo, D., & Massari, R. (2013). Clustering of financial time series. Physica A: Statistical Mechanics and its Applications, 392, 2114–2129. https://doi.org/10.1016/j.physa.2013.01.027.
Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 50, 987–1007. https://doi.org/10.2307/1912773.
Galeano, P., & Peña, D. (2000). Multivariate analysis in vector time series. Resenhas do Instituto de Matemática e Estatística da Universidade de S ao Paulo, 4, 383–403.
MathSciNet MATH Google Scholar
Hubert, L., & Arabie, P. (1985). Comparing partitions. Journal of Classification, 2, 193–218. https://doi.org/10.1007/BF01908075.
Jeong, Y.-S., Jeong, M. K., & Omitaomu, O. A. (2011). Weighted dynamic time war** for time series classification. Pattern Recognition, 44, 2231–2240. https://doi.org/10.1016/j.patcog.2010.09.022.
Lafuente-Rego, B., & Vilar, J. A. (2016). Clustering of time series using quantile autocovariances. Advances in Data Analysis and Classification, 10, 391–415. https://doi.org/10.1007/s11634-015-0208-8.
La Rocca, M., & Vitale, V. (2021). Clustering time series by nonlinear dependence. In M. Corazza et al. (Eds.), Mathematical and Statistical Methods for Actuarial Sciences and Finance (pp. 291–297). https://doi.org/10.1007/978-3-030-78965-7_43.
Otranto, E. (2008). Clustering heteroskedastic time series by model-based procedures. Computational Statistics & Data Analysis, 52, 4685–4698. https://doi.org/10.1016/j.csda.2008.03.020.
Piccolo, D. (1990). A distance measure for classifying ARIMA models. Journal of Time Series Analysis, 11, 153–164. https://doi.org/10.1111/j.1467-9892.1990.tb00048.x.
Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65. https://doi.org/10.1016/0377-0427(87)90125-7.
Tsay, R. S. (2010). Analysis of financial time series. Wiley. https://doi.org/10.1002/9780470644560.
Tsay, R. S. (2014). Multivariate time series analysis. Wiley. https://doi.org/10.1002/9780470644560.ch8.
Zhang, B., & An, B. (2018). Clustering time series based on dependence structure. PloS One, 13, e0206753. https://doi.org/10.1371/journal.pone.0206753.
Zhou, Z. (2012). Measuring nonlinear dependence in time-series, a distance correlation approach. Journal of Time Series Analysis, 33, 438–457. https://doi.org/10.1111/j.1467-9892.2011.00780.x.

Download references

Acknowledgements

The authors gratefully acknowledge the financial support from the Spanish government Agencia Estatal de Investigación (PID2019-108311GB-I00/AEI/10.13039/501100011033) and Comunidad de Madrid.

Author information

Authors and Affiliations

Department of Statistics and Institute Flores de Lemus, Universidad Carlos III de Madrid, 28903, Getafe, Spain
Andrés M. Alonso & Daniel Peña
Department of Statistics, Universidad Carlos III de Madrid, 28903, Getafe, Spain
Carolina Gamboa

Authors

Andrés M. Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Carolina Gamboa
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Peña
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrés M. Alonso .

Editor information

Editors and Affiliations

Department of Statistics, Computer Science, Applications 'G. Parenti', University of Florence, Florence, Italy
Leonardo Grilli
Department of Statistics, Computer Science, Applications 'G. Parenti', University of Florence, Florence, Italy
Monia Lupparelli
Department of Statistics, Computer Science, Applications 'G. Parenti', University of Florence, Florence, Italy
Carla Rampichini
Department of Statistics, Computer Science, Applications 'G. Parenti', University of Florence, Florence, Italy
Emilia Rocco
Department of Statistical Sciences, Sapienza University of Rome, Rome, Italy
Maurizio Vichi

Appendix

Correlation of Two Dependent Chi-square Variables

Let $\eta _{t,x}$ and $\eta _{t,y}$ be normal variables with mean 0, variance 1, and ${\text {cor}}(\eta _{t,x}, \eta _{t,y}) = \rho $. Suppose that $\eta _{t,y} = \rho \eta _{t,x} + a \eta _{t,z}$ with $\eta _{t,z}$ a standard normal variable independent of $\eta _{t,x}$ and $a^{2} = 1- \rho ^{2}$. Thus

$$\begin{aligned} \eta ^{2}_{t,x}\eta ^{2}_{t,y} = \eta _{t,x}^{2}(\rho ^{2} \eta ^{2}_{t,x} + 2\rho a \eta _{t,x}\eta _{t,z}+a^{2} \eta ^{2}_{t,z}) = \eta _{t,x}^{4}\rho ^{2} + 2 a \rho \eta ^{3}_{t,x}\eta _{t,z}+a^{2} \eta ^{2}_{t,z } \eta ^{2}_{t,x}.\end{aligned}$$

Using the moments of standard normal variables and the independence between $\eta _{t,x}$ and $\eta _{t,z}$, we have that $E(\eta ^{2}_{t,x}\eta ^{2}_{t,y}) = 3\rho ^{2}+ a^{2} = 2\rho ^{2}+1$. Thus, we conclude that

$$\begin{aligned} \text {cov}(\eta ^{2}_{t,x}, \eta ^{2}_{t,y}) = E(\eta ^{2}_{t,x}\eta ^{2}_{t,y}) - E(\eta ^{2}_{t,x})E(\eta ^{2}_{t,y}) = E(\eta ^{2}_{t,x}\eta ^{2}_{t,y}) -1 = 2\rho ^{2}. \end{aligned}$$

On the other hand, we know that $var(\eta _{t,x}^{2}) = var(\eta _{t,y}^{2}) = 2$, therefore,

$$\begin{aligned} \text {cor}(\eta ^{2}_{t,x}, \eta ^{2}_{t,y}) = \frac{2\rho ^{2}}{2} = \rho ^{2}. \end{aligned}$$

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alonso, A.M., Gamboa, C., Peña, D. (2023). Clustering Financial Time Series by Dependency. In: Grilli, L., Lupparelli, M., Rampichini, C., Rocco, E., Vichi, M. (eds) Statistical Models and Methods for Data Science. CLADAG 2021. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-031-30164-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-031-30164-3_1
Published: 25 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-30163-6
Online ISBN: 978-3-031-30164-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Clustering Financial Time Series by Dependency

Abstract

Access this chapter

Subscribe and save

Buy Now

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix

Appendix

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation