Abstract
Most of the current data sources generate large amounts of data over time. Renewable energy generation is one example of such data sources. Machine learning is often applied to forecast time series. Since data flows are usually large, trends in data may change and learned patterns might not be optimal in the most recent data. In this paper, we analyse wind energy generation data extracted from the Sistema de Información del Operador del Sistema (ESIOS) of the Spanish power grid. We perform a study to evaluate detecting concept drifts to retrain models and thus improve the quality of forecasting. To this end, we compare the performance of a linear regression model when it is retrained randomly and when a concept drift is detected, respectively. Our experiments show that a concept drift approach improves forecasting between a 7.88% and a 33.97% depending on the concept drift technique applied.
Supported by the Spanish Ministry of Science and Innovation (PID2020-117954RB-C22) and the Andalusian Regional Government (US-1263341, P18-RT-2778).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Díaz Cordero, G.: El cambio climático. Ciencia y sociedad (2012)
Lizano, B.: Calentamiento Global: “la máxima expresión de la civilización petrofósil”. Revista del CESLA. https://www.redalyc.org/articulo.oa?id=243329724003
Rafique, M.M., Bahaidarah, H.M., Anwar, M.K.: Enabling private sector investment in off-grid electrification for cleaner production: optimum designing and achievable rate of unit electricity. J. Clean. Prod. 206, 508–523 (2019). https://doi.org/10.1016/j.jclepro.2018.09.123
Mills, A.D., Levin, T., Wiser, R., Seel, J., Botterud, A.: Impacts of variable renewable energy on wholesale markets and generating assets in the united states: a review of expectations and evidence. Renew. Sustain. Energy Rev. 120, 109670 (2020). https://doi.org/10.1016/j.rser.2019.109670
Jaworski, M.: Regression function and noise variance tracking methods for data streams with concept drift. Int. J. Appl. Math. Comput. Sci. 28(3), 559–567 (2018)
Barddal, J.P., Gomes, H.M., Enembreck, F.: Advances on concept drift detection in regression tasks using social networks theory. Int. J. Nat. Comput. Res. (IJNCR) 5(1), 26–41 (2015)
Zenisek, J., Holzinger, F., Affenzeller, M.: Machine learning based concept drift detection for predictive maintenance. Comput. Ind. Eng. 137, 106031 (2019). https://doi.org/10.1016/j.cie.2019.106031
Ikonomovska, E., Gama, J., Sebastião, R., Gjorgjevik, D.: Regression trees from data streams with drift detection. In: Gama, J., Costa, V.S., Jorge, A.M., Brazdil, P.B. (eds.) DS 2009. LNCS (LNAI), vol. 5808, pp. 121–135. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04747-3_12
Baier, L., Kühl, N., Satzger, G., Hofmann, M., Mohr, M.: Handling concept drifts in regression problems – the error intersection approach. In: WI2020 Zentrale Tracks, pp. 210–224. GITO Verlag (2020). https://doi.org/10.30844/wi_2020_c1-baier
Ray, S.: A quick review of machine learning algorithms. In: 2019 International conference on machine learning, big data, pp. 35–39. cloud and parallel computing (COMITCon), IEEE (2019)
Api esios documentation. https://api.esios.ree.es/
Bifet, A., Gavalda, R.: Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM International Conference on Data Mining, pp. 443–448. SIAM (2007)
Raab, C., Heusinger, M., Schleif, F.-M.: Reactive soft prototype computing for concept drift streams. Neurocomputing 416, 340–351 (2020)
Lima, M., Filho, T.S., de A. Fagundes, R.A.: A comparative study on concept drift detectors for regression. In: Britto, A., Valdivia Delgado, K. (eds.) BRACIS 2021. LNCS (LNAI), vol. 13073, pp. 390–405. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-91702-9_26
Page, E.S.: Continuous inspection schemes. Biometrika 41(1/2), 100–115 (1954)
Montiel, J., Read, J., Bifet, A., Abdessalem, T.: Scikit-multiflow: a multi-output streaming framework. J. Mach. Learn. Res. 19(72), 1–5 (2018)
Buitinck, L. et al.: API design for machine learning software: experiences from the scikit-learn project. In: ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pp. 108–122 (2013)
Bergmeir, C., Benítez, J.M.: On the use of cross-validation for time series predictor evaluation. Inf. Sci. 191, 192–213 (2012). https://doi.org/10.1016/j.ins.2011.12.028. Data Mining for Software Trustworthiness
Bergmeir, C., Costantini, M., Benítez, J.M.: On the usefulness of cross-validation for directional forecast evaluation, computational statistics & data analysis. cFEnetwork: Ann. Comput. Financ. Econometr. 76, 132–143 (2014). https://doi.org/10.1016/j.csda.2014.02.001
Shcherbakov, M.V., et al.: A survey of forecast error measures. World Appl. Sci. J. 24(24), 171–176 (2013)
Hewamalage, H., Montero-Manso, P., Bergmeir, C., Hyndman, R.J.: A look at the evaluation setup of the m5 forecasting competition. ar**v preprint ar**v:2108.03588 (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Cabello-López, T., Cañizares-Juan, M., Carranza-García, M., Garcia-Gutiérrez, J., Riquelme, J.C. (2022). Concept Drift Detection to Improve Time Series Forecasting of Wind Energy Generation. In: García Bringas, P., et al. Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469. Springer, Cham. https://doi.org/10.1007/978-3-031-15471-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-031-15471-3_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15470-6
Online ISBN: 978-3-031-15471-3
eBook Packages: Computer ScienceComputer Science (R0)