Abstract
In class-imbalanced applications, traditional ensemble data stream mining algorithms achieve low accuracy on the minority classes and therefore fail to meet application needs. This paper proposes CIMAE, a novel class-imbalanced data learning method based on MAE, to address this problem. Instead of training directly on each incoming sample, CIMAE maintains a sample library and a sliding window and uses them to assemble a data block for each round of online training. In comparisons with traditional data stream mining algorithms, CIMAE achieves state-of-the-art performance on class-imbalanced applications.
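The abstract does not spell out how the sample library and the sliding window interact, so the following is only a minimal sketch of one plausible reading, not the CIMAE procedure itself. It assumes that the window holds the most recent samples, that samples leaving the window are stored per class in the library, and that each training block is the window padded with stored samples of under-represented classes. The names `BlockBuilder`, `window_size`, `library_size`, `add`, and `next_block` are hypothetical and do not come from the paper.

```python
from collections import Counter, deque
from typing import Deque, Dict, Hashable, List, Tuple

Sample = Tuple[List[float], Hashable]  # (feature vector, class label)


class BlockBuilder:
    """Hypothetical helper that assembles training blocks from a stream."""

    def __init__(self, window_size: int, library_size: int) -> None:
        # Sliding window over the most recent samples.
        self.window: Deque[Sample] = deque(maxlen=window_size)
        self.library_size = library_size
        # Sample library: per-class store of samples that left the window.
        self.library: Dict[Hashable, Deque[Sample]] = {}

    def add(self, sample: Sample) -> None:
        """Push one incoming sample; archive the sample it displaces."""
        if len(self.window) == self.window.maxlen:
            evicted = self.window[0]  # oldest sample, dropped by the append below
            store = self.library.setdefault(
                evicted[1], deque(maxlen=self.library_size)
            )
            store.append(evicted)
        self.window.append(sample)

    def next_block(self) -> List[Sample]:
        """Return the window plus library samples for under-represented classes."""
        block = list(self.window)
        counts = Counter(label for _, label in block)
        target = max(counts.values(), default=0)  # size of the largest class
        for label, stored in self.library.items():
            missing = target - counts[label]
            if missing > 0:
                # Assumption: pad with the most recently archived samples.
                block.extend(list(stored)[-missing:])
        return block
```

With this design a minority-class sample that has already slid out of the window can still appear in later training blocks, which is the effect the abstract attributes to combining the two structures. The padding rule (matching the largest class in the window) is an illustrative choice only.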
Acknowledgement
This work is sponsored by the National Key Research and Development Program of China (2017YFC0806502, 2017YFC0803700, 2017YFC0821600), the Shanghai Rising-Star Program (17QB1401000), the Application Innovation Plan of the Ministry of Public Security (2017YYCXSXST030), and the program of the Science and Technology Commission of Shanghai Municipality (Nos. 15530701300, 1759800900).
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Sun, Q., Wu, Y., Duan, H., Wang, J., Mei, L. (2019). An Ensemble Data Stream Mining Algorithm for Class-Imbalanced Applications. In: Abawajy, J., Choo, KK., Islam, R., Xu, Z., Atiquzzaman, M. (eds) International Conference on Applications and Techniques in Cyber Security and Intelligence ATCI 2018. ATCI 2018. Advances in Intelligent Systems and Computing, vol 842. Springer, Cham. https://doi.org/10.1007/978-3-319-98776-7_12
DOI: https://doi.org/10.1007/978-3-319-98776-7_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98775-0
Online ISBN: 978-3-319-98776-7
eBook Packages: Intelligent Technologies and Robotics (R0)