Abstract
Ambiguity, that is the lack of information to produce a specific classification, is an important issue in decision–making and supervised classification. In case of ambiguity, human–decision makers can resort to abstaining from making precise classifications (especially when error-related costs are high), but this behaviour has been scarcely addressed, and applied, in machine learning algorithms. This contribution grounds on previous works in the areas of three–way decisions, cautious classification and orthopairs, and proposes a set of techniques we developed to address this form of ambiguity, by providing both a general–purpose technique to create three–way algorithms from probabilistic ones, and also more specific techniques which could be applied to popular machine learning frameworks. We also evaluate the proposed idea, by performing a set of experiments where we compare classical classification algorithms with the corresponding three–way generalizations, in order to study the trade–off between classification accuracy and abstention: the results are promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bartlett, P.L., Wegkamp, M.H.: Classification with a reject option using a hinge loss. J. Mach. Learn. Res. 9, 1823–1840 (2008)
Bello, R., Falcon, R.: Rough sets in machine learning: a review. In: Wang, G., Skowron, A., Yao, Y., Ślęzak, D., Polkowski, L. (eds.) Thriving Rough Sets, pp. 87–118. Springer International Publishing, Cham (2017)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Cabitza, F., Ciucci, D., Rasoini, R.: A giant with feet of clay: on the validity of the data that feed machine learning in medicine. In: Cabitza, F., Batini, C., Magni, M. (eds.) Organizing for the Digital World. LNISO, vol. 28, pp. 121–136. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-90503-7_10
Campagner, A., Cabitza, F., Ciucci, D.: Exploring medical data classification with three-way decision tree. In: Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 5: HEALTHINF. pp. 147–158. SCITEPRESS (2019)
Campagner, A., Ciucci, D.: Three-way and semi-supervised decision tree learning based on orthopartitions. In: Medina, J., Ojeda-Aciego, M., Verdegay, J.L., Pelta, D.A., Cabrera, I.P., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2018. CCIS, vol. 854, pp. 748–759. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91476-3_61
Campagner, A., Ciucci, D.: Orthopartitions and soft clustering. Knowl. Based Syst. (Submitted)
Chow, C.: On optimum recognition error and reject tradeoff. IEEE Trans. Inform. Theory 16, 41–46 (1970)
Ciucci, D.: Orthopairs: a simple and widely used way to model uncertainty. Fundamenta Informaticae 108, 287–304 (2011)
Ciucci, D.: Orthopairs and granular computing. Granular Comput. 1, 159–170 (2016)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Daniel, W.W.: Applied Nonparametric Statistics. Duxbury Thomson Learning (1990)
Deo, R.: Machine learning in medicine. Circulation 132 (2015)
Ellerman, D.: An introduction to logical entropy and its relation to Shannon entropy. Int. J. Semant. Comput. 7(2), 121–145 (2013)
Feldman, K., Faust, L., Wu, X., Huang, C., Chawla, N.V.: Beyond volume: the impact of complex healthcare data on the machine learning pipeline. CoRR abs/1706.01513 (2017)
Ferri, C., Hernández-Orallo, J.: Cautious classifiers. In: ROC Analysis in Artificial Intelligence, 1st International Workshop, ROCAI-2004, pp. 27–36 (2004)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
Hajian, S., Bonchi, F., Castillo, C.: Algorithmic bias: from discrimination discovery to fairness-aware data mining. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2125–2126, August 2016
Han, P.K., Klein, W.M., Arora, N.K.: Varieties of uncertainty in health care: a conceptual taxonomy. Med. Decis. Making 31(6), 828–838 (2011)
Hechtlinger, Y., Póczos, B., Wasserman, L.A.: Cautious deep learning. ar**v/CoRR abs/1805.09460 (2018)
Hüllermeier, E.: Fuzzy sets in machine learning and data mining. Appl. Soft Comput. 11(2), 1493–1505 (2011)
Hüllermeier, E.: Does machine learning need fuzzy logic? Fuzzy Sets Syst. 281, 292–299 (2015). Special Issue Celebrating the 50th Anniversary of Fuzzy Sets
Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning. The MIT Press, Cambridge (2009)
Kooi, T., et al.: Large scale deep learning for computer aided detection of mammographic lesions. Med. Image Anal. 35, 303–312 (2017)
Li, J.D.: A two-step rejection procedure for testing multiple hypotheses. J. Stat. Plann. Infer. 138(6), 1521–1527 (2008)
Obermeyer, Z., Emanuel, E.J.: Predicting the future - big data, machine learning, and clinical medicine. N. Engl. J. Med. 375(13), 1216–1219 (2016)
Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11(5), 341–356 (1982)
Shafer, G.: A Mathematical Theory of Evidence. Princeton University Press, Princeton (1976)
Smets, P., Kennes, R.: The transferable belief model. Artif. Intell. 66(2), 191–234 (1994)
Svensson, C., Hübler, R., Figge, M.: Automated classification of circulating tumor cells and the impact of interobsever variability on classifier training and performance. J. Immunol. Res. 2015, 1–9 (2015)
Yao, Y.: An outline of a theory of three-way decisions. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS (LNAI), vol. 7413, pp. 1–17. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32115-3_1
Zadeh, L.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Campagner, A., Cabitza, F., Ciucci, D. (2019). Three–Way Classification: Ambiguity and Abstention in Machine Learning. In: Mihálydeák, T., et al. Rough Sets. IJCRS 2019. Lecture Notes in Computer Science(), vol 11499. Springer, Cham. https://doi.org/10.1007/978-3-030-22815-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-22815-6_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22814-9
Online ISBN: 978-3-030-22815-6
eBook Packages: Computer ScienceComputer Science (R0)