Three–Way Classification: Ambiguity and Abstention in Machine Learning

Campagner, Andrea; Cabitza, Federico; Ciucci, Davide

doi:10.1007/978-3-030-22815-6_22

Andrea Campagner^21,23,
Federico Cabitza^21,22 &
Davide Ciucci ORCID: orcid.org/0000-0002-8083-7809²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11499))

Included in the following conference series:

International Joint Conference on Rough Sets

991 Accesses
1 Altmetric

Abstract

Ambiguity, that is the lack of information to produce a specific classification, is an important issue in decision–making and supervised classification. In case of ambiguity, human–decision makers can resort to abstaining from making precise classifications (especially when error-related costs are high), but this behaviour has been scarcely addressed, and applied, in machine learning algorithms. This contribution grounds on previous works in the areas of three–way decisions, cautious classification and orthopairs, and proposes a set of techniques we developed to address this form of ambiguity, by providing both a general–purpose technique to create three–way algorithms from probabilistic ones, and also more specific techniques which could be applied to popular machine learning frameworks. We also evaluate the proposed idea, by performing a set of experiments where we compare classical classification algorithms with the corresponding three–way generalizations, in order to study the trade–off between classification accuracy and abstention: the results are promising.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 42.79; Price includes VAT (Germany)

Softcover Book: EUR 53.49; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Credal Decision Trees to Classify Noisy Data Sets

Towards a Logic-Based View of Some Approaches to Classification Tasks

Credal C4.5 with Refinement of Parameters

References

Bartlett, P.L., Wegkamp, M.H.: Classification with a reject option using a hinge loss. J. Mach. Learn. Res. 9, 1823–1840 (2008)
MathSciNet MATH Google Scholar
Bello, R., Falcon, R.: Rough sets in machine learning: a review. In: Wang, G., Skowron, A., Yao, Y., Ślęzak, D., Polkowski, L. (eds.) Thriving Rough Sets, pp. 87–118. Springer International Publishing, Cham (2017)
Chapter Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article Google Scholar
Cabitza, F., Ciucci, D., Rasoini, R.: A giant with feet of clay: on the validity of the data that feed machine learning in medicine. In: Cabitza, F., Batini, C., Magni, M. (eds.) Organizing for the Digital World. LNISO, vol. 28, pp. 121–136. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-90503-7_10
Chapter Google Scholar
Campagner, A., Cabitza, F., Ciucci, D.: Exploring medical data classification with three-way decision tree. In: Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 5: HEALTHINF. pp. 147–158. SCITEPRESS (2019)
Google Scholar
Campagner, A., Ciucci, D.: Three-way and semi-supervised decision tree learning based on orthopartitions. In: Medina, J., Ojeda-Aciego, M., Verdegay, J.L., Pelta, D.A., Cabrera, I.P., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2018. CCIS, vol. 854, pp. 748–759. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91476-3_61
Chapter Google Scholar
Campagner, A., Ciucci, D.: Orthopartitions and soft clustering. Knowl. Based Syst. (Submitted)
Google Scholar
Chow, C.: On optimum recognition error and reject tradeoff. IEEE Trans. Inform. Theory 16, 41–46 (1970)
Article Google Scholar
Ciucci, D.: Orthopairs: a simple and widely used way to model uncertainty. Fundamenta Informaticae 108, 287–304 (2011)
MathSciNet MATH Google Scholar
Ciucci, D.: Orthopairs and granular computing. Granular Comput. 1, 159–170 (2016)
Article Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Daniel, W.W.: Applied Nonparametric Statistics. Duxbury Thomson Learning (1990)
Google Scholar
Deo, R.: Machine learning in medicine. Circulation 132 (2015)
Article Google Scholar
Ellerman, D.: An introduction to logical entropy and its relation to Shannon entropy. Int. J. Semant. Comput. 7(2), 121–145 (2013)
Article Google Scholar
Feldman, K., Faust, L., Wu, X., Huang, C., Chawla, N.V.: Beyond volume: the impact of complex healthcare data on the machine learning pipeline. CoRR abs/1706.01513 (2017)
Google Scholar
Ferri, C., Hernández-Orallo, J.: Cautious classifiers. In: ROC Analysis in Artificial Intelligence, 1st International Workshop, ROCAI-2004, pp. 27–36 (2004)
Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
MATH Google Scholar
Hajian, S., Bonchi, F., Castillo, C.: Algorithmic bias: from discrimination discovery to fairness-aware data mining. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2125–2126, August 2016
Google Scholar
Han, P.K., Klein, W.M., Arora, N.K.: Varieties of uncertainty in health care: a conceptual taxonomy. Med. Decis. Making 31(6), 828–838 (2011)
Article Google Scholar
Hechtlinger, Y., Póczos, B., Wasserman, L.A.: Cautious deep learning. ar**v/CoRR abs/1805.09460 (2018)
Google Scholar
Hüllermeier, E.: Fuzzy sets in machine learning and data mining. Appl. Soft Comput. 11(2), 1493–1505 (2011)
Article Google Scholar
Hüllermeier, E.: Does machine learning need fuzzy logic? Fuzzy Sets Syst. 281, 292–299 (2015). Special Issue Celebrating the 50th Anniversary of Fuzzy Sets
Article MathSciNet Google Scholar
Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning. The MIT Press, Cambridge (2009)
Google Scholar
Kooi, T., et al.: Large scale deep learning for computer aided detection of mammographic lesions. Med. Image Anal. 35, 303–312 (2017)
Article Google Scholar
Li, J.D.: A two-step rejection procedure for testing multiple hypotheses. J. Stat. Plann. Infer. 138(6), 1521–1527 (2008)
Article MathSciNet Google Scholar
Obermeyer, Z., Emanuel, E.J.: Predicting the future - big data, machine learning, and clinical medicine. N. Engl. J. Med. 375(13), 1216–1219 (2016)
Article Google Scholar
Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11(5), 341–356 (1982)
Article Google Scholar
Shafer, G.: A Mathematical Theory of Evidence. Princeton University Press, Princeton (1976)
MATH Google Scholar
Smets, P., Kennes, R.: The transferable belief model. Artif. Intell. 66(2), 191–234 (1994)
Article MathSciNet Google Scholar
Svensson, C., Hübler, R., Figge, M.: Automated classification of circulating tumor cells and the impact of interobsever variability on classifier training and performance. J. Immunol. Res. 2015, 1–9 (2015)
Article Google Scholar
Yao, Y.: An outline of a theory of three-way decisions. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS (LNAI), vol. 7413, pp. 1–17. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32115-3_1
Chapter Google Scholar
Zadeh, L.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Sistemistica e Comunicazione, University of Milano–Bicocca, Viale Sarca 336, 20126, Milan, Italy
Andrea Campagner, Federico Cabitza & Davide Ciucci
IRCCS Istituto Ortopedico Galeazzi, Via Galeazzi 4, 20161, Milan, Italy
Federico Cabitza
Deloitte Italia, Via Tortona 25, Milan, Italy
Andrea Campagner

Authors

Andrea Campagner
View author publications
You can also search for this author in PubMed Google Scholar
Federico Cabitza
View author publications
You can also search for this author in PubMed Google Scholar
Davide Ciucci
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Davide Ciucci .

Editor information

Editors and Affiliations

University of Debrecen, Debrecen, Hungary
Tamás Mihálydeák
Southwest Petroleum University, Chengdu, China
Fan Min
Chongqing University of Posts and Telecommunications, Chongqing, China
Guoyin Wang
Indian Institute of Technology Kanpur, Kanpur, India
Mohua Banerjee
Fujian Normal University, Fuzhou, China
Ivo Düntsch
University of Rzeszów, Rzeszow, Poland
Zbigniew Suraj
University of Milano-Bicocca, Milan, Italy
Davide Ciucci

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Campagner, A., Cabitza, F., Ciucci, D. (2019). Three–Way Classification: Ambiguity and Abstention in Machine Learning. In: Mihálydeák, T., et al. Rough Sets. IJCRS 2019. Lecture Notes in Computer Science(), vol 11499. Springer, Cham. https://doi.org/10.1007/978-3-030-22815-6_22

Download citation

DOI: https://doi.org/10.1007/978-3-030-22815-6_22
Published: 09 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22814-9
Online ISBN: 978-3-030-22815-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Three–Way Classification: Ambiguity and Abstention in Machine Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Credal Decision Trees to Classify Noisy Data Sets

Towards a Logic-Based View of Some Approaches to Classification Tasks

Credal C4.5 with Refinement of Parameters

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Three–Way Classification: Ambiguity and Abstention in Machine Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Credal Decision Trees to Classify Noisy Data Sets

Towards a Logic-Based View of Some Approaches to Classification Tasks

Credal C4.5 with Refinement of Parameters

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation