Abstract
Selective sampling, a form of active learning, reduces the cost of labeling additional training data by requesting labels only for the most informative unlabeled examples. The information added to an initial, randomly chosen training set is expected to improve the generalization performance of a learning machine. We investigate methods for selecting the most informative examples in the context of one-class classification problems, i.e., problems in which only (or nearly only) examples of the so-called target class are available. We apply selective sampling algorithms to a variety of domains, including real-world problems: mine detection and texture segmentation. The goal of this paper is to show why the best or most frequently used selective sampling methods for two- or multi-class problems are not necessarily the best ones for one-class classification. By modifying the sampling methods, we present a way of selecting a small subset of the unlabeled data to be presented to an expert for labeling, such that the performance of the retrained one-class classifier is significantly improved.
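The generic selective sampling loop described above (train on labeled targets, query the unlabeled examples the classifier is least certain about, have an expert label them, retrain) can be sketched as follows. This is a minimal illustration using boundary-distance uncertainty with a one-class SVM, one of the standard multi-class-style criteria the paper argues must be modified for the one-class setting; the synthetic data, kernel parameters, and query size `k` are assumptions for the sketch, not the paper's actual method or datasets.

```python
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)

# Initial labeled target-class sample and a pool of unlabeled data
# (synthetic Gaussians; stand-ins for, e.g., mine-detection features).
X_target = rng.normal(0.0, 1.0, size=(50, 2))
X_pool = rng.normal(0.0, 2.0, size=(500, 2))

# Train a one-class classifier on the target data only.
clf = OneClassSVM(kernel="rbf", gamma=0.5, nu=0.1).fit(X_target)

# Query criterion: the k pool examples closest to the decision boundary,
# i.e. where |decision_function| is smallest and the classifier is
# least certain whether the example belongs to the target class.
k = 10
uncertainty = np.abs(clf.decision_function(X_pool))
query_idx = np.argsort(uncertainty)[:k]
X_query = X_pool[query_idx]

# An expert would now label X_query; confirmed target examples are
# added to X_target and the one-class classifier is retrained.
print(X_query.shape)
```

In a multi-class setting this boundary-distance criterion is well motivated, since the boundary separates two labeled classes; in the one-class setting the boundary is fitted around the targets alone, which is precisely why such criteria need the kind of modification the paper investigates.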
© 2003 Springer-Verlag Berlin Heidelberg
Cite this paper
Juszczak, P., Duin, R.P.W. (2003). Selective Sampling Methods in One-Class Classification Problems. In: Kaynak, O., Alpaydin, E., Oja, E., Xu, L. (eds) Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003. ICANN ICONIP 2003 2003. Lecture Notes in Computer Science, vol 2714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44989-2_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40408-8
Online ISBN: 978-3-540-44989-8