Learning by MDL

Rissanen, J.; Yu, Bin

doi:10.1007/978-1-4612-4088-4_1

J. Rissanen⁴ &
Bin Yu⁵

Part of the book series: Progress in Computer Science and Applied Logic ((PCS,volume 14))

270 Accesses

Abstract

Machine learning has been formalized as the problem of estimating a conditional distribution as the ‘concept’ to be learned. The learning algorithm is based upon the MDL (Minimum Description Length) principle. The asymptotically optimal learning rate is determined for a typical example.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 85.59; Price includes VAT (Germany)

Softcover Book: EUR 106.99; Price includes VAT (Germany)

Hardcover Book: EUR 106.99; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

An Introduction to the Minimum Description Length Principle

Machine Learning: A Concise Overview

Essential Concepts in Artificial Intelligence and Machine Learning

References

Angluin, D., Laird, P. (1988), ‘Learning from Noisy Examples’, Machine Learning 2, 343–370
Google Scholar
Elias, P. (1975), ‘Universal Codeword Sets and Representations of the Integers’, IEEE Trans. Inf. Theory, Vol. IT-21, no. 2, 194–203
Article MathSciNet Google Scholar
Haussler, D. (1989), ‘Generalizing the PAC Model for Neural Net and Other Learning Applications’, Technical Report UCSC CRL-89-30, University of Santa Cruz, September
Google Scholar
Kearns M., Li M. (1988), ‘Learning in the Presence of Malicious Errors’, Proc. of the 20th Annual ACM Symposium on Theory of Computing’, Illinois, May 1988
Google Scholar
Pollard, D. (1984), Convergence of Stochastic Processes, Springer-Verlag, New York, (215 pages)
MATH Google Scholar
Rissanen, J. (1983), ‘A Universal Prior for Integers and Estimation by Minimum Description Length’, Annals of Statistics, Vol. 11, No. 2, 416–431
Article MathSciNet MATH Google Scholar
Rissanen, J. (1984), ‘Universal Coding, Information, Prediction, and Estimation’, IEEE Trans. Inf. Theory, Vol. IT-30, No. 4, 629–636
Article MathSciNet Google Scholar
Rissanen, J. (1986), ‘Stochastic Complexity and Modeling’, Annals of Statistics, Vol 14, 1080–1100
Article MathSciNet MATH Google Scholar
Rissanen, J. (1989), Stochastic Complexity in Statistical Inquiry, World Scientific Publ. Co., New Jersey, (175 pages)
MATH Google Scholar
Rissanen, J. (1994), ‘Stochastic Complexity and Fisher Information’ (submitted to IEEE Trans. Inf. Theory)
Google Scholar
Rissanen, J., Speed, T., Yu, B. (1989), ‘Density Estimation by Stochastic Complexity’, IEEE Trans. Inf. Theory, Vol. IT-38, No. 2, 315–323
Google Scholar
Rivest, R.L. (1987), ‘Learning Decision Lists’, Machine Learning, 2, 229–246
MathSciNet Google Scholar
Stone, C.J. (1980), ‘Optimal Rates of Convergence for Nonparametric Estimators’, Annals of Statistics, Vol. 8, No. 6, 1348–1360
Article MathSciNet MATH Google Scholar
Suzuki, J. (1990), ‘Generalization of the Learning Method for Classifying Rules with Consistency Irrespective of the Representation Form and the Number of Classified Patterns’, Int. Symposium on Information Theory and its Applications, Hawaii, November 1990
Google Scholar
Valiant, L.G. (1984), ‘A Theory of the Learnable’, Comm. of the ACM, 27, 1134–1142
Article MATH Google Scholar
Yamanishi, K. (1990), ‘A Learning Criterion for Stochastic Rules’, Proc. of the Third Annual Workshop on Computational Learning Theory, August 1990
Google Scholar

Download references

Author information

Authors and Affiliations

IBM Almaden Research Center, San Jose, Ca, 95120-6099, USA
J. Rissanen
Dept. of Statistics, University of California, Berkeley, Ca, 94720, USA
Bin Yu

Authors

J. Rissanen
View author publications
You can also search for this author in PubMed Google Scholar
Bin Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics, University of Maryland, 20742, College Park, MD, USA
David W. Kueker
Department of Computer Science, University of Maryland, 20742, College Park, MD, USA
Carl H. Smith

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rissanen, J., Yu, B. (1996). Learning by MDL. In: Kueker, D.W., Smith, C.H. (eds) Learning and Geometry: Computational Approaches. Progress in Computer Science and Applied Logic, vol 14. Birkhäuser Boston. https://doi.org/10.1007/978-1-4612-4088-4_1

Download citation

DOI: https://doi.org/10.1007/978-1-4612-4088-4_1
Publisher Name: Birkhäuser Boston
Print ISBN: 978-1-4612-8646-2
Online ISBN: 978-1-4612-4088-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Learning by MDL

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Introduction to the Minimum Description Length Principle

Machine Learning: A Concise Overview

Essential Concepts in Artificial Intelligence and Machine Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Learning by MDL

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Introduction to the Minimum Description Length Principle

Machine Learning: A Concise Overview

Essential Concepts in Artificial Intelligence and Machine Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation