Part of the book series: Progress in Computer Science and Applied Logic ((PCS,volume 14))

  • 270 Accesses

Abstract

Machine learning has been formalized as the problem of estimating a conditional distribution as the ‘concept’ to be learned. The learning algorithm is based upon the MDL (Minimum Description Length) principle. The asymptotically optimal learning rate is determined for a typical example.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 85.59
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 106.99
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
EUR 106.99
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  • Angluin, D., Laird, P. (1988), ‘Learning from Noisy Examples’, Machine Learning 2, 343–370

    Google Scholar 

  • Elias, P. (1975), ‘Universal Codeword Sets and Representations of the Integers’, IEEE Trans. Inf. Theory, Vol. IT-21, no. 2, 194–203

    Article  MathSciNet  Google Scholar 

  • Haussler, D. (1989), ‘Generalizing the PAC Model for Neural Net and Other Learning Applications’, Technical Report UCSC CRL-89-30, University of Santa Cruz, September

    Google Scholar 

  • Kearns M., Li M. (1988), ‘Learning in the Presence of Malicious Errors’, Proc. of the 20th Annual ACM Symposium on Theory of Computing’, Illinois, May 1988

    Google Scholar 

  • Pollard, D. (1984), Convergence of Stochastic Processes, Springer-Verlag, New York, (215 pages)

    MATH  Google Scholar 

  • Rissanen, J. (1983), ‘A Universal Prior for Integers and Estimation by Minimum Description Length’, Annals of Statistics, Vol. 11, No. 2, 416–431

    Article  MathSciNet  MATH  Google Scholar 

  • Rissanen, J. (1984), ‘Universal Coding, Information, Prediction, and Estimation’, IEEE Trans. Inf. Theory, Vol. IT-30, No. 4, 629–636

    Article  MathSciNet  Google Scholar 

  • Rissanen, J. (1986), ‘Stochastic Complexity and Modeling’, Annals of Statistics, Vol 14, 1080–1100

    Article  MathSciNet  MATH  Google Scholar 

  • Rissanen, J. (1989), Stochastic Complexity in Statistical Inquiry, World Scientific Publ. Co., New Jersey, (175 pages)

    MATH  Google Scholar 

  • Rissanen, J. (1994), ‘Stochastic Complexity and Fisher Information’ (submitted to IEEE Trans. Inf. Theory)

    Google Scholar 

  • Rissanen, J., Speed, T., Yu, B. (1989), ‘Density Estimation by Stochastic Complexity’, IEEE Trans. Inf. Theory, Vol. IT-38, No. 2, 315–323

    Google Scholar 

  • Rivest, R.L. (1987), ‘Learning Decision Lists’, Machine Learning, 2, 229–246

    MathSciNet  Google Scholar 

  • Stone, C.J. (1980), ‘Optimal Rates of Convergence for Nonparametric Estimators’, Annals of Statistics, Vol. 8, No. 6, 1348–1360

    Article  MathSciNet  MATH  Google Scholar 

  • Suzuki, J. (1990), ‘Generalization of the Learning Method for Classifying Rules with Consistency Irrespective of the Representation Form and the Number of Classified Patterns’, Int. Symposium on Information Theory and its Applications, Hawaii, November 1990

    Google Scholar 

  • Valiant, L.G. (1984), ‘A Theory of the Learnable’, Comm. of the ACM, 27, 1134–1142

    Article  MATH  Google Scholar 

  • Yamanishi, K. (1990), ‘A Learning Criterion for Stochastic Rules’, Proc. of the Third Annual Workshop on Computational Learning Theory, August 1990

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Birkhäuser Boston

About this chapter

Cite this chapter

Rissanen, J., Yu, B. (1996). Learning by MDL. In: Kueker, D.W., Smith, C.H. (eds) Learning and Geometry: Computational Approaches. Progress in Computer Science and Applied Logic, vol 14. Birkhäuser Boston. https://doi.org/10.1007/978-1-4612-4088-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4612-4088-4_1

  • Publisher Name: Birkhäuser Boston

  • Print ISBN: 978-1-4612-8646-2

  • Online ISBN: 978-1-4612-4088-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Navigation