Abstract
Naive Bayes is one of the most efficient classification algorithms and probably the simplest, which makes it a natural choice when you want quick predictions on a high-dimensional dataset. Even with several thousand data points and many features, it trains quickly and is easy to build, so it supports fast machine learning models that can deliver predictions in real time. The algorithm is based on Bayes’ theorem, and I present a detailed computation of the various probability terms of Bayes’ theorem so that you understand how the algorithm works. I also discuss the advantages and disadvantages of the algorithm, where it can be applied and where it cannot, and I give you a few techniques for improving its performance. The sklearn library provides several implementations corresponding to the Naive Bayes variants, such as Multinomial, Bernoulli, and so on; you will learn these various types. Last, I discuss how to fit the model on huge datasets, followed by a complete classification example on a large text corpus.
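For orientation, Bayes’ theorem states that P(y|x) = P(x|y) P(y) / P(x), and the sklearn variants mentioned above differ mainly in how the likelihood P(x|y) is modeled. The following minimal sketch is not taken from the chapter: the toy corpus, labels, and parameter choices are assumptions made purely for illustration. It shows a Multinomial Naive Bayes classifier fitted on word-count features, and notes the partial_fit call commonly used for out-of-core training on large datasets.

    # Illustrative sketch only: the corpus, labels, and settings below are
    # assumptions for demonstration, not examples from the chapter.
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.naive_bayes import MultinomialNB

    docs = ["cheap pills buy now", "meeting agenda attached",
            "win a free prize now", "project status update"]
    labels = [1, 0, 1, 0]  # 1 = spam, 0 = ham (toy labels)

    vectorizer = CountVectorizer()
    X = vectorizer.fit_transform(docs)  # word-count features suit MultinomialNB

    model = MultinomialNB()
    model.fit(X, labels)

    print(model.predict(vectorizer.transform(["free prize pills"])))

    # For datasets too large to fit in memory, the same estimator can be
    # trained incrementally on mini-batches via partial_fit, passing the
    # full list of classes on the first call:
    # model.partial_fit(X_batch, y_batch, classes=[0, 1])

BernoulliNB follows the same fit/predict pattern but models binary (presence/absence) features rather than counts.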
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Sarang, P. (2023). Naive Bayes. In: Thinking Data Science. The Springer Series in Applied Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-031-02363-7_7
DOI: https://doi.org/10.1007/978-3-031-02363-7_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-02362-0
Online ISBN: 978-3-031-02363-7
eBook Packages: Mathematics and Statistics (R0)