Bangla Spoken Numerals Recognition by Using HMM

  • Conference paper
  • First Online:
Computational Intelligence in Pattern Recognition

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1349))

Abstract

Speech is one of the most natural forms of vocalized communication media. Nowadays with the advancement of machine learning, different doors are opened to us for finding several standard ways to step out in the real world. ASR is just like the door to explore the concept of communication through speech between human and digital devices that can recognize speech. In this paper, we have designed a Hidden Markov Model-based isolated Bangla numerals recognition system where the Short-Term Fourier Transform is used for collecting the feature vectors. The defined system achieved 91.50% accuracy for our own dataset of 2000 uttered samples for 10 classes, which gives a satisfied result for this Bangla numerals recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Thailand)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 181.89
Price includes VAT (Thailand)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 219.99
Price excludes VAT (Thailand)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Karpagavalli, S., Chandra, E.: Phoneme and word based model for Tamil speech recognition using GMM-HMM. In: 2015 International Conference on Advanced Computing and Communication Systems, pp. 1–5. IEEE (2015)

    Google Scholar 

  2. Abdullah-al-MAMUN, M.D., Mahmud, F.: Performance analysis of isolated Bangla speech recognition system using Hidden Markov Model. Int. J. Sci. Eng. Res. 6(1)

    Google Scholar 

  3. Hammami, N., Bedda, M., Farah, N.: Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT), pp. 160–163. IEEE (2012)

    Google Scholar 

  4. Chauhan, V., Dwivedi, S., Karale, P., Potdar, S.M.: Speech to text converter using Gaussian mixture model (GMM). Int Res J Eng Technol 3(5), 160–164 (2016)

    Google Scholar 

  5. Rosdi, F., Ainon, R.N.: Isolated Malay speech recognition using Hidden Markov Models. In: 2008 International Conference on Computer and Communication Engineering, pp. 721–725. IEEE (2008)

    Google Scholar 

  6. Ali, M.A., Hossain, M., Bhuiyan, M.N.: Automatic speech recognition technique for Bangla words. Int. J. Adv. Sci. Technol. 50 (2013)

    Google Scholar 

  7. Najkar, N., Razzazi, F., Sameti, H.: A novel approach to HMM-based speech recognition systems using particle swarm optimization. Math Comput Model 52(11–12), 1910–1920 (2010)

    Article  Google Scholar 

  8. Paul, B., Bera, S., Paul, R., Phadikar, S.: Bengali spoken numerals recognition by MFCC and GMM technique. In: International Conference on Emerging Trends and Advances in Electrical Engineering and Renewable Energy (ETAEERE-2020) (2020)

    Google Scholar 

  9. https://kevinsprojects.wordpress.com/2014/12/13/short-time-fourier-transform-using-python-and-numpy/

  10. Bansal, P., Kant, A., Kumar, S., Sharda, A., Gupta, S.: Improved hybrid model of HMM/GMM for speech recognition. (2008)

    Google Scholar 

  11. Chavan, R.S., Sable, G.S.: An overview of speech recognition using HMM. Int J Comput Sci Mob Comput 2(6), 233–238 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Paul, B., Adhikary, D., Dey, T., Guchhait, S., Bera, S. (2022). Bangla Spoken Numerals Recognition by Using HMM. In: Das, A.K., Nayak, J., Naik, B., Dutta, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition . Advances in Intelligent Systems and Computing, vol 1349. Springer, Singapore. https://doi.org/10.1007/978-981-16-2543-5_8

Download citation

Publish with us

Policies and ethics

Navigation