Log in

HPO Based Enhanced Elman Spike Neural Network for Detecting Speech of People with Dysarthria

  • Published:
Optical Memory and Neural Networks Aims and scope Submit manuscript

Abstract

Motor speech condition called dysarthria is caused by a lack of movement in the lips, tongue, vocal cords, and diaphragm are a few of the muscles needed to produce speech. Speech that is slurred, sluggish, or inaccurate might be the initial sign of dysarthria, which varies in severity. Parkinson’s disease, muscular dystrophy, multiple sclerosis, brain tumors, brain damage, and amyotrophic lateral sclerosis are among the health problems that can result from dysarthria. This research develops an efficient method for extracting features and classifying dysarthria affected persons from speech signals. This suggested method uses a speech signal as its source. The supplied speech signal is pre-processed to improve the identification of dysarthria speech. Pre-processing methods like the Butterworth band pass filter and Savitzky Golay digital FIR filter are used to smoothing the raw data. After pre-processing, the signals are input into the feature extraction techniques, such as Yule-Walker Autoregressive modelling, Mel frequency cepstral coefficients and Perceptual Linear Predictive to extract the important features. The dysarthria speech is finally detected using an improved Elman Spike Neural Network (EESNN) algorithm-based classifier. Hunter Prey Optimization (HPO) is used to select the weights of EESNN optimally. The proposed algorithm achieves 94.25% accuracy and 94.26% specificity values. Thus this proposed approach is the best choice for predicting dysarthria disease using speech signal.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.
Fig. 7.
Fig. 8.
Fig. 9.
Fig. 10.
Fig. 11.
Fig. 12.

REFERENCES

  1. Gurugubelli Gurugubelli, K., and Vuppala, A.K., Analytic phase features for dysarthric speech detection and intelligibility assessment, Speech Commun., 2020, vol. 121, pp. 1–15.

    Article  Google Scholar 

  2. Millet, J. and Zeghidour, N., Learning to detect dysarthria from raw speech, in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2019, pp. 5831–5835.

  3. Shih, D.H., Liao, C.H., Wu, T.W., Xu, X.Y., and Shih, M.H., Dysarthria speech detection using convolutional neural networks with gated recurrent unit, in Healthcare, MDPI, 2022, vol. 10, no. 10, p. 1956.

  4. Ijitona, T.B., Soraghan, J.J., Lowit, A., Di-Caterina, G., and Yue, H., Automatic detection of speech disorder in dysarthria using extended speech feature extraction and neural networks classification, 3rd Internationl Conference on Intelligent Signal Processing, London, United Kingdom, December 2017.

  5. Korzekwa, D., Barra-Chicote, R., Kostek, B., Drugman, T., and Lajszczak, M., Interpretable deep learning model for the detection and reconstruction of dysarthric speech. ar**v preprint ar**v:1907.04743, 2019.

  6. Novotný, M., Pospíšil, J., Čmejla, R., and Rusz, J., Automatic detection of voice onset time in dysarthric speech, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2015, pp. 4340–4344.

  7. Kodrasi, I. and Bourlard, H., Super-Gaussianity of speech spectral coefficients as a potential biomarker for dysarthric speech detection, in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2019, pp. 6400–6404.

  8. Kodrasi, I., Temporal envelope and fine structure cues for dysarthric speech detection using CNNs, IEEE Signal Process. Lett., 2021, vol. 28, pp. 1853–1857.

    Article  Google Scholar 

  9. Diwakar, G. and Karjigi, V., Improving speech to text alignment based on repetition detection for dysarthric speech, Circuits, Syst., Signal Process., 2020, vol. 39, 5543–5567.

    Article  Google Scholar 

  10. Wang, D., Deng, L., Yeung, Y.T., Chen, X., Liu, X., and Meng, H., Unsupervised domain adaptation for dysarthric speech detection via domain adversarial training and mutual information minimization. ar**v preprint ar**v:2106.10127, 2021.

  11. Sekhar, S.M., Kashyap, G., Bhansali, A., and Singh, K., Dysarthric-speech detection using transfer learning with convolutional neural networks, ICT Express, 2022, vol. 8, no. 1, pp. 61–64.

    Article  Google Scholar 

  12. Zaidi, B.F., Selouani, S.A., Boudraa, M., and Sidi Yakoub, M., Deep neural network architectures for dysarthric speech analysis and recognition, Neural Comput. Appl., 2021, vol. 33, pp. 9089–9108.

    Article  Google Scholar 

  13. Ramos, V.M., Hernandez-Diaz, H.A.K., Huici, M.E.H.D., Martens, H., van Nuffelen, G., and De Bodt, M., Acoustic features to characterize sentence accent production in dysarthric speech, Biomed. Signal Process. Control, 2020, vol. 57, p. 101750.

    Article  Google Scholar 

  14. Narendra, N.P., Schuller, B., and Alku, P., The detection of Parkinson’s disease from speech using voice source information, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2021, vol. 29, pp. 1925–1936.

    Article  Google Scholar 

  15. Yılmaz, E., Mitra, V., Sivaraman, G., and Franco, H., Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech, Comput. Speech Lang., 2019, vol. 58, pp. 319–334.

    Article  Google Scholar 

  16. Janbakhshi, P., Automatic Pathological Speech Assessment, EPFL, 2022, no. 9483.

  17. Madhu Keerthana, Y., Sreenivasa Rao, K., and Mitra, P., Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features, Int. J. Speech Technol., 2022, vol. 25, no. 4, pp. 967–973.

    Article  Google Scholar 

  18. Mahata, S., Kar, R., and Mandal, D., Optimal rational approximation of bandpass Butterworth filter with symmetric fractional-order roll-off, AEU-Int. J. Electron. Commun., 2020, vol. 117, p. 153106.

    Article  Google Scholar 

  19. Zhang, G., Hao, H., Wang, Y., Jiang, Y., Shi, J., Yu, J., and Yu, B., Optimized adaptive Savitzky-Golay filtering algorithm based on deep learning network for absorption spectroscopy, Spectrochim. Acta, Part A, 2021, vol. 263, p. 120187.

    Article  Google Scholar 

  20. Giri, P., Grzesiek, A., Żuławiński, W., Sundar, S., and Wyłomańska, A., The modified Yule-Walker method for multidimensional infinite-variance periodic autoregressive model of order 1, J. Korean Stat. Soc., 2023, vol. 52, no. 2, pp. 462–493.

    Article  MathSciNet  Google Scholar 

  21. Pawar, M.D. and Kokate, R.D., Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients, Multimedia Tools Appl., 2021, vol. 80, pp. 15563–15587.

    Article  Google Scholar 

  22. Solairaj, A., Sugitha, G., and Kavitha, G., Enhanced Elman spike neural network based sentiment analysis of online product recommendation, Appl. Soft Comput., 2023, vol. 132, p. 109789.

    Article  Google Scholar 

  23. Naruei, I., Keynia, F., and Sabbagh Molahosseini, A., Hunter-prey optimization: Algorithm and applications, Soft Comput., 2022, vol. 26, no. 3, pp. 1279–1314.

    Article  Google Scholar 

  24. Dataset 1. https://www.kaggle.com/datasets/iamhungundji/dysarthria-detection.

Download references

ACKNOWLEDGMENTS

AUTHOR CONTRIBUTIONS. The corresponding author claims the major contribution of the paper including formulation, analysis and editing. The co-authors provides guidance to verify the analysis result and manuscript editing.

Funding

This work was supported by ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pranav Kumar.

Ethics declarations

CONFLICT OF INTEREST

The authors of this work declare that they have no conflicts of interest.

DATA AND MATERIAL AVAILABILITY

Not applicable.

CODE AVAILABILITY

Not applicable.

Additional information

Publisher’s Note.

Allerton Press remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pranav Kumar, Ahmad, M.T. & Kumari, R. HPO Based Enhanced Elman Spike Neural Network for Detecting Speech of People with Dysarthria. Opt. Mem. Neural Networks 33, 205–220 (2024). https://doi.org/10.3103/S1060992X24700097

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S1060992X24700097

Keywords:

Navigation