Log in

Optical image embedding in speech signals with sensitivity analysis

  • Research Article
  • Published:
Journal of Optics Aims and scope Submit manuscript

Abstract

This paper is mainly concerned with the embedding of optical images in auxiliary media. Optical images may contain sensitive information. They are embedded in cover media such as speech signals. This process is regarded as a type of watermarking. The Singular Value Decomposition (SVD) of 2D matrices generated from the cover media is used for watermark embedding. It is well-known that the Singular Values (SVs) of 2D matrices have low sensitivity to variations in the cover signal represented as noise or enhancement through processing algorithms. Noise affects the watermarked speech signal and affects the extraction of the watermark. Different enhancement algorithms are considered and compared for testing of the proposed scheme. It is clear from the obtained results that the proposed scheme is highly efficient for optical image hiding, even with signal processing techniques applied to cover signals. Simulation experiments indicate the effect of the presence of noise on the watermark extraction and also the effect of applying speech enhancement on the watermark extraction. The correlation coefficient (Cr) between the embedded and extracted watermarks is used to indicate the performance of different enhancement methods. The adaptive Wiener filter leads to the highest Cr, which equals 0.7491. Signal-to-Noise Ratio (SNR) is used to evaluate the speech enhancement performance. The SNR reaches the highest value equal to 12.0481 dB with adaptive Wiener filter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. F. Yan, C. Wang, J. Dou, Y. Liu, X. Yang, Application of speech recognition technology in power grid dispatching automation. IOP Conf. Ser.: Mater. Sci. Eng. 394(4), 042111 (2018)

    Article  Google Scholar 

  2. C. Macartney, T. Weyde, Improved speech enhancement with the wave-u-net. ar**v preprint ar**v:1811.11307 (2018).

  3. C.O. Sakar, G. Serbes, A. Gunduz, H.C. Tunc, H. Nizam, B. Sakar, M. Tutuncu, T. Aydin, M. Erdem Isenkul, H. Apaydin, A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform. Appl. Soft Comput. 74, 255–263 (2019)

    Article  Google Scholar 

  4. N. Upadhyay, A. Karmakar, Speech Enhancement using Spectral Subtraction-type Algorithms: A Comparison and Simulation Study. Procedia Comput. Sci. 54, 574–584 (2015)

    Article  Google Scholar 

  5. M. Karam, H.F. Khazaal, H. Aglan, C. Cole, Noise removal in speech processing using spectral subtraction. J. Signal Inf. Process. 2014, 1 (2014)

    Google Scholar 

  6. X. Yan, Z. Yang, T. Wang, H. Guo, An iterative graph spectral subtraction method for speech enhancement. Speech Commun. 123, 35–42 (2020)

    Article  Google Scholar 

  7. X. Hao, X. Su, S. Wen, Z. Wang, Y. Pan, F. Bao, W. Chen, Masking and inpainting: a two-stage speech enhancement approach for low SNR and non-stationary noise, in ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6959–6963 (2020)

  8. Y.G. Thimmaraja, B.G. Nagaraja, H.S. Jayanna, Speech enhancement and encoding by combining SS-VAD and LPC. Int. J. Speech Technol. 24(1), 165–172 (2021)

    Article  Google Scholar 

  9. M. G. Jimenez, D. E. Romero, G. J. Dolecek, Comb filters characteristics and applications, in Encyclopedia of Information Science and Technology, 3rd Edition, pp. 4062–4071 (2015)

  10. M. Kareem, A. Saleeb, S. M. El-Dolil, A. El-Fishawy,F. E. Abd El-Samie and M. I. Dessouky, Efficient comb-based filter for cancelable speaker identification system, in Proceedings of International Conference on Electronic Engineering (ICEEM), Menouf, Egypt, pp. 1–7 (2021)

  11. S. Abd El-Moneim, M.I. Dessouky, F.E. Abd El-Samie, M.A. Nassar, M. Abd El-Naby, Hybrid speech enhancement with empirical mode decomposition and spectral subtraction for efficient speaker identification. Int. J. Speech Technol. 18, 555–564 (2015)

    Article  Google Scholar 

  12. N. Upadhyay, R.K. Jaiswal, Single channel speech enhancement: using Wiener filtering with recursive noise estimation. Procedia Comput. Sci. 84, 22–30 (2016)

    Article  Google Scholar 

  13. K. Bhatt, C.S. Vinitha, R. Gupta, Secure speech enhancement using LPC based fem in wiener filter, in Data Engineering and Intelligent Computing: Advances in Intelligent Systems and Computing. ed. by S. Satapathy, V. Bhateja, K. Raju, B. Janakiramaiah (Springer, Singapore, 2018), pp.657–665

    Chapter  Google Scholar 

  14. M.A. Abd El-Fattah, M.I. Dessouky, A.M. Abbas, S.M. Diab, E.M. El- Rabaie, W. Al-Nuaimy, S.A. Alshebeili, F.E. Abd El-samie, Speech enhancement with an adaptive wiener filter. Int. J. Speech Technol. 17, 53–64 (2014)

    Article  Google Scholar 

  15. A. Yelwande, S. Kansal, A. Dixit, Adaptive wiener filter for speech enhancement, in Proceedings of International Conference on Information, Communication, Instrumentation and Control (2017)

  16. S. El Gazar, A.M. Abbas, S. El-Dolil, I.M. El-Dokany, M.I. Dessouky, E.-S.M. El-Rabaie, F.E. Abd El-Samie, Efficient SVD speech watermarking with encrypted images. Int. J. Speech Technol. 21(4), 953–965 (2018)

    Article  Google Scholar 

Download references

Funding

Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Safaa El-Gazar.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mordy, E.aE., El-Gazar, S., El-Dolil, S. et al. Optical image embedding in speech signals with sensitivity analysis. J Opt 53, 1733–1740 (2024). https://doi.org/10.1007/s12596-023-01178-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12596-023-01178-x

Keywords

Navigation