We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.
Filters applied:

Search Results

Showing 1-20 of 10,000 results
  1. Speech enhancement based on emphasizing the fundamental frequency integrated with SNMF/DNN

    Single-channel speech enhancement is a popular problem in speech enhancement and related fields, but the traditional research direction is to improve...

    Tao Shi, Rizwan Ullah, Hongbo Jia in Multimedia Tools and Applications
    Article 08 June 2024
  2. Meta-reinforcement learning based few-shot speech reconstruction for non-intrusive speech quality assessment

    Speech quality assessment (SQA) is meaningful for modern communication systems and Quality of Service (QoS). At present, the non-intrusive SQA...

    Weili Zhou, **xiong Lai, ... Ruijie Ji in Applied Intelligence
    Article 21 October 2022
  3. Improving speech command recognition through decision-level fusion of deep filtered speech cues

    Living beings communicate through speech, which can be analysed to identify words and sentences by recognizing the flow of spoken utterances....

    Sunakshi Mehra, Virender Ranga, Ritu Agarwal in Signal, Image and Video Processing
    Article 11 November 2023
  4. Schlieren imaging and video classification of alphabet pronunciations: exploiting phonetic flows for speech recognition and speech therapy

    Speech is a highly coordinated process that requires precise control over vocal tract morphology/motion to produce intelligible sounds while...

    Mohamed Talaat, Kian Barari, ... **xiang ** in Visual Computing for Industry, Biomedicine, and Art
    Article Open access 22 May 2024
  5. PublicVR: a virtual reality exposure therapy intervention for adults with speech anxiety

    Speech anxiety, or Glossophobia, currently affects approximately 75% of the population with potentially severe negative effects on those with this...

    Fotios Spyridonis, Damon Daylamani-Zad, James Nightingale in Virtual Reality
    Article Open access 30 April 2024
  6. Robust Automatic Speech Recognition Using Wavelet-Based Adaptive Wavelet Thresholding: A Review

    Automatic speech recognition (ASR) is one of the most fascinating fields of research and the performance of ASR systems is most promising in a closed...

    Mahadevaswamy Shanthamallappa, Kiran Puttegowda, ... Sudheesh Kannur Vasudeva Rao in SN Computer Science
    Article 01 February 2024
  7. Speech Enhancement with Generative Diffusion Models

    Abstract

    An alternative approach to speech denoising using generative diffusion models that model the distribution of training data is proposed. In...

    O. V. Girfanov, A. G. Shishkin in Automatic Documentation and Mathematical Linguistics
    Article 01 October 2023
  8. Enhanced speech emotion recognition using averaged valence arousal dominance map** and deep neural networks

    This study delves into advancements in speech emotion recognition (SER) by establishing a novel approach for emotion map** and prediction using the...

    Davit Rizhinashvili, Abdallah Hussein Sham, Gholamreza Anbarjafari in Signal, Image and Video Processing
    Article 10 July 2024
  9. Improvement of automatic speech recognition systems utilizing 2D adaptive wavelet transformation applied to recurrence plot of speech trajectories

    Spectral-based features, typically used in ASR systems, do not capture the phase information of speech signals. Thus, exploiting new features that do...

    Shabnam Firooz, Farshad Almasganj, Yasser Shekofteh in Signal, Image and Video Processing
    Article 15 December 2023
  10. MetaRL-SE: a few-shot speech enhancement method based on meta-reinforcement learning

    The goal of speech enhancement is to reduce and suppress the noise in noisy speech and improve the quality and intelligibility of damaged speech....

    Weili Zhou, Ruijie Ji, **xiong Lai in Multimedia Tools and Applications
    Article 26 April 2023
  11. A Novel Attention-Guided Generative Adversarial Network for Whisper-to-Normal Speech Conversion

    Whispered speech is a special voicing style of speech that is employed publicly to protect speech information. It is also the primary pronunciation...

    Teng Gao, Qing Pan, ... Hon Keung Kwan in Cognitive Computation
    Article 16 January 2023
  12. HATS: An Open Data Set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics

    Conventionally, Automatic Speech Recognition (ASR) systems are evaluated on their ability to correctly recognize each word contained in a speech...
    Thibault Bañeras-Roux, Jane Wottawa, ... Richard Dufour in Text, Speech, and Dialogue
    Conference paper 2023
  13. Iterative-processed multiband speech enhancement for suppressing musical sounds

    A multiband spectral subtraction (MBSS) processing step transforms background noise into annoying musical sounds. The paper proposes an...

    Navneet Upadhyay in Multimedia Tools and Applications
    Article 21 October 2023
  14. Semantic speech analysis using machine learning and deep learning techniques: a comprehensive review

    Human cognitive functions such as perception, attention, learning, memory, reasoning, and problem-solving are all significantly influenced by...

    Suryakant Tyagi, Sándor Szénási in Multimedia Tools and Applications
    Article Open access 19 December 2023
  15. Adaptive attention mechanism for single channel speech enhancement

    The recent development of speech enhancement methods has incorporated attention mechanisms for learning long-term speech signal dependencies. The...

    Veeraswamy Parisae, S Nagakishore Bhavanam in Multimedia Tools and Applications
    Article 04 April 2024
  16. Ebola optimization based spiking neural network for automatic hate speech recognition

    In this paper, efficient machine learning technique is introduced to develop efficient machine learning model for hate speech recognition from the...

    A. Meenakshi, J. Anitha Ruth in International Journal of Information Technology
    Article 26 June 2024
  17. Man-Machine Speech Communication 17th National Conference, NCMMSC 2022, Hefei, China, December 15–18, 2022, Proceedings

    This book constitutes the refereed proceedings of the 17th National Conference on Man–Machine Speech Communication, NCMMSC 2022, held in China, in...
    Ling Zhenhua, Gao Jianqing, ... Jia Jia in Communications in Computer and Information Science
    Conference proceedings 2023
  18. Deep Learning Algorithms for Speech Emotion Recognition with Hybrid Spectral Features

    One of the popular research domains in Automatic Speech Recognition (ASR) is to identify emotions from the utterances of speech samples of human...

    Raghu Kogila, Manchala Sadanandam, Hanumanthu Bhukya in SN Computer Science
    Article 16 November 2023
  19. CommanderUAP: a practical and transferable universal adversarial attacks on speech recognition models

    Most of the adversarial attacks against speech recognition systems focus on specific adversarial perturbations, which are generated by adversaries...

    Zheng Sun, **xiao Zhao, ... Lei Ju in Cybersecurity
    Article Open access 05 June 2024
  20. Segmentation of Noisy Speech Signals

    Abstract

    One of the most important problems in digital speech-signal processing is distinguishing segments of active speech and of background noise or...

    S. D. Protserov, A. G. Shishkin in Scientific and Technical Information Processing
    Article 01 December 2022
Did you find what you were looking for? Share feedback.