Search
Search Results
-
Unsupervised phoneme segmentation of continuous Arabic speech
The development of a speech recognition system for the Arabic language presents a significant challenge, mainly due to the limited availability of...
-
End-to-end ASR framework for Indian-English accent: using speech CNN-based segmentation
The superiority of Automatic Speech Recognition (ASR) has significantly enhanced over time, with a focus from short utterance circumstances to longer...
-
A joint method for Chinese word segmentation and part-of-speech labeling based on deep neural network
Aiming at the sequential tasks of Chinese word segmentation and part-of-speech labeling, this paper proposes a parallel model for word segmentation...
-
Temporal feature-based approaches for enhancing phoneme boundary detection and masking in speech
Automatic phoneme boundary detection is a key problem in speech processing and applications. The accurate phoneme segmentation in continuous speech...
-
Modern Standard Arabic speech disorders corpus for digital speech processing applications
Digital speech processing applications including automatic speech recognition (ASR), speaker recognition, speech translation, and others, essentially...
-
Sentence-Level Automatic Speech Segmentation for Amharic
The extraction of information from a large archive requires extracting both audio file structure and its linguistic content. One of these processes... -
Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals
This paper proposes a new method that detects the repeated keyword/phrase patterns from speech utterances by performing pattern discovery at the...
-
Processing of Chinese language and text information system under the background of speech recognition
With the popularization of computers, artificial intelligence technology has become more and more mature, among which speech recognition technology...
-
Reducing DFT Leakage in Speech Recognition Using Pitch Segmentation
Frame segmentation which is dividing a signal into frames, is one of the most important part in speech recognition. In general, the frame’s size of... -
Dynamic Thresholding with Short-Time Signal Features in Continuous Bangla Speech Segmentation
This paper presents short-term signal processing strategies for segmenting continuous Bangla speech based on short-time signal features (also known... -
Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation
Abstract —The article considers the problem of automatic segmentation of a speech signal into phonetic units in conditions of their a priori uncertain...
-
Deep Learning-Based Empirical and Sub-Space Decomposition for Speech Enhancement
This research presents a single-channel speech enhancement approach based on the combination of the adaptive empirical wavelet transform and the...
-
MSLID-TCN: multi-stage linear-index dilated temporal convolutional network for temporal action segmentation
Temporal Convolutional Network (TCN) has received extensive attention in the field of speech synthesis. Many researchers use TCN-based models for...
-
Simultaneous Speech Extraction for Multiple Target Speakers Under Meeting Scenarios
The common target speech separation directly estimates the target source, ignoring the interrelationship between different speakers at each frame. We...
-
Time-domain adaptive attention network for single-channel speech separation
Recent years have witnessed a great progress in single-channel speech separation by applying self-attention based networks. Despite the excellent...
-
A generic optimization and learning framework for Parkinson disease via speech and handwritten records
Parkinson’s disease (PD) is a neurodegenerative disorder with slow progression whose symptoms can be identified at late stages. Early diagnosis and...
-
Survey on Arabic speech emotion recognition
Emotions represent a fundamental aspect when evaluating user satisfaction or collecting customer feedback in human interactions, as well as in the...
-
Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling
This article presents the research work on improving speech recognition systems for the morphologically complex Malayalam language using subword...
-
Robust speech recognition in sports competition review based on natural language processing
Mass communication media is develo** at a fast speed, and they have obtained richer methods through different ways of combining, which has...
-
Exploring Generation of Pronunciation Lexicon for Low-Resource Language Automatic Speech Recognition Based on Generic Phone Recognizer
The lexicon is an essential component in the hybrid automatic speech recognition (ASR) system. However, a high-quality lexicon requires significant...