Search Page | SpringerLink

Unsupervised phoneme segmentation of continuous Arabic speech

The development of a speech recognition system for the Arabic language presents a significant challenge, mainly due to the limited availability of...

Hind Ait Mait, Noureddine Aboutabit in International Journal of Speech Technology

Article 02 May 2024

End-to-end ASR framework for Indian-English accent: using speech CNN-based segmentation

The superiority of Automatic Speech Recognition (ASR) has significantly enhanced over time, with a focus from short utterance circumstances to longer...

Ghayas Ahmed, Aadil Ahmad Lawaye in International Journal of Speech Technology

Article 11 November 2023

A joint method for Chinese word segmentation and part-of-speech labeling based on deep neural network

Aiming at the sequential tasks of Chinese word segmentation and part-of-speech labeling, this paper proposes a parallel model for word segmentation...

Lichi Yuan in Soft Computing

Article 21 April 2022

Temporal feature-based approaches for enhancing phoneme boundary detection and masking in speech

Automatic phoneme boundary detection is a key problem in speech processing and applications. The accurate phoneme segmentation in continuous speech...

Shaik Mulla Shabber, Mohan Bansal in International Journal of Speech Technology

Article 25 June 2024

Modern Standard Arabic speech disorders corpus for digital speech processing applications

Digital speech processing applications including automatic speech recognition (ASR), speaker recognition, speech translation, and others, essentially...

Assal A. M. Alqudah, Mohammad A. M. Alshraideh, ... Ahmad A. S. Sharieh in International Journal of Speech Technology

Article 13 March 2024

Sentence-Level Automatic Speech Segmentation for Amharic

The extraction of information from a large archive requires extracting both audio file structure and its linguistic content. One of these processes...

Rahel Mekonen Tamiru, Solomon Teferra Abate in Proceedings of Sixth International Congress on Information and Communication Technology

Conference paper 2022

Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals

This paper proposes a new method that detects the repeated keyword/phrase patterns from speech utterances by performing pattern discovery at the...

Kishore Kumar Ravi, Sreenivasa Rao Krothapalli in Circuits, Systems, and Signal Processing

Article 15 November 2021

Processing of Chinese language and text information system under the background of speech recognition

With the popularization of computers, artificial intelligence technology has become more and more mature, among which speech recognition technology...

Huiqin Cao, Peng He, Cheng** Wang in Soft Computing

Article 10 June 2023

Reducing DFT Leakage in Speech Recognition Using Pitch Segmentation

Frame segmentation which is dividing a signal into frames, is one of the most important part in speech recognition. In general, the frame’s size of...

Sopon Wiriyarattanakul, Piroon Kaewfoongrungsi, Ekkalak Sumonphan in Second International Conference on Image Processing and Capsule Networks

Conference paper 2022

Dynamic Thresholding with Short-Time Signal Features in Continuous Bangla Speech Segmentation

This paper presents short-term signal processing strategies for segmenting continuous Bangla speech based on short-time signal features (also known...

Md Mijanur Rahman, Mahnuma Rahman Rinty in Proceedings of the International Conference on Paradigms of Computing, Communication and Data Sciences

Conference paper 2023

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

Abstract —

The article considers the problem of automatic segmentation of a speech signal into phonetic units in conditions of their a priori uncertain...

V. V. Savchenko, A. V. Savchenko in Journal of Communications Technology and Electronics

Article 26 November 2020

Deep Learning-Based Empirical and Sub-Space Decomposition for Speech Enhancement

This research presents a single-channel speech enhancement approach based on the combination of the adaptive empirical wavelet transform and the...

Khaoula Mraihi, Mohamed Anouar Ben Messaoud in Circuits, Systems, and Signal Processing

Article 20 February 2024

MSLID-TCN: multi-stage linear-index dilated temporal convolutional network for temporal action segmentation

Temporal Convolutional Network (TCN) has received extensive attention in the field of speech synthesis. Many researchers use TCN-based models for...

Suo Gao, Rui Wu, ... **anglong Tang in International Journal of Machine Learning and Cybernetics

Article 18 June 2024

Simultaneous Speech Extraction for Multiple Target Speakers Under Meeting Scenarios

The common target speech separation directly estimates the target source, ignoring the interrelationship between different speakers at each frame. We...

Bang Zeng, Hongbin Suo, ... Ming Li in Journal of Shanghai Jiaotong University (Science)

Article 11 May 2024

Time-domain adaptive attention network for single-channel speech separation

Recent years have witnessed a great progress in single-channel speech separation by applying self-attention based networks. Despite the excellent...

Kunpeng Wang, Hao Zhou, ... Juan Yao in EURASIP Journal on Audio, Speech, and Music Processing

Article Open access 11 May 2023

A generic optimization and learning framework for Parkinson disease via speech and handwritten records

Parkinson’s disease (PD) is a neurodegenerative disorder with slow progression whose symptoms can be identified at late stages. Early diagnosis and...

Nada R. Yousif, Hossam Magdy Balaha, ... Eman M. El-Gendy in Journal of Ambient Intelligence and Humanized Computing

Article Open access 26 August 2022

Survey on Arabic speech emotion recognition

Emotions represent a fundamental aspect when evaluating user satisfaction or collecting customer feedback in human interactions, as well as in the...

Latifa Iben Nasr, Abir Masmoudi, Lamia Hadrich Belguith in International Journal of Speech Technology

Article 01 March 2024

Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling

This article presents the research work on improving speech recognition systems for the morphologically complex Malayalam language using subword...

Kavya Manohar, Jayan A R, Rajeev Rajan in EURASIP Journal on Audio, Speech, and Music Processing

Article Open access 04 November 2023

Robust speech recognition in sports competition review based on natural language processing

Mass communication media is develo** at a fast speed, and they have obtained richer methods through different ways of combining, which has...

Penglong Wang, Yuhong Feng, ... Shengdong Yang in International Journal of System Assurance Engineering and Management

Article 24 June 2023

Exploring Generation of Pronunciation Lexicon for Low-Resource Language Automatic Speech Recognition Based on Generic Phone Recognizer

The lexicon is an essential component in the hybrid automatic speech recognition (ASR) system. However, a high-quality lexicon requires significant...

**peng Li, **e Chen, Weiqiang Zhang in Journal of Shanghai Jiaotong University (Science)

Article 23 April 2024

Search

Filters

Search Results

Search

Navigation