Search
Search Results
-
Usefulness of glottal excitation source information for audio-visual speech recognition system
In this work, the excitation source based glottal information is explored as a supplementary evidence for develo** robust audio-visual speech...
-
Modeling Source and System Features Through Multi-channel Convolutional Neural Network for Improving Intelligibility Assessment of Dysarthric Speech
This paper investigates the nuanced characteristics of the spectral envelope attributes due to vocal-tract resonance structure and fine-level...
-
An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
Speech is one of the communication processes of humans. One of the important features of speech is to convey the inner feelings of the person to the...
-
Self-supervised Learning for Speech Emotion Recognition Task Using Audio-visual Features and Distil Hubert Model on BAVED and RAVDESS Databases
Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types,...
-
Spectro-Temporal Energy Ratio Features for Single-Corpus and Cross-Corpus Experiments in Speech Emotion Recognition
In this study, novel Spectro-Temporal Energy Ratio features based on the formants of vowels, linearly spaced low-frequency, and logarithmically...
-
Real-Time Automatic Continuous Speech Recognition System for Kannada Language/Dialects
In this work, we present recent advancements in our earlier automatic continuous Kannada speech recognition (ACKSR) system under real-time...
-
Mi-Go: tool which uses YouTube as data source for evaluating general-purpose speech recognition machine learning models
This article introduces Mi-Go, a tool aimed at evaluating the performance and adaptability of general-purpose speech recognition machine learning...
-
An experiment of Moroccan dialect speech recognition in noisy environments using PocketSphinx
In this study, we introduce an experimental framework for Moroccan dialect speech recognition under various additive noise conditions using the...
-
An Open-Source Voice Command-Based Human-Computer Interaction System Using Speech Recognition Platforms
Voice command-based human-computer interaction (HCI) is becoming useful and practical day by day. Here, we present an open-source voice command-based... -
An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks
Dysarthric speech is the noisy or source distortion speech. Reasonable speech enhancement is required to obtain higher communication quality for...
-
Cross-corpus speech emotion recognition using subspace learning and domain adaption
Speech emotion recognition (SER) is a hot topic in speech signal processing. When the training data and the test data come from different corpus,...
-
Analysis of influencing features with spectral feature extraction and multi-class classification using deep neural network for speech recognition system
There is a drastic need for extracting information from non-linguistic features of the audio sources. It leads to the eminent rise of speech...
-
Processing of Chinese language and text information system under the background of speech recognition
With the popularization of computers, artificial intelligence technology has become more and more mature, among which speech recognition technology...
-
Research on English speech recognition system and training enhancement based on bat algorithm and acoustic model inspection
Because of its own language characteristics, English has become the main language tool for communication among countries in the world under the...
-
Amazigh CNN speech recognition system based on Mel spectrogram feature extraction method
The field of speech recognition makes it simpler for humans and machines to engage with speech. Number-oriented communication, such as using a...
-
Mixed-modality speech recognition and interaction using a wearable artificial throat
Researchers have recently been pursuing technologies for universal speech recognition and interaction that can work well with subtle sounds or noisy...
-
Transfer Accent Identification Learning for Enhancing Speech Emotion Recognition
Emotional speech has some dependency on language or within a language itself, there are certain variations due to accents. The presence of accents...
-
Fifth-generation edge computing-oriented speech recognition system applied in Japanese education and social sentiment classification
With the rapid development of the fifth-generation (5G) mobile technology, smart devices and their diversified business applications are booming, and...
-
A rough set theory and deep learning-based predictive system for gender recognition using audio speech
Speech is one of the most delicate medium through which gender of the speakers can easily be identified. Though the related research has shown very...
-
Speech Emotion Recognition Using Generative Adversarial Network and Deep Convolutional Neural Network
Speech emotion recognition (SER) has recently increased because of vast innovations in human–computer interaction and affective computing. In recent...