Search Page | SpringerLink

Usefulness of glottal excitation source information for audio-visual speech recognition system

In this work, the excitation source based glottal information is explored as a supplementary evidence for develo** robust audio-visual speech...

Salam Nandakishor, Debadatta Pati in International Journal of Speech Technology

Article 14 November 2023

Modeling Source and System Features Through Multi-channel Convolutional Neural Network for Improving Intelligibility Assessment of Dysarthric Speech

This paper investigates the nuanced characteristics of the spectral envelope attributes due to vocal-tract resonance structure and fine-level...

Md. Talib Ahmad, Gayadhar Pradhan, Jyoti Prakash Singh in Circuits, Systems, and Signal Processing

Article 16 June 2024

An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques

Speech is one of the communication processes of humans. One of the important features of speech is to convey the inner feelings of the person to the...

Mohammed Jawad Al-Dujaili Al-Khazraji, Abbas Ebrahimi-Moghadam in Wireless Personal Communications

Article 01 January 2024

Self-supervised Learning for Speech Emotion Recognition Task Using Audio-visual Features and Distil Hubert Model on BAVED and RAVDESS Databases

Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types,...

Karim Dabbabi, Abdelkarim Mars in Journal of Systems Science and Systems Engineering

Article 29 May 2024

Spectro-Temporal Energy Ratio Features for Single-Corpus and Cross-Corpus Experiments in Speech Emotion Recognition

In this study, novel Spectro-Temporal Energy Ratio features based on the formants of vowels, linearly spaced low-frequency, and logarithmically...

Cevahir Parlak, Banu Diri, Yusuf Altun in Arabian Journal for Science and Engineering

Article 27 May 2023

Real-Time Automatic Continuous Speech Recognition System for Kannada Language/Dialects

In this work, we present recent advancements in our earlier automatic continuous Kannada speech recognition (ACKSR) system under real-time...

G. Thimmaraja Yadava, B. G. Nagaraja, G. P. Raghudathesh in Wireless Personal Communications

Article 01 January 2024

Mi-Go: tool which uses YouTube as data source for evaluating general-purpose speech recognition machine learning models

This article introduces Mi-Go, a tool aimed at evaluating the performance and adaptability of general-purpose speech recognition machine learning...

Tomasz Wojnar, Jarosław Hryszko, Adam Roman in EURASIP Journal on Audio, Speech, and Music Processing

Article Open access 01 May 2024

An experiment of Moroccan dialect speech recognition in noisy environments using PocketSphinx

In this study, we introduce an experimental framework for Moroccan dialect speech recognition under various additive noise conditions using the...

Abdelkbir Ouisaadane, Said Safi, Miloud Frikel in International Journal of Speech Technology

Article 31 May 2024

An Open-Source Voice Command-Based Human-Computer Interaction System Using Speech Recognition Platforms

Voice command-based human-computer interaction (HCI) is becoming useful and practical day by day. Here, we present an open-source voice command-based...

Adnan Mahmud Fuad, Sheikh Jahan Ahmed, ... Kamruddin Nur in Proceedings of the 2nd International Conference on Big Data, IoT and Machine Learning

Conference paper 2024

An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks

Dysarthric speech is the noisy or source distortion speech. Reasonable speech enhancement is required to obtain higher communication quality for...

Bhuvaneshwari Jolad, Rajashri Khanai in International Journal of Speech Technology

Article 21 February 2023

Cross-corpus speech emotion recognition using subspace learning and domain adaption

Speech emotion recognition (SER) is a hot topic in speech signal processing. When the training data and the test data come from different corpus,...

Xuan Cao, Maoshen Jia, ... Tun-wen Pai in EURASIP Journal on Audio, Speech, and Music Processing

Article Open access 27 December 2022

Analysis of influencing features with spectral feature extraction and multi-class classification using deep neural network for speech recognition system

There is a drastic need for extracting information from non-linguistic features of the audio sources. It leads to the eminent rise of speech...

Dinesh Kumar Anguraj, J. Anitha, ... D. Mythrayee in International Journal of Speech Technology

Article 16 May 2022

Processing of Chinese language and text information system under the background of speech recognition

With the popularization of computers, artificial intelligence technology has become more and more mature, among which speech recognition technology...

Huiqin Cao, Peng He, Cheng** Wang in Soft Computing

Article 10 June 2023

Research on English speech recognition system and training enhancement based on bat algorithm and acoustic model inspection

Because of its own language characteristics, English has become the main language tool for communication among countries in the world under the...

** Yang, Ling Li in Soft Computing

Article 29 June 2023

Amazigh CNN speech recognition system based on Mel spectrogram feature extraction method

The field of speech recognition makes it simpler for humans and machines to engage with speech. Number-oriented communication, such as using a...

Hossam Boulal, Mohamed Hamidi, ... Jamal Barkani in International Journal of Speech Technology

Article 01 March 2024

Mixed-modality speech recognition and interaction using a wearable artificial throat

Researchers have recently been pursuing technologies for universal speech recognition and interaction that can work well with subtle sounds or noisy...

Qisheng Yang, Weiqiu **, ... Tian-Ling Ren in Nature Machine Intelligence

Article 23 February 2023

Transfer Accent Identification Learning for Enhancing Speech Emotion Recognition

Emotional speech has some dependency on language or within a language itself, there are certain variations due to accents. The presence of accents...

G. Priya Dharshini, K. Sreenivasa Rao in Circuits, Systems, and Signal Processing

Article 30 April 2024

Fifth-generation edge computing-oriented speech recognition system applied in Japanese education and social sentiment classification

With the rapid development of the fifth-generation (5G) mobile technology, smart devices and their diversified business applications are booming, and...

Zhou Qiao in Soft Computing

Article 30 June 2023

A rough set theory and deep learning-based predictive system for gender recognition using audio speech

Speech is one of the most delicate medium through which gender of the speakers can easily be identified. Though the related research has shown very...

Ghazaala Yasmin, Asit Kumar Das, ... Soumi Dutta in Soft Computing

Article 20 April 2022

Speech Emotion Recognition Using Generative Adversarial Network and Deep Convolutional Neural Network

Speech emotion recognition (SER) has recently increased because of vast innovations in human–computer interaction and affective computing. In recent...

Kishor Bhangale, Mohanaprasad Kothandaraman in Circuits, Systems, and Signal Processing

Article 16 December 2023

Search

Filters

Search Results

Search

Navigation