Search Results
-
Usefulness of glottal excitation source information for audio-visual speech recognition system
In this work, the excitation source based glottal information is explored as a supplementary evidence for developing robust audio-visual speech...
-
Different Machine Learning Algorithms for Parkinson’s Disease Detection Using Speech Signals
A neurodegenerative disorder affecting the brain's neurological, physiological, and behavioral systems is called Parkinson's disease (PD). In the...
-
An automated speech analysis system for the detection of cognitive decline in elderly
The goal of this study is to develop and test an automated integrated speech analysis system for detecting mild cognitive impairment (MCI) and...
-
Detection of replay signals using excitation source and shifted CQCC features
The replay attack is referred to as an unauthorized attempt to access the automatic speaker verification (ASV) system by using the pre-recorded speech...
-
Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model
Estimating glottal source waveforms and vocal tract shapes is typically done by processing the speech signal using an inverse filter and then fitting...
-
Optimal prosodic feature extraction and classification in parametric excitation source information for Indian language identification using neural network based Q-learning algorithm
Automatic language identification (LID) systems have been extensively recognized in real-world multilanguage speech-specific applications. The formation...
-
Analysis of algorithms to estimate glottal closure instants from speech signals
Estimation of glottal closure instants (GCIs) plays a vital role in pitch-synchronous speech processing. The current work performs a qualitative and...
-
The Voice Signal and Its Information Content—2
Information in the voice signal is embedded in both its time progression and in its spectral content, i.e. in its time domain and spectrographic...
-
Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
In generation of emotional speech, there are deviations in the speech production features when compared to neutral (non-emotional) speech. The...
-
Background and Literature Review
This chapter provides a brief overview of HMM-based speech synthesis. Existing works related to voicing detection and F0 estimation are...
-
The effect of pitch tracking on automatic dialect identification
Pitch tracking is one of the most important research topics in the recognition and identification area. This study concerns the effect of the pitch...
-
Speech synthesis for glottal activity region processing
The objective of this paper is to demonstrate the significance of combining different features present in the glottal activity region for statistical...
-
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis
The speech generated by hidden Markov model (HMM)-based speech synthesis systems (HTS) suffers from a ‘buzzing’ sound, which is due to an...
-
On the use of speech parameter contours for emotion recognition
Many features have been proposed for speech-based emotion recognition, and a majority of them are frame based or statistics estimated from...
-
Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications
Glottal inverse filtering (GIF) refers to methods of estimating the source of voiced speech, the glottal volume velocity waveform. GIF is based on...