Search Page | SpringerLink

Usefulness of glottal excitation source information for audio-visual speech recognition system

In this work, the excitation source based glottal information is explored as a supplementary evidence for develo** robust audio-visual speech...

Salam Nandakishor, Debadatta Pati in International Journal of Speech Technology

Article 14 November 2023

Different Machine Learning Algorithms for Parkinson’s Disease Detection Using Speech Signals

A neurodegenerative disorder affecting the brain's neurological, physiological, and behavioral systems is called Parkinson's disease (PD). In the...

Chaitali Shamrao Raje, Pramodkumar H. Kulkarni, Rupali Deshmukh in Communication and Intelligent Systems

Conference paper 2024

An automated speech analysis system for the detection of cognitive decline in elderly

The goal of this study is to develop and test an automated integrated speech analysis system for detecting mild cognitive impairment (MCI) and...

Christos P. Loizou, Marios Pantzaris in International Journal of Speech Technology

Article 19 January 2023

Detection of replay signals using excitation source and shifted CQCC features

The replay attack is refereed as an unauthorized attempt to access the automatic speaker verification (ASV) system by using the pre-recorded speech...

Krishna Dutta, Madhusudan Singh, Debadatta Pati in International Journal of Speech Technology

Article 04 February 2021

Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model

Estimating glottal source waveforms and vocal tract shapes is typically done by processing the speech signal using an inverse filter and then fitting...

Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi in Journal of Signal Processing Systems

Article 23 December 2019

Optimal prosodic feature extraction and classification in parametric excitation source information for Indian language identification using neural network based Q-learning algorithm

Automatic language identification (LID) system has extensively recognized in a real world multilanguage speech specific applications. The formation...

Himanish Shekhar Das, Pinki Roy in International Journal of Speech Technology

Article 03 December 2018

Analysis of algorithms to estimate glottal closure instants from speech signals

Estimation of glottal closure instants (GCIs) plays a vital role in pitch-synchronous speech processing. The current work performs a qualitative and...

G. Anushiya Rachel, P. Vijayalakshmi, T. Nagarajan in International Journal of Speech Technology

Article 09 September 2020

The Voice Signal and Its Information Content—2

Information in the voice signal is embedded in both its time progression and in its spectral content, i.e. in its time domain and spectrographic...

Rita Singh in Profiling Humans from their Voice

Chapter 2019

Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference

In generation of emotional speech, there are deviations in the speech production features when compared to neutral (non-emotional) speech. The...

Sudarsana Reddy Kadiri, P. Gangamohan, ... B. Yegnanarayana in Circuits, Systems, and Signal Processing

Article Open access 25 February 2020

Background and Literature Review

This chapter provides a brief overview about the HMM-based speech synthesis. Existing works related to voicing detection and F 0 estimation are...

K. Sreenivasa Rao, N. P. Narendra in Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis

Chapter 2019

The effect of pitch tracking on automatic dialect identification

Pitch tracking is one of the most important research topics in the recognition and identification area. This study concerns the effect of the pitch...

A. Etman, A. A. Beex in International Journal of Speech Technology

Article 23 June 2017

Speech synthesis for glottal activity region processing

The objective of this paper is to demonstrate the significance of combining different features present in the glottal activity region for statistical...

Nagaraj Adiga, S. R. M Prasanna in International Journal of Speech Technology

Article 03 December 2018

Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis

The speech generated by hidden Markov model (HMM)-based speech synthesis systems (HTS) suffers from a ‘buzzing’ sound, which is due to an...

Zhengqi Wen, Jianhua Tao, ... Yang Wang in Journal of Signal Processing Systems

Article 19 December 2013

On the use of speech parameter contours for emotion recognition

Many features have been proposed for speech-based emotion recognition, and a majority of them are frame based or statistics estimated from...

Vidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps in EURASIP Journal on Audio, Speech, and Music Processing

Article Open access 10 July 2013

Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications

Glottal inverse filtering (GIF) refers to methods of estimating the source of voiced speech, the glottal volume velocity waveform. GIF is based on...

PAAVO ALKU in Sadhana

Article 01 October 2011

Search

Filters

Search Results

Search

Navigation