-
Chapter and Conference Paper
UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection
In this paper, a speaker segmentation method based on log-likelihood ratio score (LLRS) over universal background model (UBM) and a speaker clustering method based on difference of log-likelihood scores betwee...
-
Chapter and Conference Paper
Pitch Mean Based Frequency War**
In this paper, a novel pitch mean based frequency war** (PMFW) method is proposed to reduce the pitch variability in speech signals at the front-end of speech recognition. The warp factors used in this proce...
-
Chapter and Conference Paper
State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition
Aiming at building a dialectal Chinese speech recognizer from a standard Chinese speech recognizer with a small amount of dialectal Chinese speech, a novel, simple but effective acoustic modeling method, named st...
-
Article
Speech detection in non-stationary noise based on the 1/f process
In this paper, an effective and robust active speech detection method is proposed based on the 1/f process technique for signals under non-stationary noisy environments. The Gaussian 1/f process, a mathematical m...
-
Article
The Hidden Markov Model of co-articulation and its application to the continuous speech recognition
The co-articulation is one of the main reasons that makes the speech recognition difficult. However, the traditional Hidden Markov Models(HMM) can not model the co-articulation, because they depend on the firs...
-
Article
HarkMan—A vocabulary-independent keyword spotter for spontaneous Chinese speech
In this paper, a novel technique adopted in HarkMan is introduced. HarkMan is a keyword-spotter designed to automatically spot the given words of a vocabulary-independent task in unconstrained Chinese telephon...
-
Article
Center-distance continuous probability models and the distance measure
In this paper, a new statistic model named Center-Distance Continuous Probability Model (CDCPM) for speech recognition is described, which is based on Center-Distance Normal (CDN) distribution. In a CDCPM, the...
-
Article
A log-index weighted cepstral distance measure for speech recognition
A log-index weighted cepstral distance measure is proposed and tested in speaker-independent and speaker-dependent isolated word recognition systems using statistic techniques. The weights for the cepstral coe...