-
Chapter and Conference Paper
Enrich Web Applications with Voice Internet Persona Text-to-Speech for Anyone, Anywhere
To embrace the coming age of rich Internet applications and to enrich applications with voice, we propose a Voice Internet Persona (VIP) service. Unlike current text-to-speech (TTS) applications, in which user...
-
Chapter and Conference Paper
A Robust Voice Activity Detection Based on Noise Eigenspace Projection
A robust voice activity detector (VAD) is expected to increase the accuracy of ASR in noisy environments. This study focuses on how to extract robust information for designing a robust VAD. To do so, we constr...
-
Chapter and Conference Paper
Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models
Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In this paper we propose a two-pass search strategy for improving tonal syllable recognition performance. In the first pass...
-
Chapter and Conference Paper
Automatic Detection of Tone Mispronunciation in Mandarin
In this paper we present our study on detecting tone mispronunciations in Mandarin. Both template and HMM approaches are investigated. Schematic templates of pitch contours are shown to be impractical due to t...
-
Chapter and Conference Paper
An HMM-Based Mandarin Chinese Text-To-Speech System
In this paper we present our Hidden Markov Model (HMM)-based, Mandarin Chinese Text-to-Speech (TTS) system. Mandarin Chinese or Putonghua, “the common spoken language”, is a tone language where each of the 400...
-
Chapter and Conference Paper
Non-uniform Kernel Allocation Based Parsimonious HMM
In conventional Gaussian mixture based Hidden Markov Model (HMM), all states are usually modeled with a uniform, fixed number of Gaussian kernels. In this paper, we propose to allocate kernels non-uniformly to...
-
Chapter and Conference Paper
The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases
Voice database is one of the most important parts in TTS systems. However, creating a high quality new TTS voice is not an easy task even for a professional team. The whole process is rather complicated and co...
-
Chapter and Conference Paper
Signal Trajectory Based Noise Compensation for Robust Speech Recognition
This paper presents a novel signal trajectory based noise compensation algorithm for robust speech recognition. Its performance is evaluated on the Aurora 2 database. The algorithm consists of two processing s...
-
Chapter and Conference Paper
Noisy Speech Recognition Performance of Discriminative HMMs
Discriminatively trained HMMs are investigated in both clean and noisy environments in this study. First, a recognition error is defined at different levels including string, word, phone and acoustics. A high ...