Search Results - Springer


Chapter and Conference Paper

Enrich Web Applications with Voice Internet Persona Text-to-Speech for Anyone, Anywhere

To embrace the coming age of rich Internet applications and to enrich applications with voice, we propose a Voice Internet Persona (VIP) service. Unlike current text-to-speech (TTS) applications, in which user...

Min Chu, Yusheng Li, **n Zou, Frank Soong in Human-Computer Interaction. HCI Intelligen… (2007)

Chapter and Conference Paper

A Robust Voice Activity Detection Based on Noise Eigenspace Projection

A robust voice activity detector (VAD) is expected to increase the accuracy of ASR in noisy environments. This study focuses on how to extract robust information for designing a robust VAD. To do so, we constr...

Dongwen Ying, Yu Shi, Frank Soong, Jianwu Dang… in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models

Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In this paper we propose a two-pass search strategy for improving tonal syllable recognition performance. In the first pass...

Huanliang Wang, Yao Qian, Frank Soong, Jian-Lai Zhou… in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

Automatic Detection of Tone Mispronunciation in Mandarin

In this paper we present our study on detecting tone mispronunciations in Mandarin. Both template and HMM approaches are investigated. Schematic templates of pitch contours are shown to be impractical due to t...

Li Zhang, Chao Huang, Min Chu, Frank Soong… in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

An HMM-Based Mandarin Chinese Text-To-Speech System

In this paper we present our Hidden Markov Model (HMM)-based, Mandarin Chinese Text-to-Speech (TTS) system. Mandarin Chinese or Putonghua, “the common spoken language”, is a tone language where each of the 400...

Yao Qian, Frank Soong, Yining Chen, Min Chu in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

Non-uniform Kernel Allocation Based Parsimonious HMM

In conventional Gaussian mixture based Hidden Markov Model (HMM), all states are usually modeled with a uniform, fixed number of Gaussian kernels. In this paper, we propose to allocate kernels non-uniformly to...

Peng Liu, Jian-Lai Zhou, Frank Soong in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases

Voice database is one of the most important parts in TTS systems. However, creating a high quality new TTS voice is not an easy task even for a professional team. The whole process is rather complicated and co...

Min Chu, Yong Zhao, Yining Chen, Lijuan Wang… in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

Signal Trajectory Based Noise Compensation for Robust Speech Recognition

This paper presents a novel signal trajectory based noise compensation algorithm for robust speech recognition. Its performance is evaluated on the Aurora 2 database. The algorithm consists of two processing s...

Zhi-Jie Yan, Jian-Lai Zhou, Frank Soong, Ren-Hua Wang in Chinese Spoken Language Processing (2006)

Chapter and Conference Paper

Noisy Speech Recognition Performance of Discriminative HMMs

Discriminatively trained HMMs are investigated in both clean and noisy environments in this study. First, a recognition error is defined at different levels including string, word, phone and acoustics. A high ...

Jun Du, Peng Liu, Frank Soong, Jian-Lai Zhou… in Chinese Spoken Language Processing (2006)

9 Result(s)
