  1. Chapter and Conference Paper

    Enrich Web Applications with Voice Internet Persona Text-to-Speech for Anyone, Anywhere

    To embrace the coming age of rich Internet applications and to enrich applications with voice, we propose a Voice Internet Persona (VIP) service. Unlike current text-to-speech (TTS) applications, in which user...

    Min Chu, Yusheng Li, Xin Zou, Frank Soong in Human-Computer Interaction. HCI Intelligen… (2007)

  2. Chapter and Conference Paper

    A Robust Voice Activity Detection Based on Noise Eigenspace Projection

    A robust voice activity detector (VAD) is expected to increase the accuracy of ASR in noisy environments. This study focuses on how to extract robust information for designing a robust VAD. To do so, we constr...

    Dongwen Ying, Yu Shi, Frank Soong, Jianwu Dang in Chinese Spoken Language Processing (2006)

  3. Chapter and Conference Paper

    Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models

    Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In this paper we propose a two-pass search strategy for improving tonal syllable recognition performance. In the first pass...

    Huanliang Wang, Yao Qian, Frank Soong, Jian-Lai Zhou in Chinese Spoken Language Processing (2006)

  4. Chapter and Conference Paper

    Automatic Detection of Tone Mispronunciation in Mandarin

    In this paper we present our study on detecting tone mispronunciations in Mandarin. Both template and HMM approaches are investigated. Schematic templates of pitch contours are shown to be impractical due to t...

    Li Zhang, Chao Huang, Min Chu, Frank Soong in Chinese Spoken Language Processing (2006)

  5. Chapter and Conference Paper

    An HMM-Based Mandarin Chinese Text-To-Speech System

    In this paper we present our Hidden Markov Model (HMM)-based, Mandarin Chinese Text-to-Speech (TTS) system. Mandarin Chinese or Putonghua, “the common spoken language”, is a tone language where each of the 400...

    Yao Qian, Frank Soong, Yining Chen, Min Chu in Chinese Spoken Language Processing (2006)

  6. Chapter and Conference Paper

    Non-uniform Kernel Allocation Based Parsimonious HMM

    In conventional Gaussian mixture based Hidden Markov Model (HMM), all states are usually modeled with a uniform, fixed number of Gaussian kernels. In this paper, we propose to allocate kernels non-uniformly to...

    Peng Liu, Jian-Lai Zhou, Frank Soong in Chinese Spoken Language Processing (2006)

  7. Chapter and Conference Paper

    The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases

    Voice database is one of the most important parts in TTS systems. However, creating a high quality new TTS voice is not an easy task even for a professional team. The whole process is rather complicated and co...

    Min Chu, Yong Zhao, Yining Chen, Lijuan Wang in Chinese Spoken Language Processing (2006)

  8. Chapter and Conference Paper

    Signal Trajectory Based Noise Compensation for Robust Speech Recognition

    This paper presents a novel signal trajectory based noise compensation algorithm for robust speech recognition. Its performance is evaluated on the Aurora 2 database. The algorithm consists of two processing s...

    Zhi-Jie Yan, Jian-Lai Zhou, Frank Soong, Ren-Hua Wang in Chinese Spoken Language Processing (2006)

  9. Chapter and Conference Paper

    Noisy Speech Recognition Performance of Discriminative HMMs

    Discriminatively trained HMMs are investigated in both clean and noisy environments in this study. First, a recognition error is defined at different levels including string, word, phone and acoustics. A high ...

    Jun Du, Peng Liu, Frank Soong, Jian-Lai Zhou in Chinese Spoken Language Processing (2006)