Information Retrieval Technology
Second Asia Information Retrieval Symposium, AIRS 2005, Jeju Island, Korea, October 13-15, 2005. Proceedings
Chapter and Conference Paper
Foley sound in movies and TV episodes is of great importance to bring a more realistic feeling to the audience. Traditionally, foley artists need to create the foley sound synchronous with the content occurrin...
Chapter
Automatic speech recognition (ASR) and text-to-speech (TTS) synthesis are two very important modules in human-computer communication. With the development of deep learning, the performance of ASR and TTS has i...
Article
Artificial intelligence (AI) education for K-12 students is an emerging necessity, owing to the rapid advancement and deployment of AI technologies. It is essential to take teachers’ perspectives into account ...
Chapter and Conference Paper
This paper presents an overview of Shared Task 7, Fine-Grained Dialogue Social Bias Measurement, in NLPCC 2022. In this paper, we introduce the task, explain the construction of the provided dataset, anal...
Chapter and Conference Paper
User queries for a real-world dialog system may sometimes fall outside the scope of the system’s capabilities, but appropriate system responses will enable smooth processing throughout the human-computer inter...
Chapter and Conference Paper
Recurrent neural network (RNN) acoustic models (AMs) with long short-term memory (LSTM) have achieved state-of-the-art performance in LVCSR. Their strong ability to capture context information makes the acoust...
Article
Article
Article
Emphasis plays an important role in expressive speech synthesis in highlighting the focus of an utterance to draw the attention of the listener. As there are only a few emphasized words in a sentence, the prob...
Article
Synthetic talking avatars have been demonstrated to be very useful in human-computer interaction. In this paper, we discuss the problem of acoustic-to-articulatory mapping and explore different kinds of models ...
Article
Emphasis plays an important role in expressive speech synthesis in highlighting the focus of an utterance to draw the attention of the listener. We present a hidden Markov model (HMM)-based emphatic speech syn...
Chapter
This paper reviews interactive methods for improving the phonetic competence of subjects in the case of second language learning as well as in the case of speech therapy for subjects suffering from hearing-imp...
Chapter and Conference Paper
Story segmentation plays a critical role in spoken document processing. Spoken documents often come in a continuous audio stream without explicit boundaries related to stories or topics. It is important to be ...
Chapter and Conference Paper
This paper presents a corpus-based approach for cooperative response generation in a spoken dialog system for the Hong Kong tourism domain. A corpus with 3874 requests and responses is collected using Wizard-o...
Chapter and Conference Paper
This paper proposes a novel approach towards a video-realistic, speech-driven talking face for Cantonese. We present a technique that realizes a talking face for a target language (Cantonese) using only audio...
Book and Conference Proceedings
Chapter and Conference Paper
This paper explores the fusion of audio and visual evidence through a multi-level hybrid fusion architecture based on a dynamic Bayesian network (DBN), which combines model-level and decision-level fusion to ac...
Chapter and Conference Paper
In this paper, we describe a reading comprehension system. This system can return a sentence in a given document as the answer to a given question. This system applies a bag-of-words matching approach as the bas...
Chapter and Conference Paper
This paper presents a pruning approach for minimizing the execution time in the pattern matching process during speaker verification. Specifically, our speaker verification system uses mel-frequency cepstral c...
Chapter
We propose a unified framework for integrating a variety of linguistic knowledge sources for representing the English word, to facilitate their concurrent utilization in language applications. Our hierarchical...