-
Article
Scene classification-oriented saliency detection via the modularized prescription
Saliency detection has advanced considerably and been widely applied in recent years. However, the performance of current methods is not satisfactory in complex scenes. One of the reasons is that the performan...
-
Article
Multi-scale counting and difference representation for texture classification
Multi-scale analysis has been widely used for constructing texture descriptors by modeling the coefficients in transformed domains. However, the resulting descriptors are not robust to rotated textures whe...
-
Article
Salient object detection in complex scenes via D-S evidence theory based region classification
In complex scenes, multiple objects are often concealed in cluttered backgrounds. Their saliency is difficult to detect with conventional methods, mainly because single color contrast cannot shoulder...
-
Chapter and Conference Paper
Automatic Prosodic Events Detection Using a Two-Stage SVM/CRF Sequence Classifier with Acoustic Features
To benefit from the maximum-margin nature of SVMs and also from the ability of CRFs to model correlations between neighboring features, this paper utilizes a two-stage SVM/CRF sequence classifier to detect pro...
-
Chapter and Conference Paper
Latent Topic Model Based on Gaussian-LDA for Audio Retrieval
In this paper, we introduce a new topic model named Gaussian-LDA, which is more suitable for modeling continuous data. Topic models based on latent Dirichlet allocation (LDA) are widely used for the statistical analy...
-
Chapter and Conference Paper
Speech Fragment Decoding Techniques Using Silent Pause Detection
Silent pauses frequently occur in spontaneous speech. When recognizing spontaneous speech, silent pauses tend to degrade the performance of typical speech recognizers. This paper proposes a fragment decoding m...
-
Chapter and Conference Paper
Monaural Voiced Speech Separation with Multipitch Tracking
Separating voiced speech from mixtures with interference under monaural conditions is not only an important but also a challenging task. As multipitch tracking can enable much better performance of speech separ...
-
Chapter
Incorporate Visual Analytics to Design a Human-Centered Computing Framework for Personalized Classifier Training and Image Retrieval
Humans have always been a part of the computational loop. The goal of human-centered multimedia computing is to explicitly address human factors at all levels of multimedia computations. In this chapter, we have...