-
Chapter and Conference Paper
Sign Correlation Detector for Blind Image Watermarking in the DCT Domain
Digital watermarking is a key technique for protecting the intellectual property of digital media. Due to the ability to detect the watermark without the original image, blind watermarking is very useful if there are ...
-
Chapter and Conference Paper
Robust Block and Gray-Level Histogram Based Watermarking Scheme
As one of the crucial problems in watermarking, robustness against geometric distortion and JPEG compression becomes more than challenging and problematic. In this paper, a robust watermarking scheme based on ...
-
Article
Predicting movie Box-office revenues by exploiting large-scale social media content
Predicting the box-office revenue of a movie before its theatrical release is an important but challenging problem that requires a high level of Artificial Intelligence. Nowadays, social media has shown its pr...
-
Chapter and Conference Paper
The Analysis for Binaural Signal’s Characteristics of a Real Source and Corresponding Virtual Sound Image
3D Audio System could rebuild more realistic and immersive sound effects. The existing 3D audio reconstruction methods mainly consider the physical characteristics of sound field, less take head’s effect on so...
-
Chapter and Conference Paper
A Splicing Interpolation Method for Head-Related Transfer Function
We proposed a new head-related transfer function (HRTF) interpolation method based on splicing. The sound spreads from sound source to listener’s ears, the wave of head-related impulse response (HRIR) clearly ...
-
Chapter and Conference Paper
An Efficient Method Using the Parameterized HRTFs for 3D Audio Real-Time Rendering on Mobile Devices
3D audio real-time rendering is of great importance for virtual reality (VR) application, especially on mobile devices. However, the limited computational power makes it hard to implement fast generation of sp...
-
Chapter and Conference Paper
Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks
Speech intelligibility is a significant factor for successful speech communication. To enhance the intelligibility, many methods have been proposed, mainly by operating the speech signal such as increasing the...
-
Article
Weighted motion averaging for the registration of multi-view range scans
Multi-view registration is a fundamental but challenging task in 3D reconstruction and robot vision. Although the original motion averaging algorithm has been introduced as an effective means to solve the mult...
-
Article
A near-end listening enhancement system by RNN-based noise cancellation and speech modification
When people listen to the phone in noisy environments, near-end listening enhancement (NELE) is a technology to enhance speech intelligibility against environmental noise. The complex environments in mobile co...
-
Article
Audio object coding based on optimal parameter frequency resolution
Object-based audio content is becoming the main form of audio content, because it is more interactive and flexible than traditional channel-based audio content. The Spatial Audio Object Coding (SAOC) method is...
-
Article
Multi-view point cloud registration with adaptive convergence threshold and its application in 3D model retrieval
Multi-view point cloud registration is a hot topic in the communities of artificial intelligence and multimedia technology. In this paper, we propose a novel framework to reconstruct 3D models with a multi-vie...
-
Article
A mapping model of spectral tilt in normal-to-Lombard speech conversion for intelligibility enhancement
Environmental noise degrades the speech intelligibility when listening to the phone. Although the phone has a clean signal source, it is still difficult for the listener to get information. Intelligibility enh...
-
Article
Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network
The recently-proposed deep clustering-based algorithms represent a fundamental advance towards the single-channel multi-speaker speech separation problem. These methods use an ideal binary mask to construct ...
-
Article
Optimization of sound fields reproduction based Higher-Order Ambisonics (HOA) using the Generative Adversarial Network (GAN)
Sound field reproduction using Higher-order Ambisonics (HOA) has been widely studied in recent years. However, in the HOA, sound fields are reproduced with the least square solution of spherical harmonics (SH) coeffi...
-
Article
Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks
Sound field recording using spherical harmonics (SH) has been widely used. However, too many microphones are needed when recording sound fields over large areas, due to the capture of the higher order of spher...
-
Article
Audio object coding based on N-step residual compensating
Object-based audio techniques provide more flexibility and convenience for personalized rendering under various playback configurations. Many methods have been proposed to encode and transmit multiple audio ob...
-
Article
A dual-tamper-detection method for digital image authentication and content self-recovery
This paper proposes an approach to protect image content against malicious tampering based on watermarking technology. The watermark is composed of two kinds of check bits which are used for tampered region lo...
-
Article
FD-TR: feature detector based on scale invariant feature transform and bidirectional feature regionalization for digital image watermarking
In this paper we propose the FD-TR: Feature Detector Based on Scale Invariant Feature Transform and Bidirectional Feature Regionalization for digital image watermarking. The Scale Invariant Feature Transform m...
-
Article
Research on a collaboration model of green closed-loop supply chains towards intelligent manufacturing
A closed-loop supply chain (CLSC) is a complete supply chain cycle that closes the flow of logistics from procurement to sales to reduce pollution and optimize returns. In this paper, we aim to address the pro...
-
Article
Towards a multimodal human activity dataset for healthcare
Human activity recognition (HAR) based on wearable devices has become a hot topic due to the wide adoption of smartphones and smart bands. In this paper, we propose a new dataset, MMC-PCL-Activity, for wearabl...