Search Results - Springer

Article

Research on water level measurement technology based on the residual length ratio of image characters

Aiming at the low efficiency and poor adaptability of traditional water level measurement methods, a water level measurement technology based on the residual length ratio of image characters is proposed in thi...

Mingtang Liu, Changchun Wang, Wei Huang… in Signal, Image and Video Processing (2024)

Article

OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition

In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...

Keyan Chen, Xiaolong Jiang, Haochen Wang… in International Journal of Computer Vision (2024)

Article

OV-VIS: Open-Vocabulary Video Instance Segmentation

Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...

Haochen Wang, Cilin Yan, Keyan Chen… in International Journal of Computer Vision (2024)

Article

Map of sand and gravel aggregate level height and volume measurement based on contour map generation

In order to prevent the abnormal appearance of sand and gravel aggregate level in the concrete mixing plant, and improve the safety of the concrete mixing plant system as well as the efficient and high-quality...

Yingjie Liu, Shuang Yue, Xiaochen Wang, **hao Zhang… in Signal, Image and Video Processing (2024)

Article

Optimization analysis of football match prediction model based on neural network

How to build a football match prediction model and use scientific methods to solve the prediction problem has become a key point in the application of artificial intelligence in the sports industry. In this pa...

Shuo Guan, Xiaochen Wang in Neural Computing and Applications (2022)

Chapter and Conference Paper

Stacked Sparse Autoencoder for Audio Object Coding

Compared with channel-based audio coding, the object-based audio coding has a definite advantage in meeting the user’s demands of personalized control. However, in the conventional Spatial Audio Object Coding ...

Yulin Wu, Ruimin Hu, Xiaochen Wang, Chenhao Hu, Gang Li in MultiMedia Modeling (2021)

Chapter and Conference Paper

EMRM: Enhanced Multi-source Review-Based Model for Rating Prediction

Rating prediction, whose goal is to predict user preference for unconsumed items, has become one of the core tasks in recommendation systems. Recently, many deep learning-based methods have been applied to the...

Xiaochen Wang, Tingsong Xiao, Jie Shao in Knowledge Science, Engineering and Management (2021)

Chapter and Conference Paper

Synthesizing Large-Scale Datasets for License Plate Detection and Recognition in the Wild

License Plate Detection and Recognition (LPDR) plays a key role in modern intelligent transportation systems. Recent state-of-the-art methods of LPDR are based on deep convolutional neural networks (DCNN), whi...

Chaochen Wang, Wenzhong Wang, Chenglong Li… in Pattern Recognition and Computer Vision (2020)

Chapter and Conference Paper

Multi-step Coding Structure of Spatial Audio Object Coding

The spatial audio object coding (SAOC) is an effective method which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal ...

Chenhao Hu, Ruimin Hu, Xiaochen Wang, Tingzhao Wu, Dengshi Li in MultiMedia Modeling (2020)

Chapter and Conference Paper

Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet

When using a loudspeaker triplet for virtual sound localization, the traditional conversion method will result in inaccurate localization. In this paper, we constructed a perceptual localization distortion mod...

Duanzheng Guan, Dengshi Li, Xuebei Cai, Xiaochen Wang, Ruimin Hu in MultiMedia Modeling (2020)

Chapter and Conference Paper

HMM-Based Person Re-identification in Large-Scale Open Scenario

This paper aims to tackle person re-identification (person re-ID) in large-scale open scenario, which differs from the conventional person re-ID tasks but is significant for some real suspect investigation ca...

Dongyang Li, Ruimin Hu, Wenxin Huang, Xiaochen Wang, Dengshi Li… in MultiMedia Modeling (2020)

Chapter and Conference Paper

HRTF Representation with Convolutional Auto-encoder

The head-related transfer function (HRTF) can be considered as some kind of filter that describes how a sound from an arbitrary spatial direction transfers to the listener’s eardrums. HRTF can be used to synth...

Wei Chen, Ruimin Hu, Xiaochen Wang, Dengshi Li in MultiMedia Modeling (2020)

Chapter and Conference Paper

Few-Shot Semantic Segmentation with Democratic Attention Networks

Few-shot segmentation has recently generated great popularity, addressing the challenging yet important problem of segmenting objects from unseen categories with scarce annotated support images. The crux of fe...

Haochen Wang, Xudong Zhang, Yutao Hu, Yandan Yang… in Computer Vision – ECCV 2020 (2020)

Chapter and Conference Paper

A Novel Ensemble Approach for Click-Through Rate Prediction Based on Factorization Machines and Gradient Boosting Decision Trees

Click-Through Rate (CTR) prediction is a significant technique in the field of computational advertising; its accuracy directly affects companies' profits and user experience. Achieving great ability of general...

Xiaochen Wang, Gang Hu, Haoyang Lin, Jiayu Sun in Web and Big Data (2019)

Chapter and Conference Paper

Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model

Speech intelligibility enhancement is extremely meaningful for successful speech communication in noisy environments. Several methods based on Lombard effect are used to increase intelligibility. In those meth...

Rui Zhang, Ruimin Hu, Gang Li, Xiaochen Wang in MultiMedia Modeling (2019)

Chapter and Conference Paper

The Analysis for Binaural Signal’s Characteristics of a Real Source and Corresponding Virtual Sound Image

3D Audio System could rebuild more realistic and immersive sound effects. The existing 3D audio reconstruction methods mainly consider the physical characteristics of sound field, less take head’s effect on so...

**shan Wang, Xiaochen Wang, Weiping Tu… in Advances in Multimedia Information Process… (2018)

Chapter and Conference Paper

Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks

Speech intelligibility is a significant factor for successful speech communication. To enhance the intelligibility, many methods have been proposed, mainly by operating the speech signal such as increasing the...

Feng Cheng, Xiaochen Wang, Li Gang… in Advances in Multimedia Information Process… (2018)

Chapter and Conference Paper

A Splicing Interpolation Method for Head-Related Transfer Function

We proposed a new head-related transfer function (HRTF) interpolation method based on splicing. The sound spreads from sound source to listener’s ears, the wave of head-related impulse response (HRIR) clearly ...

Chunling Ai, Xiaochen Wang, Yafei Wu… in Advances in Multimedia Information Process… (2018)

Chapter and Conference Paper

An Efficient Method Using the Parameterized HRTFs for 3D Audio Real-Time Rendering on Mobile Devices

3D audio real-time rendering is of great importance for virtual reality (VR) application, especially on mobile devices. However, the limited computational power makes it hard to implement fast generation of sp...

Yucheng Song, Weiping Tu, Ruimin Hu… in Advances in Multimedia Information Process… (2018)

Chapter and Conference Paper

Head Related Transfer Function Interpolation Based on Aligning Operation

Head related transfer function (HRTF) is the main technique of binaural synthesis, which is used to reconstruct spatial sound image, and the HRTF data only can be obtained by measurement. A high resolution HRT...

Tingzhao Wu, Ruimin Hu, Xiaochen Wang… in Advances in Multimedia Information Process… (2016)

30 Result(s)