Skip to main content

previous disabled Page of 2
and
  1. No Access

    Article

    Research on water level measurement technology based on the residual length ratio of image characters

    Aiming at the low efficiency and poor adaptability of traditional water level measurement methods, a water level measurement technology based on the residual length ratio of image characters is proposed in thi...

    Mingtang Liu, Changchun Wang, Wei Huang in Signal, Image and Video Processing (2024)

  2. No Access

    Article

    OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition

    In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...

    Keyan Chen, **aolong Jiang, Haochen Wang in International Journal of Computer Vision (2024)

  3. No Access

    Article

    OV-VIS: Open-Vocabulary Video Instance Segmentation

    Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...

    Haochen Wang, Cilin Yan, Keyan Chen in International Journal of Computer Vision (2024)

  4. No Access

    Article

    Map** of sand and gravel aggregate level height and volume measurement based on contour map** generation

    In order to prevent the abnormal appearance of sand and gravel aggregate level in the concrete mixing plant, and improve the safety of the concrete mixing plant system as well as the efficient and high-quality...

    Yingjie Liu, Shuang Yue, **aochen Wang, **hao Zhang in Signal, Image and Video Processing (2024)

  5. No Access

    Article

    Optimization analysis of football match prediction model based on neural network

    How to build a football match prediction model and use scientific methods to solve the prediction problem has become a key point in the application of artificial intelligence in the sports industry. In this pa...

    Shuo Guan, **aochen Wang in Neural Computing and Applications (2022)

  6. No Access

    Chapter and Conference Paper

    Stacked Sparse Autoencoder for Audio Object Coding

    Compared with channel-based audio coding, the object-based audio coding has a definite advantage in meeting the user’s demands of personalized control. However, in the conventional Spatial Audio Object Coding ...

    Yulin Wu, Ruimin Hu, **aochen Wang, Chenhao Hu, Gang Li in MultiMedia Modeling (2021)

  7. No Access

    Chapter and Conference Paper

    EMRM: Enhanced Multi-source Review-Based Model for Rating Prediction

    Rating prediction, whose goal is to predict user preference for unconsumed items, has become one of the core tasks in recommendation systems. Recently, many deep learning-based methods have been applied to the...

    **aochen Wang, Tingsong **ao, Jie Shao in Knowledge Science, Engineering and Management (2021)

  8. No Access

    Chapter and Conference Paper

    Synthesizing Large-Scale Datasets for License Plate Detection and Recognition in the Wild

    License Plate Detection and Recognition (LPDR) plays a key role in modern intelligent transportation systems. Recent state-of-the-art methods of LPDR are based on deep convolutional neural networks (DCNN), whi...

    Chaochen Wang, Wenzhong Wang, Chenglong Li in Pattern Recognition and Computer Vision (2020)

  9. No Access

    Chapter and Conference Paper

    Multi-step Coding Structure of Spatial Audio Object Coding

    The spatial audio object coding (SAOC) is an effective meth-od which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal ...

    Chenhao Hu, Ruimin Hu, **aochen Wang, Tingzhao Wu, Dengshi Li in MultiMedia Modeling (2020)

  10. No Access

    Chapter and Conference Paper

    Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet

    When using a loudspeaker triplet for virtual sound localization, the traditional conversion method will result in inaccurate localization. In this paper, we constructed a perceptual localization distortion mod...

    Duanzheng Guan, Dengshi Li, Xuebei Cai, **aochen Wang, Ruimin Hu in MultiMedia Modeling (2020)

  11. No Access

    Chapter and Conference Paper

    HMM-Based Person Re-identification in Large-Scale Open Scenario

    This paper aims to tackle person re-identification (person re-ID) in large-scale open scenario, which differs from the conventional person re-ID tasks but is significant for some real suspect investigation ca...

    Dongyang Li, Ruimin Hu, Wenxin Huang, **aochen Wang, Dengshi Li in MultiMedia Modeling (2020)

  12. No Access

    Chapter and Conference Paper

    HRTF Representation with Convolutional Auto-encoder

    The head-related transfer function (HRTF) can be considered as some kind of filter that describes how a sound from an arbitrary spatial direction transfers to the listener’s eardrums. HRTF can be used to synth...

    Wei Chen, Ruimin Hu, **aochen Wang, Dengshi Li in MultiMedia Modeling (2020)

  13. No Access

    Chapter and Conference Paper

    Few-Shot Semantic Segmentation with Democratic Attention Networks

    Few-shot segmentation has recently generated great popularity, addressing the challenging yet important problem of segmenting objects from unseen categories with scarce annotated support images. The crux of fe...

    Haochen Wang, Xudong Zhang, Yutao Hu, Yandan Yang in Computer Vision – ECCV 2020 (2020)

  14. No Access

    Chapter and Conference Paper

    A Novel Ensemble Approach for Click-Through Rate Prediction Based on Factorization Machines and Gradient Boosting Decision Trees

    Click-Through Rate (CTR) prediction is a significant technique in the field of computational advertising, its accuracy directly affects companies profits and user experience. Achieving great ability of general...

    **aochen Wang, Gang Hu, Haoyang Lin, Jiayu Sun in Web and Big Data (2019)

  15. No Access

    Chapter and Conference Paper

    Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model

    Speech intelligibility enhancement is extremely meaningful for successful speech communication in noisy environments. Several methods based on Lombard effect are used to increase intelligibility. In those meth...

    Rui Zhang, Ruimin Hu, Gang Li, **aochen Wang in MultiMedia Modeling (2019)

  16. No Access

    Chapter and Conference Paper

    The Analysis for Binaural Signal’s Characteristics of a Real Source and Corresponding Virtual Sound Image

    3D Audio System could rebuild more realistic and immersive sound effects. The existing 3D audio reconstruction methods mainly consider the physical characteristics of sound filed, less take head’s effect on so...

    **shan Wang, **aochen Wang, Wei** Tu in Advances in Multimedia Information Process… (2018)

  17. No Access

    Chapter and Conference Paper

    Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks

    Speech intelligibility is a significant factor for successful speech communication. To enhance the intelligibility, many methods have been proposed, mainly by operating the speech signal such as increasing the...

    Feng Cheng, **aochen Wang, Li Gang in Advances in Multimedia Information Process… (2018)

  18. No Access

    Chapter and Conference Paper

    A Splicing Interpolation Method for Head-Related Transfer Function

    We proposed a new head-related transfer function (HRTF) interpolation method based on splicing. The sound spreads from sound source to listener’s ears, the wave of head-related impulse response (HRIR) clearly ...

    Chunling Ai, **aochen Wang, Yafei Wu in Advances in Multimedia Information Process… (2018)

  19. No Access

    Chapter and Conference Paper

    An Efficient Method Using the Parameterized HRTFs for 3D Audio Real-Time Rendering on Mobile Devices

    3D audio real-time rendering is of great importance for virtual reality (VR) application, especially on mobile devices. However, the limited computational power makes it hard to implement fast generation of sp...

    Yucheng Song, Wei** Tu, Ruimin Hu in Advances in Multimedia Information Process… (2018)

  20. No Access

    Chapter and Conference Paper

    Head Related Transfer Function Interpolation Based on Aligning Operation

    Head related transfer function (HRTF) is the main technique of binaural synthesis, which is used to reconstruct spatial sound image, and the HRTF data only can be obtained by measurement. A high resolution HRT...

    Tingzhao Wu, Ruimin Hu, **aochen Wang in Advances in Multimedia Information Process… (2016)

previous disabled Page of 2