Skip to main content

previous disabled Page of 6
and
  1. No Access

    Article

    Mind-bridge: reconstructing visual images based on diffusion model from human brain activity

    Human brain vision is mysterious and complex, and it interprets the world through the connection between the brain and the eyes. In recent years, several methods have relied on fMRI to successfully reconstruct...

    Qing Liu, Hongqing Zhu, Ning Chen, Bingcang Huang in Signal, Image and Video Processing (2024)

  2. No Access

    Article

    DiffCAS: diffusion based multi-attention network for segmentation of 3D coronary artery from CT angiography

    Automatic segmentation of 3D coronary arteries from computed tomography angiography (CTA) is an indispensable part of accurate and efficient coronary artery disease (CAD) diagnosis. However, it remains challen...

    Jiajia Li, Qing Wu, Yuanquan Wang, Shoujun Zhou in Signal, Image and Video Processing (2024)

  3. No Access

    Article

    Spatially-Varying Illumination-Aware Indoor Harmonization

    In this paper, we address the problem of spatially-varying illumination-aware indoor harmonization. Existing image harmonization works either focus on extracting no more than 2D information (e.g., low-level st...

    Zhongyun Hu, Jiahao Li, Xue Wang, Qing Wang in International Journal of Computer Vision (2024)

  4. No Access

    Article

    SCA-YOLOv4: you only look once with squeeze-and-excitation, coordinate attention and adaptively spatial feature fusion

    How to effectively and efficiently identify multi-scale objects is one of the key challenges in object detection. In order to make the classification and regression of single-stage object detector more accurat...

    Pengfei Liu, Qing Wang in Signal, Image and Video Processing (2024)

  5. No Access

    Article

    ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation

    Feature fusion module is an essential component of real-time semantic segmentation networks to bridge the semantic gap among different feature layers. However, many networks are inefficient in multi-level feat...

    Ya Li, Ziming Li, Huiwang Liu, Qing Wang in The Visual Computer (2024)

  6. No Access

    Article

    Deep learning based insulator fault detection algorithm for power transmission lines

    Aiming at the complex background of transmission lines at the present stage, which leads to the problem of low accuracy of insulator fault detection for small targets, a deep learning-based insulator fault det...

    Han Wang, Qing Yang, Binlin Zhang, Dexin Gao in Journal of Real-Time Image Processing (2024)

  7. No Access

    Article

    ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

    With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...

    Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu in International Journal of Computer Vision (2024)

  8. Article

    Open Access

    Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification

    Existing unsupervised person re-identification approaches fail to fully capture the fine-grained features of local regions, which can result in people with similar appearances and different identities being as...

    Qing Han, Longfei Li, Weidong Min, Qi Wang, Qingpeng Zeng in Computational Visual Media (2024)

  9. No Access

    Article

    Harmonizing local and global features: enhanced hand gesture segmentation using synergistic fusion of CNN and transformer networks

    Hand gesture segmentation is an important research topic in computer vision. Despite ongoing efforts, achieving optimal gesture segmentation remains challenging, attributed to factors like gesture morphology a...

    Shi Wang, Ning Yang, Maohua Liu, Qing Tian in Signal, Image and Video Processing (2024)

  10. No Access

    Article

    Offline handwritten mathematical expression recognition based on YOLOv5s

    The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwrit...

    Fei Li, Hongbo Fang, Dengzhun Wang, Ruixin Liu, Qing Hou in The Visual Computer (2024)

  11. No Access

    Article

    Machine reading comprehension model based on query reconstruction technology and deep learning

    Machine reading comprehension is introduced to improve machines’ readability and understandability of human languages. This sophisticated version of natural language processing is used for testing and improvin...

    Pengming Wang, M. M. Kamruzzaman, Qing Chen in Neural Computing and Applications (2024)

  12. No Access

    Chapter and Conference Paper

    Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization

    The scarcity of labeled audio-visual datasets is a constraint for training superior audio-visual speaker diarization systems. To improve the performance of audio-visual speaker diarization, we leverage pre-tra...

    Huan Zhao, Li Zhang, Yue Li, Yannan Wang, Hongji Wang in Man-Machine Speech Communication (2024)

  13. No Access

    Article

    Towards High-Resolution Specular Highlight Detection

    Specular highlight detection is an essential task with various applications in computer vision. This paper aims to detect specular highlights in single high-resolution images using deep learning while avoiding...

    Gang Fu, Qing Zhang, Lei Zhu, Qifeng Lin in International Journal of Computer Vision (2024)

  14. No Access

    Article

    ACKSNet: adaptive center keypoint selection for object detection

    Keypoint-based detectors generate a large number of false positives due to incorrect keypoint matching in the object detection task. In this paper, we propose an adaptive center keypoint selection method (ACKS...

    **ngzhu Liang, Lixin Wang, Wei Cheng, **nyun Yan, Qing Chen in The Visual Computer (2023)

  15. No Access

    Article

    Cluster-based two-branch framework for point cloud attribute compression

    Owing to the irregular distribution of point clouds in 3D space, effectively compressing the point cloud is still challenging. Recently, numerous compression methods have been developed with outstanding perfor...

    Longhua Sun, ** Wang, Qing Zhu, Jiaying Liu, Jiawen Yu in The Visual Computer (2023)

  16. No Access

    Article

    ZRDNet: zero-reference image defogging by physics-based decomposition–reconstruction mechanism and perception fusion

    This paper investigates challenging fully unsupervised defogging problems, i.e., how to remove fog by feeding only foggy images in deep neural networks rather than using paired or unpaired synthetic images, an...

    Zi-**n Li, Yu-Long Wang, Qing-Long Han, Chen Peng in The Visual Computer (2023)

  17. No Access

    Article

    Trade-off background joint learning for unsupervised vehicle re-identification

    Existing vehicle re-identification (Re-ID) methods either extract valuable background information to enhance the robustness of the vehicle model or segment background interference information to learn vehicle ...

    Sheng Wang, Qi Wang, Weidong Min, Qing Han, Di Gai, Haowen Luo in The Visual Computer (2023)

  18. Article

    Open Access

    Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential

    The large language model called ChatGPT has drawn extensively attention because of its human-like expression and reasoning abilities. In this study, we investigate the feasibility of using ChatGPT in experimen...

    Qing Lyu, Josh Tan, Michael E. Zapadka in Visual Computing for Industry, Biomedicine… (2023)

  19. No Access

    Article

    VGT-MOT: visibility-guided tracking for online multiple-object tracking

    Multi-object tracking (MOT) is an important task of computer vision which has a wide range of applications. Existing multi-object tracking methods mostly employ the Kalman filter to predict the object location...

    Shuai Wang, Wei-** Li, Lu Wang, Li-Sheng Xu in Machine Vision and Applications (2023)

  20. No Access

    Chapter and Conference Paper

    CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting

    Crowd counting is a crucial task in computer vision, offering numerous applications in smart security, remote sensing, agriculture and forestry. While pure image-based models have made significant advancements...

    Jialu Cai, Qing Wang, Shengqin Jiang in Biometric Recognition (2023)

previous disabled Page of 6