Skip to main content

previous disabled Page of 5
and
  1. No Access

    Article

    Spatially-Varying Illumination-Aware Indoor Harmonization

    In this paper, we address the problem of spatially-varying illumination-aware indoor harmonization. Existing image harmonization works either focus on extracting no more than 2D information (e.g., low-level st...

    Zhongyun Hu, Jiahao Li, Xue Wang, Qing Wang in International Journal of Computer Vision (2024)

  2. No Access

    Article

    ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation

    Feature fusion module is an essential component of real-time semantic segmentation networks to bridge the semantic gap among different feature layers. However, many networks are inefficient in multi-level feat...

    Ya Li, Ziming Li, Huiwang Liu, Qing Wang in The Visual Computer (2024)

  3. No Access

    Article

    ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

    With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...

    Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu in International Journal of Computer Vision (2024)

  4. Article

    Open Access

    Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification

    Existing unsupervised person re-identification approaches fail to fully capture the fine-grained features of local regions, which can result in people with similar appearances and different identities being as...

    Qing Han, Longfei Li, Weidong Min, Qi Wang, Qingpeng Zeng in Computational Visual Media (2024)

  5. No Access

    Article

    Offline handwritten mathematical expression recognition based on YOLOv5s

    The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwrit...

    Fei Li, Hongbo Fang, Dengzhun Wang, Ruixin Liu, Qing Hou in The Visual Computer (2024)

  6. No Access

    Article

    Machine reading comprehension model based on query reconstruction technology and deep learning

    Machine reading comprehension is introduced to improve machines’ readability and understandability of human languages. This sophisticated version of natural language processing is used for testing and improvin...

    Pengming Wang, M. M. Kamruzzaman, Qing Chen in Neural Computing and Applications (2024)

  7. No Access

    Chapter and Conference Paper

    Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization

    The scarcity of labeled audio-visual datasets is a constraint for training superior audio-visual speaker diarization systems. To improve the performance of audio-visual speaker diarization, we leverage pre-tra...

    Huan Zhao, Li Zhang, Yue Li, Yannan Wang, Hongji Wang in Man-Machine Speech Communication (2024)

  8. No Access

    Article

    Towards High-Resolution Specular Highlight Detection

    Specular highlight detection is an essential task with various applications in computer vision. This paper aims to detect specular highlights in single high-resolution images using deep learning while avoiding...

    Gang Fu, Qing Zhang, Lei Zhu, Qifeng Lin in International Journal of Computer Vision (2024)

  9. No Access

    Article

    ACKSNet: adaptive center keypoint selection for object detection

    Keypoint-based detectors generate a large number of false positives due to incorrect keypoint matching in the object detection task. In this paper, we propose an adaptive center keypoint selection method (ACKS...

    **ngzhu Liang, Lixin Wang, Wei Cheng, **nyun Yan, Qing Chen in The Visual Computer (2023)

  10. No Access

    Article

    Cluster-based two-branch framework for point cloud attribute compression

    Owing to the irregular distribution of point clouds in 3D space, effectively compressing the point cloud is still challenging. Recently, numerous compression methods have been developed with outstanding perfor...

    Longhua Sun, ** Wang, Qing Zhu, Jiaying Liu, Jiawen Yu in The Visual Computer (2023)

  11. No Access

    Article

    ZRDNet: zero-reference image defogging by physics-based decomposition–reconstruction mechanism and perception fusion

    This paper investigates challenging fully unsupervised defogging problems, i.e., how to remove fog by feeding only foggy images in deep neural networks rather than using paired or unpaired synthetic images, an...

    Zi-**n Li, Yu-Long Wang, Qing-Long Han, Chen Peng in The Visual Computer (2023)

  12. No Access

    Article

    Trade-off background joint learning for unsupervised vehicle re-identification

    Existing vehicle re-identification (Re-ID) methods either extract valuable background information to enhance the robustness of the vehicle model or segment background interference information to learn vehicle ...

    Sheng Wang, Qi Wang, Weidong Min, Qing Han, Di Gai, Haowen Luo in The Visual Computer (2023)

  13. No Access

    Chapter and Conference Paper

    CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting

    Crowd counting is a crucial task in computer vision, offering numerous applications in smart security, remote sensing, agriculture and forestry. While pure image-based models have made significant advancements...

    Jialu Cai, Qing Wang, Shengqin Jiang in Biometric Recognition (2023)

  14. No Access

    Chapter and Conference Paper

    MatchFormer: Interleaving Attention in Transformers for Feature Matching

    Local feature matching is a computationally intensive task at the subpixel level. While detector-based methods coupled with feature descriptors struggle in low-texture scenes, CNN-based methods with a sequential

    Qing Wang, Jiaming Zhang, Kailun Yang, Kunyu Peng in Computer Vision – ACCV 2022 (2023)

  15. Article

    Countering Malicious DeepFakes: Survey, Battleground, and Horizon

    The creation or manipulation of facial appearance through deep generative approaches, known as DeepFake, have achieved significant progress and promoted a wide range of benign and malicious applications, e.g., vi...

    Felix Juefei-Xu, Run Wang, Yihao Huang in International Journal of Computer Vision (2022)

  16. No Access

    Article

    HybNet: a hybrid network structure for pain intensity estimation

    Automatic pain intensity estimation has great potential in current rehabilitation medicine, and patients’ health status information can be obtained through the analysis of facial images. At present, deep convo...

    Yibo Huang, Linbo Qing, Shengyu Xu, Lu Wang, Yonghong Peng in The Visual Computer (2022)

  17. No Access

    Chapter and Conference Paper

    Prognostic Staging System for Esophageal Cancer Using Lasso, Cox and CS-SVM

    Esophageal cancer is a heterogeneous malignant tumor with high mortality. Design constructing an effective prognostic staging system would help to improve the prognosis of patients. In this paper, blood indexe...

    Qing Liu, Wenhao Zhang, Junwei Sun in Bio-Inspired Computing: Theories and Appli… (2022)

  18. No Access

    Chapter and Conference Paper

    An Adaptive Weight Joint Loss Optimization for Dog Face Recognition

    In recent years, the field of human face recognition has developed rapidly, and a large number of deep learning methods have proven their efficiency in human face recognition. However, these methods do not wor...

    Qiwang Wang, Jiwei Song, Le Chang, Qing Tian, Zhaofeng He in Biometric Recognition (2022)

  19. No Access

    Chapter and Conference Paper

    Lightweight Image Compression Based on Deep Learning

    Deep learning based image compression (DLIC) algorithms have achieved higher compression gain than conventional algorithms. However, the large parameters and float-point operations (FLOPs) of DLIC severely lim...

    Mengyao Li, Zhengyong Wang, Liquan Shen, Qing Ding, Liangwei Yu in Artificial Intelligence (2022)

  20. No Access

    Chapter and Conference Paper

    End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

    There has been significant progress in content-based image retrieval with the development of convolutional neural networks and visual transformers. However, there are semantic gaps between high-level semantic ...

    Qing Zhang, Feilong Bao, **angdong Su in Artificial Neural Networks and Machine Lea… (2022)

previous disabled Page of 5