Skip to main content

previous disabled Page of 3
and
  1. No Access

    Article

    Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes

    The robustness of dense visual SLAM is still a challenging problem in dynamic environments. In this paper, we propose a novel keyframe-based dense visual SLAM to handle a highly dynamic environment by using an...

    Wugen Zhou, **aodong Peng, Yun Li, Mingrui Fan, Bo Liu in Machine Vision and Applications (2024)

  2. No Access

    Article

    CLIP-Adapter: Better Vision-Language Models with Feature Adapters

    Large-scale contrastive vision-language pretraining has shown significant progress in visual representation learning. Unlike traditional visual systems trained by a fixed set of discrete labels, a new paradigm...

    Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma in International Journal of Computer Vision (2024)

  3. No Access

    Chapter and Conference Paper

    SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images

    Timely and effective locust detection to prevent locust plagues is crucial for safeguarding agricultural production and ecological balance. However, under natural conditions, the “colour mixing mechanism” of l...

    Bei Liu, Jian Zhang, Tianwen Yuan, Peng Huang, Chengwei Feng in MultiMedia Modeling (2024)

  4. No Access

    Chapter and Conference Paper

    Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

    Large language models (LLMs) have shown remarkable performance in natural language processing (NLP) tasks. To comprehend and execute diverse human instructions over image data, instruction-tuned large vision-l...

    Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim in MultiMedia Modeling (2024)

  5. No Access

    Chapter and Conference Paper

    Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment

    Face recognition has made significant progress in recent years due to deep convolutional neural networks (CNN). In many face recognition (FR) scenarios, face images are acquired from a sequence with huge intr...

    Baoyun Peng, Min Liu, Zhaoning Zhang, Kai Xu, Dongsheng Li in Computational Visual Media (2024)

  6. No Access

    Chapter and Conference Paper

    Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization

    Recent studies have witnessed that many self-supervised methods obtain clear progress on the multi-view stereo (MVS). However, existing methods ignore the edge structure information of the reconstructed target...

    Pan Li, Su** Wu, **tie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang in MultiMedia Modeling (2024)

  7. No Access

    Article

    Crowded pose-guided multi-task learning for instance-level human parsing

    Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic l...

    Yong Wei, Li Liu, **aodong Fu, LiJun Liu, Wei Peng in Machine Vision and Applications (2023)

  8. No Access

    Chapter and Conference Paper

    The Tenth Visual Object Tracking VOT2022 Challenge Results

    The Visual Object Tracking challenge VOT2022 is the tenth annual tracker benchmarking activity organized by the VOT initiative. Results of 93 entries are presented; many are state-of-the-art trackers published...

    Matej Kristan, Aleš Leonardis, Jiří Matas in Computer Vision – ECCV 2022 Workshops (2023)

  9. No Access

    Chapter and Conference Paper

    MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

    Develo** and integrating advanced image sensors with novel algorithms in camera systems is prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lac...

    Wenxiu Sun, Qingpeng Zhu, Chongyi Li in Computer Vision – ECCV 2022 Workshops (2023)

  10. No Access

    Chapter and Conference Paper

    Frame Correlation Knowledge Distillation for Gait Recognition in the Wild

    Recently, large deep models have achieved significant progress on gait recognition in the wild. However, such models come with a high cost of runtime and computational resource consumption. In this paper, we i...

    Guozhen Peng, Shaoxiong Zhang, Yuwei Zhao, Annan Li, Yunhong Wang in Biometric Recognition (2023)

  11. No Access

    Chapter and Conference Paper

    Channel Spatial Collaborative Attention Network for Fine-Grained Classification of Cervical Cells

    Accurately classifying cervical cells based on the commonly used TBS (The Bethesda System) standard is critical for building the automatic cytology diagnosing system. However, the existing two publicly availab...

    Peng Jiang, Juan Liu, Hua Chen, Cheng Li, Baochuan Pang in Neural Information Processing (2023)

  12. No Access

    Article

    Visible-infrared person re-identification model based on feature consistency and modal indistinguishability

    Visible-infrared person re-identification (VI-ReID) is used to search person images across cameras under different modalities, which can address the limitation of visible-based ReID in dark environments. Intra...

    Jia Sun, Yanfeng Li, Hou** Chen, Yahui Peng, **lei Zhu in Machine Vision and Applications (2022)

  13. No Access

    Article

    Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

    Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run...

    Peng Ye, Baopu Li, Tao Chen, Jiayuan Fan in International Journal of Computer Vision (2022)

  14. No Access

    Chapter and Conference Paper

    Unstructured Feature Decoupling for Vehicle Re-identification

     misalignment of features caused by pose and viewpoint variances is a crucial problem in Vehicle Re-Identification (ReID). Previous methods align the features by structuring the vehicles from pre-defined vehic...

    Wen Qian, Hao Luo, Silong Peng, Fan Wang, Chen Chen, Hao Li in Computer Vision – ECCV 2022 (2022)

  15. No Access

    Article

    Embedded real-time infrared and visible image fusion for UAV surveillance

    Infrared and visible image fusion is a beneficial processing task for Unmanned Aerial Vehicle (UAV) surveillance, which can improve visibility by combining the advantages of the infrared camera and the visible...

    Jun Li, Yuanxi Peng, Tian Jiang in Journal of Real-Time Image Processing (2021)

  16. No Access

    Article

    A dedicated hardware accelerator for real-time acceleration of YOLOv2

    In recent years, dedicated hardware accelerators for the acceleration of the convolutional neural network (CNN) have been extensively studied. Although many studies have presented efficient designs on FPGAs fo...

    Ke Xu, **aoyun Wang, **nyang Liu, Changfeng Cao in Journal of Real-Time Image Processing (2021)

  17. No Access

    Article

    Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

    To satisfy the stringent requirements for computational resources in the field of real-time semantic segmentation, most approaches focus on the hand-crafted design of light-weight segmentation networks. To enj...

    Peng Sun, Jiaxiang Wu, Songyuan Li, Peiwen Lin in International Journal of Computer Vision (2021)

  18. No Access

    Chapter and Conference Paper

    FARGO: A Joint Framework for FAZ and RV Segmentation from OCTA Images

    Optical coherence tomography angiography (OCTA) is a recent advance in ophthalmic imaging, which provides detailed visualization of two important anatomical landmarks, namely foveal avascular zone (FAZ) and re...

    Linkai Peng, Li Lin, Pu** Cheng, Zhonghua Wang in Ophthalmic Medical Image Analysis (2021)

  19. No Access

    Chapter and Conference Paper

    MTNAS: Search Multi-task Networks for Autonomous Driving

    Multi-task learning (MTL) aims to learn shared representations from multiple tasks simultaneously, which has yielded outstanding performance in widespread applications of computer vision. However, existing mul...

    Hao Liu, Dong Li, **Zhang Peng, Qingjie Zhao, Lu Tian in Computer Vision – ACCV 2020 (2021)

  20. No Access

    Chapter and Conference Paper

    The Eighth Visual Object Tracking VOT2020 Challenge Results

    The Visual Object Tracking challenge VOT2020 is the eighth annual tracker benchmarking activity organized by the VOT initiative. Results of 58 trackers are presented; many are state-of-the-art trackers publish...

    Matej Kristan, Aleš Leonardis, Jiří Matas in Computer Vision – ECCV 2020 Workshops (2020)

previous disabled Page of 3