Search Results - Springer

Article

Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes

The robustness of dense visual SLAM is still a challenging problem in dynamic environments. In this paper, we propose a novel keyframe-based dense visual SLAM to handle a highly dynamic environment by using an...

Wugen Zhou, **aodong Peng, Yun Li, Mingrui Fan, Bo Liu in Machine Vision and Applications (2024)

Article

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Large-scale contrastive vision-language pretraining has shown significant progress in visual representation learning. Unlike traditional visual systems trained by a fixed set of discrete labels, a new paradigm...

Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma… in International Journal of Computer Vision (2024)

Chapter and Conference Paper

SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images

Timely and effective locust detection to prevent locust plagues is crucial for safeguarding agricultural production and ecological balance. However, under natural conditions, the “colour mixing mechanism” of l...

Bei Liu, Jian Zhang, Tianwen Yuan, Peng Huang, Chengwei Feng… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

Large language models (LLMs) have shown remarkable performance in natural language processing (NLP) tasks. To comprehend and execute diverse human instructions over image data, instruction-tuned large vision-l...

Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim in MultiMedia Modeling (2024)

Chapter and Conference Paper

Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment

Face recognition has made significant progress in recent years due to deep convolutional neural networks (CNN). In many face recognition (FR) scenarios, face images are acquired from a sequence with huge intr...

Baoyun Peng, Min Liu, Zhaoning Zhang, Kai Xu, Dongsheng Li in Computational Visual Media (2024)

Chapter and Conference Paper

Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization

Recent studies have witnessed that many self-supervised methods obtain clear progress on the multi-view stereo (MVS). However, existing methods ignore the edge structure information of the reconstructed target...

Pan Li, Su** Wu, **tie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang in MultiMedia Modeling (2024)

Article

Crowded pose-guided multi-task learning for instance-level human parsing

Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic l...

Yong Wei, Li Liu, **aodong Fu, LiJun Liu, Wei Peng in Machine Vision and Applications (2023)

Chapter and Conference Paper

The Tenth Visual Object Tracking VOT2022 Challenge Results

The Visual Object Tracking challenge VOT2022 is the tenth annual tracker benchmarking activity organized by the VOT initiative. Results of 93 entries are presented; many are state-of-the-art trackers published...

Matej Kristan, Aleš Leonardis, Jiří Matas… in Computer Vision – ECCV 2022 Workshops (2023)

Chapter and Conference Paper

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

Develo** and integrating advanced image sensors with novel algorithms in camera systems is prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lac...

Wenxiu Sun, Qingpeng Zhu, Chongyi Li… in Computer Vision – ECCV 2022 Workshops (2023)

Chapter and Conference Paper

Frame Correlation Knowledge Distillation for Gait Recognition in the Wild

Recently, large deep models have achieved significant progress on gait recognition in the wild. However, such models come with a high cost of runtime and computational resource consumption. In this paper, we i...

Guozhen Peng, Shaoxiong Zhang, Yuwei Zhao, Annan Li, Yunhong Wang in Biometric Recognition (2023)

Chapter and Conference Paper

Channel Spatial Collaborative Attention Network for Fine-Grained Classification of Cervical Cells

Accurately classifying cervical cells based on the commonly used TBS (The Bethesda System) standard is critical for building the automatic cytology diagnosing system. However, the existing two publicly availab...

Peng Jiang, Juan Liu, Hua Chen, Cheng Li, Baochuan Pang… in Neural Information Processing (2023)

Article

Visible-infrared person re-identification model based on feature consistency and modal indistinguishability

Visible-infrared person re-identification (VI-ReID) is used to search person images across cameras under different modalities, which can address the limitation of visible-based ReID in dark environments. Intra...

Jia Sun, Yanfeng Li, Hou** Chen, Yahui Peng, **lei Zhu in Machine Vision and Applications (2022)

Article

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run...

Peng Ye, Baopu Li, Tao Chen, Jiayuan Fan… in International Journal of Computer Vision (2022)

Chapter and Conference Paper

Unstructured Feature Decoupling for Vehicle Re-identification

misalignment of features caused by pose and viewpoint variances is a crucial problem in Vehicle Re-Identification (ReID). Previous methods align the features by structuring the vehicles from pre-defined vehic...

Wen Qian, Hao Luo, Silong Peng, Fan Wang, Chen Chen, Hao Li in Computer Vision – ECCV 2022 (2022)

Article

Embedded real-time infrared and visible image fusion for UAV surveillance

Infrared and visible image fusion is a beneficial processing task for Unmanned Aerial Vehicle (UAV) surveillance, which can improve visibility by combining the advantages of the infrared camera and the visible...

Jun Li, Yuanxi Peng, Tian Jiang in Journal of Real-Time Image Processing (2021)

Article

A dedicated hardware accelerator for real-time acceleration of YOLOv2

In recent years, dedicated hardware accelerators for the acceleration of the convolutional neural network (CNN) have been extensively studied. Although many studies have presented efficient designs on FPGAs fo...

Ke Xu, **aoyun Wang, **nyang Liu, Changfeng Cao… in Journal of Real-Time Image Processing (2021)

Article

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

To satisfy the stringent requirements for computational resources in the field of real-time semantic segmentation, most approaches focus on the hand-crafted design of light-weight segmentation networks. To enj...

Peng Sun, Jiaxiang Wu, Songyuan Li, Peiwen Lin… in International Journal of Computer Vision (2021)

Chapter and Conference Paper

FARGO: A Joint Framework for FAZ and RV Segmentation from OCTA Images

Optical coherence tomography angiography (OCTA) is a recent advance in ophthalmic imaging, which provides detailed visualization of two important anatomical landmarks, namely foveal avascular zone (FAZ) and re...

Linkai Peng, Li Lin, Pu** Cheng, Zhonghua Wang… in Ophthalmic Medical Image Analysis (2021)

Chapter and Conference Paper

MTNAS: Search Multi-task Networks for Autonomous Driving

Multi-task learning (MTL) aims to learn shared representations from multiple tasks simultaneously, which has yielded outstanding performance in widespread applications of computer vision. However, existing mul...

Hao Liu, Dong Li, **Zhang Peng, Qingjie Zhao, Lu Tian… in Computer Vision – ACCV 2020 (2021)

Chapter and Conference Paper

The Eighth Visual Object Tracking VOT2020 Challenge Results

The Visual Object Tracking challenge VOT2020 is the eighth annual tracker benchmarking activity organized by the VOT initiative. Results of 58 trackers are presented; many are state-of-the-art trackers publish...

Matej Kristan, Aleš Leonardis, Jiří Matas… in Computer Vision – ECCV 2020 Workshops (2020)

50 Result(s)

Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment

Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization

Crowded pose-guided multi-task learning for instance-level human parsing

The Tenth Visual Object Tracking VOT2022 Challenge Results

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

Frame Correlation Knowledge Distillation for Gait Recognition in the Wild

Channel Spatial Collaborative Attention Network for Fine-Grained Classification of Cervical Cells

Visible-infrared person re-identification model based on feature consistency and modal indistinguishability

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Unstructured Feature Decoupling for Vehicle Re-identification

Embedded real-time infrared and visible image fusion for UAV surveillance

A dedicated hardware accelerator for real-time acceleration of YOLOv2

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

FARGO: A Joint Framework for FAZ and RV Segmentation from OCTA Images

MTNAS: Search Multi-task Networks for Autonomous Driving

The Eighth Visual Object Tracking VOT2020 Challenge Results

Our Content

Other Sites

Help & Contacts