-
Article
Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes
The robustness of dense visual SLAM is still a challenging problem in dynamic environments. In this paper, we propose a novel keyframe-based dense visual SLAM to handle a highly dynamic environment by using an...
-
Article
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Large-scale contrastive vision-language pretraining has shown significant progress in visual representation learning. Unlike traditional visual systems trained by a fixed set of discrete labels, a new paradigm...
-
Chapter and Conference Paper
SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images
Timely and effective locust detection to prevent locust plagues is crucial for safeguarding agricultural production and ecological balance. However, under natural conditions, the “colour mixing mechanism” of l...
-
Chapter and Conference Paper
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites
Large language models (LLMs) have shown remarkable performance in natural language processing (NLP) tasks. To comprehend and execute diverse human instructions over image data, instruction-tuned large vision-l...
-
Chapter and Conference Paper
Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment
Face recognition has made significant progress in recent years due to deep convolutional neural networks (CNN). In many face recognition (FR) scenarios, face images are acquired from a sequence with huge intr...
-
Chapter and Conference Paper
Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization
Recent studies have witnessed that many self-supervised methods obtain clear progress on the multi-view stereo (MVS). However, existing methods ignore the edge structure information of the reconstructed target...
-
Article
Crowded pose-guided multi-task learning for instance-level human parsing
Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic l...
-
Chapter and Conference Paper
The Tenth Visual Object Tracking VOT2022 Challenge Results
The Visual Object Tracking challenge VOT2022 is the tenth annual tracker benchmarking activity organized by the VOT initiative. Results of 93 entries are presented; many are state-of-the-art trackers published...
-
Chapter and Conference Paper
MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report
Developing and integrating advanced image sensors with novel algorithms in camera systems is prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lac...
-
Chapter and Conference Paper
Frame Correlation Knowledge Distillation for Gait Recognition in the Wild
Recently, large deep models have achieved significant progress on gait recognition in the wild. However, such models come with a high cost of runtime and computational resource consumption. In this paper, we i...
-
Chapter and Conference Paper
Channel Spatial Collaborative Attention Network for Fine-Grained Classification of Cervical Cells
Accurately classifying cervical cells based on the commonly used TBS (The Bethesda System) standard is critical for building the automatic cytology diagnosing system. However, the existing two publicly availab...
-
Article
Visible-infrared person re-identification model based on feature consistency and modal indistinguishability
Visible-infrared person re-identification (VI-ReID) is used to search person images across cameras under different modalities, which can address the limitation of visible-based ReID in dark environments. Intra...
-
Article
Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation
Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run...
-
Chapter and Conference Paper
Unstructured Feature Decoupling for Vehicle Re-identification
Misalignment of features caused by pose and viewpoint variances is a crucial problem in Vehicle Re-Identification (ReID). Previous methods align the features by structuring the vehicles from pre-defined vehic...
-
Article
Embedded real-time infrared and visible image fusion for UAV surveillance
Infrared and visible image fusion is a beneficial processing task for Unmanned Aerial Vehicle (UAV) surveillance, which can improve visibility by combining the advantages of the infrared camera and the visible...
-
Article
A dedicated hardware accelerator for real-time acceleration of YOLOv2
In recent years, dedicated hardware accelerators for the acceleration of the convolutional neural network (CNN) have been extensively studied. Although many studies have presented efficient designs on FPGAs fo...
-
Article
Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation
To satisfy the stringent requirements for computational resources in the field of real-time semantic segmentation, most approaches focus on the hand-crafted design of light-weight segmentation networks. To enj...
-
Chapter and Conference Paper
FARGO: A Joint Framework for FAZ and RV Segmentation from OCTA Images
Optical coherence tomography angiography (OCTA) is a recent advance in ophthalmic imaging, which provides detailed visualization of two important anatomical landmarks, namely foveal avascular zone (FAZ) and re...
-
Chapter and Conference Paper
MTNAS: Search Multi-task Networks for Autonomous Driving
Multi-task learning (MTL) aims to learn shared representations from multiple tasks simultaneously, which has yielded outstanding performance in widespread applications of computer vision. However, existing mul...
-
Chapter and Conference Paper
The Eighth Visual Object Tracking VOT2020 Challenge Results
The Visual Object Tracking challenge VOT2020 is the eighth annual tracker benchmarking activity organized by the VOT initiative. Results of 58 trackers are presented; many are state-of-the-art trackers publish...