112 Result(s)
-
Article
Mind-bridge: reconstructing visual images based on diffusion model from human brain activity
Human brain vision is mysterious and complex, and it interprets the world through the connection between the brain and the eyes. In recent years, several methods have relied on fMRI to successfully reconstruct...
-
Article
DiffCAS: diffusion based multi-attention network for segmentation of 3D coronary artery from CT angiography
Automatic segmentation of 3D coronary arteries from computed tomography angiography (CTA) is an indispensable part of accurate and efficient coronary artery disease (CAD) diagnosis. However, it remains challen...
-
Article
Spatially-Varying Illumination-Aware Indoor Harmonization
In this paper, we address the problem of spatially-varying illumination-aware indoor harmonization. Existing image harmonization works either focus on extracting no more than 2D information (e.g., low-level st...
-
Article
SCA-YOLOv4: you only look once with squeeze-and-excitation, coordinate attention and adaptively spatial feature fusion
How to effectively and efficiently identify multi-scale objects is one of the key challenges in object detection. In order to make the classification and regression of single-stage object detectors more accurat...
-
Article
ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation
The feature fusion module is an essential component of real-time semantic segmentation networks to bridge the semantic gap among different feature layers. However, many networks are inefficient in multi-level feat...
-
Article
Deep learning based insulator fault detection algorithm for power transmission lines
Aiming at the complex background of transmission lines at the present stage, which leads to the problem of low accuracy of insulator fault detection for small targets, a deep learning-based insulator fault det...
-
Article
ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection
With the rapid development of depth sensors, more and more RGB-D videos can be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...
-
Article
Open Access
Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification
Existing unsupervised person re-identification approaches fail to fully capture the fine-grained features of local regions, which can result in people with similar appearances and different identities being as...
-
Article
Harmonizing local and global features: enhanced hand gesture segmentation using synergistic fusion of CNN and transformer networks
Hand gesture segmentation is an important research topic in computer vision. Despite ongoing efforts, achieving optimal gesture segmentation remains challenging, attributed to factors like gesture morphology a...
-
Article
Offline handwritten mathematical expression recognition based on YOLOv5s
The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwrit...
-
Article
Machine reading comprehension model based on query reconstruction technology and deep learning
Machine reading comprehension is introduced to improve machines’ readability and understandability of human languages. This sophisticated version of natural language processing is used for testing and improvin...
-
Chapter and Conference Paper
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization
The scarcity of labeled audio-visual datasets is a constraint for training superior audio-visual speaker diarization systems. To improve the performance of audio-visual speaker diarization, we leverage pre-tra...
-
Article
Towards High-Resolution Specular Highlight Detection
Specular highlight detection is an essential task with various applications in computer vision. This paper aims to detect specular highlights in single high-resolution images using deep learning while avoiding...
-
Article
ACKSNet: adaptive center keypoint selection for object detection
Keypoint-based detectors generate a large number of false positives due to incorrect keypoint matching in the object detection task. In this paper, we propose an adaptive center keypoint selection method (ACKS...
-
Article
Cluster-based two-branch framework for point cloud attribute compression
Owing to the irregular distribution of point clouds in 3D space, effectively compressing the point cloud is still challenging. Recently, numerous compression methods have been developed with outstanding perfor...
-
Article
ZRDNet: zero-reference image defogging by physics-based decomposition–reconstruction mechanism and perception fusion
This paper investigates challenging fully unsupervised defogging problems, i.e., how to remove fog by feeding only foggy images in deep neural networks rather than using paired or unpaired synthetic images, an...
-
Article
Trade-off background joint learning for unsupervised vehicle re-identification
Existing vehicle re-identification (Re-ID) methods either extract valuable background information to enhance the robustness of the vehicle model or segment background interference information to learn vehicle ...
-
Article
Open Access
Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential
The large language model called ChatGPT has drawn extensive attention because of its human-like expression and reasoning abilities. In this study, we investigate the feasibility of using ChatGPT in experimen...
-
Article
VGT-MOT: visibility-guided tracking for online multiple-object tracking
Multi-object tracking (MOT) is an important computer vision task with a wide range of applications. Existing multi-object tracking methods mostly employ the Kalman filter to predict the object location...
-
Chapter and Conference Paper
CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting
Crowd counting is a crucial task in computer vision, offering numerous applications in smart security, remote sensing, agriculture and forestry. While pure image-based models have made significant advancements...