Search Results - Springer

Article

Mind-bridge: reconstructing visual images based on diffusion model from human brain activity

Human brain vision is mysterious and complex, and it interprets the world through the connection between the brain and the eyes. In recent years, several methods have relied on fMRI to successfully reconstruct...

Qing Liu, Hongqing Zhu, Ning Chen, Bingcang Huang… in Signal, Image and Video Processing (2024)
Article

DiffCAS: diffusion based multi-attention network for segmentation of 3D coronary artery from CT angiography

Automatic segmentation of 3D coronary arteries from computed tomography angiography (CTA) is an indispensable part of accurate and efficient coronary artery disease (CAD) diagnosis. However, it remains challen...

Jiajia Li, Qing Wu, Yuanquan Wang, Shoujun Zhou… in Signal, Image and Video Processing (2024)
Article

Spatially-Varying Illumination-Aware Indoor Harmonization

In this paper, we address the problem of spatially-varying illumination-aware indoor harmonization. Existing image harmonization works either focus on extracting no more than 2D information (e.g., low-level st...

Zhongyun Hu, Jiahao Li, Xue Wang, Qing Wang in International Journal of Computer Vision (2024)
Article

SCA-YOLOv4: you only look once with squeeze-and-excitation, coordinate attention and adaptively spatial feature fusion

How to effectively and efficiently identify multi-scale objects is one of the key challenges in object detection. In order to make the classification and regression of single-stage object detector more accurat...

Pengfei Liu, Qing Wang in Signal, Image and Video Processing (2024)
Article

ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation

Feature fusion module is an essential component of real-time semantic segmentation networks to bridge the semantic gap among different feature layers. However, many networks are inefficient in multi-level feat...

Ya Li, Ziming Li, Huiwang Liu, Qing Wang in The Visual Computer (2024)
Article

Deep learning based insulator fault detection algorithm for power transmission lines

Aiming at the complex background of transmission lines at the present stage, which leads to the problem of low accuracy of insulator fault detection for small targets, a deep learning-based insulator fault det...

Han Wang, Qing Yang, Binlin Zhang, Dexin Gao in Journal of Real-Time Image Processing (2024)
Article

ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

With the rapid development of depth sensors, more and more RGB-D videos can be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...

Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu… in International Journal of Computer Vision (2024)
Article

Open Access

Joint training with local soft attention and dual cross-neighbor label smoothing for unsupervised person re-identification

Existing unsupervised person re-identification approaches fail to fully capture the fine-grained features of local regions, which can result in people with similar appearances and different identities being as...

Qing Han, Longfei Li, Weidong Min, Qi Wang, Qingpeng Zeng… in Computational Visual Media (2024)

Article

Harmonizing local and global features: enhanced hand gesture segmentation using synergistic fusion of CNN and transformer networks

Hand gesture segmentation is an important research topic in computer vision. Despite ongoing efforts, achieving optimal gesture segmentation remains challenging, attributed to factors like gesture morphology a...

Shi Wang, Ning Yang, Maohua Liu, Qing Tian… in Signal, Image and Video Processing (2024)
Article

Offline handwritten mathematical expression recognition based on YOLOv5s

The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwrit...

Fei Li, Hongbo Fang, Dengzhun Wang, Ruixin Liu, Qing Hou… in The Visual Computer (2024)
Article

Machine reading comprehension model based on query reconstruction technology and deep learning

Machine reading comprehension is introduced to improve machines’ readability and understandability of human languages. This sophisticated version of natural language processing is used for testing and improvin...

Pengming Wang, M. M. Kamruzzaman, Qing Chen in Neural Computing and Applications (2024)
Chapter and Conference Paper

Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization

The scarcity of labeled audio-visual datasets is a constraint for training superior audio-visual speaker diarization systems. To improve the performance of audio-visual speaker diarization, we leverage pre-tra...

Huan Zhao, Li Zhang, Yue Li, Yannan Wang, Hongji Wang… in Man-Machine Speech Communication (2024)
Article

Towards High-Resolution Specular Highlight Detection

Specular highlight detection is an essential task with various applications in computer vision. This paper aims to detect specular highlights in single high-resolution images using deep learning while avoiding...

Gang Fu, Qing Zhang, Lei Zhu, Qifeng Lin… in International Journal of Computer Vision (2024)
Article

ACKSNet: adaptive center keypoint selection for object detection

Keypoint-based detectors generate a large number of false positives due to incorrect keypoint matching in the object detection task. In this paper, we propose an adaptive center keypoint selection method (ACKS...

Xingzhu Liang, Lixin Wang, Wei Cheng, Xinyun Yan, Qing Chen in The Visual Computer (2023)
Article

Cluster-based two-branch framework for point cloud attribute compression

Owing to the irregular distribution of point clouds in 3D space, effectively compressing the point cloud is still challenging. Recently, numerous compression methods have been developed with outstanding perfor...

Longhua Sun, ** Wang, Qing Zhu, Jiaying Liu, Jiawen Yu in The Visual Computer (2023)
Article

ZRDNet: zero-reference image defogging by physics-based decomposition–reconstruction mechanism and perception fusion

This paper investigates challenging fully unsupervised defogging problems, i.e., how to remove fog by feeding only foggy images in deep neural networks rather than using paired or unpaired synthetic images, an...

Zi-Xin Li, Yu-Long Wang, Qing-Long Han, Chen Peng in The Visual Computer (2023)
Article

Trade-off background joint learning for unsupervised vehicle re-identification

Existing vehicle re-identification (Re-ID) methods either extract valuable background information to enhance the robustness of the vehicle model or segment background interference information to learn vehicle ...

Sheng Wang, Qi Wang, Weidong Min, Qing Han, Di Gai, Haowen Luo in The Visual Computer (2023)
Article

Open Access

Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential

The large language model called ChatGPT has drawn extensive attention because of its human-like expression and reasoning abilities. In this study, we investigate the feasibility of using ChatGPT in experimen...

Qing Lyu, Josh Tan, Michael E. Zapadka… in Visual Computing for Industry, Biomedicine… (2023)

Article

VGT-MOT: visibility-guided tracking for online multiple-object tracking

Multi-object tracking (MOT) is an important task of computer vision which has a wide range of applications. Existing multi-object tracking methods mostly employ the Kalman filter to predict the object location...

Shuai Wang, Wei-** Li, Lu Wang, Li-Sheng Xu… in Machine Vision and Applications (2023)
Chapter and Conference Paper

CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting

Crowd counting is a crucial task in computer vision, offering numerous applications in smart security, remote sensing, agriculture and forestry. While pure image-based models have made significant advancements...

Jialu Cai, Qing Wang, Shengqin Jiang in Biometric Recognition (2023)

112 Result(s)
