Search Results - Springer

Sort By Newest First Oldest First

Article

Transmission-guided multi-feature fusion Dehaze network

Image dehazing is an important direction of low-level visual tasks, and its quality and efficiency directly affect the quality of high-level visual tasks. Therefore, how to quickly and efficiently process hazy...

**aoyang Zhao, Zhuo Wang, Zhongchao Deng, Hongde Qin, Zhongben Zhu in The Visual Computer (2024)
Article

A digital speckle stereo matching algorithm based on epipolar line correction

When the digital speckle correlation method captures images under certain working conditions, the extreme tilt of the camera leads to a weak correlation between the left and right images, which in turn makes t...

Li** Liu, Boya Niu, Zhuo Xu, Songyang Zhang… in Signal, Image and Video Processing (2024)
Article

Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection

Bitemporal supervised learning paradigm always dominates remote sensing change detection using numerous labeled bitemporal image pairs, especially for high spatial resolution (HSR) remote sensing imagery. Howe...

Zhuo Zheng, Yanfei Zhong, Ailong Ma… in International Journal of Computer Vision (2024)
Article

RaSTFormer: region-aware spatiotemporal transformer for visual homogenization recognition in short videos

With the surge in network traffic, the homogenization of short video content is becoming increasingly prominent, resulting in low-quality entertainment due to proliferation and infringement. Therefore, recogni...

Shuying Zhang, **g Zhang, Hui Zhang, Li Zhuo in Neural Computing and Applications (2024)
Article

SSE-YOLOv5: a real-time fault line selection method based on lightweight modules and attention models

To address the problems of low precision and poor anti-noise performance of the standard route selection method for the small current grounding faults, a fault line selection approach based on YOLOv5 network t...

Shuai Hao, Wei Li, Xu Ma, Zhuo Tian in Journal of Real-Time Image Processing (2024)
Article

Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map

Road segmentation is a fundamental task for dynamic map in unmanned aerial vehicle (UAV) path navigation. In unplanned, unknown and even damaged areas, there are usually unpaved roads with blurred edges, defor...

Wensheng Li, **g Zhang, Jiafeng Li, Li Zhuo in The Visual Computer (2024)
Article

Open Access

MCAD: Multi-classification anomaly detection with relational knowledge distillation

With the wide application of deep learning in anomaly detection (AD), industrial vision AD has achieved remarkable success. However, current AD usually focuses on anomaly localization and rarely investigates a...

Zhuo Li, Yifei Ge, Xuebin Yue, Lin Meng in Neural Computing and Applications (2024)

Download PDF (1259 KB) View Article
Article

FSODv2: A Deep Calibrated Few-Shot Object Detection Network

Traditional methods for object detection typically necessitate a substantial amount of training data, and creating high-quality training data is time-consuming. We propose a novel Few-Shot Object Detection net...

Qi Fan, Wei Zhuo, Chi-Keung Tang, Yu-Wing Tai in International Journal of Computer Vision (2024)
Article

Unbiased scene graph generation using the self-distillation method

Scene graph generation (SGG) aims to build a structural representation for the image with the object instance and the relations between object pairs. Due to the long-tail distribution of the dataset labeling, ...

Bo Sun, Zhuo Hao, Lejun Yu, Jun He in The Visual Computer (2024)
Article

Meningioma segmentation with GV-UNet: a hybrid model using a ghost module and vision transformer

Meningiomas are the most common intracranial tumors in adults. The size and shape of a tumor mostly rely on manual measurement by a neurosurgeon. In recent years, deep learning has rapidly developed and has gr...

Hua Bai, Zhuo Zhang, Yong Yang, Chen Niu, Qiang Gao… in Signal, Image and Video Processing (2024)
Article

MVTr: multi-feature voxel transformer for 3D object detection

Convolutional neural networks have become a powerful tool for partial 3D object detection. However, their power has not been fully realized for focusing on global information, which is crucial for object detec...

Lingmei Ai, Zhuoyu **e, Ruoxia Yao, Mengyao Yang in The Visual Computer (2024)
Article

Audio steganography cover enhancement via reinforcement learning

Recent advancements in steganography analysis based on deep neural networks have led to the development of steganography schemes that incorporate deep network technology like adversarial example, GAN, and rein...

Peiwen Zhuo, Diqun Yan, Kaiyu Ying, Rangding Wang… in Signal, Image and Video Processing (2024)
Article

HDUD-Net: heterogeneous decoupling unsupervised dehaze network

Haze reduces the imaging effectiveness of outdoor vision systems, significantly degrading the quality of images; hence, reducing haze has been a focus of many studies. In recent years, decoupled representation...

Jiafeng Li, Lingyan Kuang, Jiaqi **, Li Zhuo… in Neural Computing and Applications (2024)
Article

ACX-UNet: a multi-scale lung parenchyma segmentation study with improved fusion of skip connection and circular cross-features extraction

Convolutional neural networks (CNN) are widely used in the field of computer-aided diagnosis of lung diseases. Its main tasks are segmentation of lung parenchyma, lung nodule detection and lesion analysis. Amo...

Hongbing Wu, Zhuo Zhang, Yuchen Zhang, Baoshan Sun… in Signal, Image and Video Processing (2024)
Chapter and Conference Paper

Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

In this paper, we introduce a novel framework for synthesizing novel views of objects from a single image. Leveraging the capabilities of fine-tuned diffusion models, our method combines latent 3D knowledge as...

Yifan Wen, Zitong Wang, Zhuoyuan Li… in Advances in Neural Networks – ISNN 2024 (2024)
Chapter and Conference Paper

A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement

Conventional single-channel speech enhancement methodologies have predominantly emphasized the enhancement of the amplitude spectrum while preserving the original phase spectrum. Nonetheless, this may introduc...

Qiaoyi Pan, Wenbing Jiang, Qing Zhuo, Kai Yu in Man-Machine Speech Communication (2024)
Chapter and Conference Paper

Few Shot Specific Emitter Identification Based on Triplet Loss

Deep learning-based RF fingerprinting has emerged as a crucial approach for device authentication. However, this technology often requires a large number of labelled samples practically. To address this issue,...

Yucheng Zhang, Zhuo Sun in Artificial Intelligence in China (2024)
Chapter and Conference Paper

Iterative Noisy-Target Approach: Speech Enhancement Without Clean Speech

Traditional Deep Neural Network based speech enhancement usually requires clean speech as the target of training. However, limited access to ideal clean speech hinders its practical use. Meanwhile, existing se...

Yifan Zhang, Wenbin Jiang, Qing Zhuo, Kai Yu in Man-Machine Speech Communication (2024)
Chapter and Conference Paper

3D Multi-scene Stylization Based on Conditional Neural Radiance Fields

Neural Radiation Field (NeRF) is a scene model capable of achieving high-quality view synthesis, optimized for each specific scene. In this paper, we propose a conditional neural radiation field based on multi...

Sijia Zhang, Ting Liu, Zhuoyuan Li, Yi Sun in Advances in Neural Networks – ISNN 2024 (2024)
Chapter and Conference Paper

End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search

Streaming keyword spotting (KWS) is an important technique for voice assistant wake-up. While KWS with a preset fixed keyword has been well studied, test-time customizable keyword spotting in streaming mode re...

Baochen Yang, Jiaqi Guo, Haoyu Li, Yu **, Qing Zhuo… in Man-Machine Speech Communication (2024)

351 Result(s)

Transmission-guided multi-feature fusion Dehaze network

A digital speckle stereo matching algorithm based on epipolar line correction

Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection

RaSTFormer: region-aware spatiotemporal transformer for visual homogenization recognition in short videos

SSE-YOLOv5: a real-time fault line selection method based on lightweight modules and attention models

Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map

MCAD: Multi-classification anomaly detection with relational knowledge distillation

FSODv2: A Deep Calibrated Few-Shot Object Detection Network

Unbiased scene graph generation using the self-distillation method

Meningioma segmentation with GV-UNet: a hybrid model using a ghost module and vision transformer

MVTr: multi-feature voxel transformer for 3D object detection

Audio steganography cover enhancement via reinforcement learning

HDUD-Net: heterogeneous decoupling unsupervised dehaze network

ACX-UNet: a multi-scale lung parenchyma segmentation study with improved fusion of skip connection and circular cross-features extraction

Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement

Few Shot Specific Emitter Identification Based on Triplet Loss

Iterative Noisy-Target Approach: Speech Enhancement Without Clean Speech

3D Multi-scene Stylization Based on Conditional Neural Radiance Fields

End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search

Our Content

Other Sites

Help & Contacts