Skip to main content

previous disabled Page of 18
and
  1. No Access

    Article

    Transmission-guided multi-feature fusion Dehaze network

    Image dehazing is an important direction of low-level visual tasks, and its quality and efficiency directly affect the quality of high-level visual tasks. Therefore, how to quickly and efficiently process hazy...

    **aoyang Zhao, Zhuo Wang, Zhongchao Deng, Hongde Qin, Zhongben Zhu in The Visual Computer (2024)

  2. No Access

    Article

    A digital speckle stereo matching algorithm based on epipolar line correction

    When the digital speckle correlation method captures images under certain working conditions, the extreme tilt of the camera leads to a weak correlation between the left and right images, which in turn makes t...

    Li** Liu, Boya Niu, Zhuo Xu, Songyang Zhang in Signal, Image and Video Processing (2024)

  3. No Access

    Article

    Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection

    Bitemporal supervised learning paradigm always dominates remote sensing change detection using numerous labeled bitemporal image pairs, especially for high spatial resolution (HSR) remote sensing imagery. Howe...

    Zhuo Zheng, Yanfei Zhong, Ailong Ma in International Journal of Computer Vision (2024)

  4. No Access

    Article

    RaSTFormer: region-aware spatiotemporal transformer for visual homogenization recognition in short videos

    With the surge in network traffic, the homogenization of short video content is becoming increasingly prominent, resulting in low-quality entertainment due to proliferation and infringement. Therefore, recogni...

    Shuying Zhang, **g Zhang, Hui Zhang, Li Zhuo in Neural Computing and Applications (2024)

  5. No Access

    Article

    SSE-YOLOv5: a real-time fault line selection method based on lightweight modules and attention models

    To address the problems of low precision and poor anti-noise performance of the standard route selection method for the small current grounding faults, a fault line selection approach based on YOLOv5 network t...

    Shuai Hao, Wei Li, Xu Ma, Zhuo Tian in Journal of Real-Time Image Processing (2024)

  6. No Access

    Article

    Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map

    Road segmentation is a fundamental task for dynamic map in unmanned aerial vehicle (UAV) path navigation. In unplanned, unknown and even damaged areas, there are usually unpaved roads with blurred edges, defor...

    Wensheng Li, **g Zhang, Jiafeng Li, Li Zhuo in The Visual Computer (2024)

  7. Article

    Open Access

    MCAD: Multi-classification anomaly detection with relational knowledge distillation

    With the wide application of deep learning in anomaly detection (AD), industrial vision AD has achieved remarkable success. However, current AD usually focuses on anomaly localization and rarely investigates a...

    Zhuo Li, Yifei Ge, Xuebin Yue, Lin Meng in Neural Computing and Applications (2024)

  8. No Access

    Article

    FSODv2: A Deep Calibrated Few-Shot Object Detection Network

    Traditional methods for object detection typically necessitate a substantial amount of training data, and creating high-quality training data is time-consuming. We propose a novel Few-Shot Object Detection net...

    Qi Fan, Wei Zhuo, Chi-Keung Tang, Yu-Wing Tai in International Journal of Computer Vision (2024)

  9. No Access

    Article

    Unbiased scene graph generation using the self-distillation method

    Scene graph generation (SGG) aims to build a structural representation for the image with the object instance and the relations between object pairs. Due to the long-tail distribution of the dataset labeling, ...

    Bo Sun, Zhuo Hao, Lejun Yu, Jun He in The Visual Computer (2024)

  10. No Access

    Article

    Meningioma segmentation with GV-UNet: a hybrid model using a ghost module and vision transformer

    Meningiomas are the most common intracranial tumors in adults. The size and shape of a tumor mostly rely on manual measurement by a neurosurgeon. In recent years, deep learning has rapidly developed and has gr...

    Hua Bai, Zhuo Zhang, Yong Yang, Chen Niu, Qiang Gao in Signal, Image and Video Processing (2024)

  11. No Access

    Article

    MVTr: multi-feature voxel transformer for 3D object detection

    Convolutional neural networks have become a powerful tool for partial 3D object detection. However, their power has not been fully realized for focusing on global information, which is crucial for object detec...

    Lingmei Ai, Zhuoyu **e, Ruoxia Yao, Mengyao Yang in The Visual Computer (2024)

  12. No Access

    Article

    Audio steganography cover enhancement via reinforcement learning

    Recent advancements in steganography analysis based on deep neural networks have led to the development of steganography schemes that incorporate deep network technology like adversarial example, GAN, and rein...

    Peiwen Zhuo, Diqun Yan, Kaiyu Ying, Rangding Wang in Signal, Image and Video Processing (2024)

  13. No Access

    Article

    HDUD-Net: heterogeneous decoupling unsupervised dehaze network

    Haze reduces the imaging effectiveness of outdoor vision systems, significantly degrading the quality of images; hence, reducing haze has been a focus of many studies. In recent years, decoupled representation...

    Jiafeng Li, Lingyan Kuang, Jiaqi **, Li Zhuo in Neural Computing and Applications (2024)

  14. No Access

    Article

    ACX-UNet: a multi-scale lung parenchyma segmentation study with improved fusion of skip connection and circular cross-features extraction

    Convolutional neural networks (CNN) are widely used in the field of computer-aided diagnosis of lung diseases. Its main tasks are segmentation of lung parenchyma, lung nodule detection and lesion analysis. Amo...

    Hongbing Wu, Zhuo Zhang, Yuchen Zhang, Baoshan Sun in Signal, Image and Video Processing (2024)

  15. No Access

    Chapter and Conference Paper

    Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

    In this paper, we introduce a novel framework for synthesizing novel views of objects from a single image. Leveraging the capabilities of fine-tuned diffusion models, our method combines latent 3D knowledge as...

    Yifan Wen, Zitong Wang, Zhuoyuan Li in Advances in Neural Networks – ISNN 2024 (2024)

  16. No Access

    Chapter and Conference Paper

    A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement

    Conventional single-channel speech enhancement methodologies have predominantly emphasized the enhancement of the amplitude spectrum while preserving the original phase spectrum. Nonetheless, this may introduc...

    Qiaoyi Pan, Wenbing Jiang, Qing Zhuo, Kai Yu in Man-Machine Speech Communication (2024)

  17. No Access

    Chapter and Conference Paper

    Few Shot Specific Emitter Identification Based on Triplet Loss

    Deep learning-based RF fingerprinting has emerged as a crucial approach for device authentication. However, this technology often requires a large number of labelled samples practically. To address this issue,...

    Yucheng Zhang, Zhuo Sun in Artificial Intelligence in China (2024)

  18. No Access

    Chapter and Conference Paper

    Iterative Noisy-Target Approach: Speech Enhancement Without Clean Speech

    Traditional Deep Neural Network based speech enhancement usually requires clean speech as the target of training. However, limited access to ideal clean speech hinders its practical use. Meanwhile, existing se...

    Yifan Zhang, Wenbin Jiang, Qing Zhuo, Kai Yu in Man-Machine Speech Communication (2024)

  19. No Access

    Chapter and Conference Paper

    3D Multi-scene Stylization Based on Conditional Neural Radiance Fields

    Neural Radiation Field (NeRF) is a scene model capable of achieving high-quality view synthesis, optimized for each specific scene. In this paper, we propose a conditional neural radiation field based on multi...

    Sijia Zhang, Ting Liu, Zhuoyuan Li, Yi Sun in Advances in Neural Networks – ISNN 2024 (2024)

  20. No Access

    Chapter and Conference Paper

    End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search

    Streaming keyword spotting (KWS) is an important technique for voice assistant wake-up. While KWS with a preset fixed keyword has been well studied, test-time customizable keyword spotting in streaming mode re...

    Baochen Yang, Jiaqi Guo, Haoyu Li, Yu **, Qing Zhuo in Man-Machine Speech Communication (2024)

previous disabled Page of 18