Skip to main content

previous disabled Page of 5
and
  1. No Access

    Article

    A new virtual interpolation technology with range as object

    Virtual interpolation technology can be applied to direction-of-arrival (DOA) estimation as a preprocessing technique to achieve the DOA estimation for any array. In order to solve the angle-sensitive problem ...

    Tao Li, Yunxiu Yang, Wendong Chen, Qin Shu in Signal, Image and Video Processing (2024)

  2. No Access

    Article

    DML-YOLOv8-SAR image object detection algorithm

    Given the challenges posed by noise and varying target scales in SAR images, conventional convolutional neural networks often underperform in SAR image detection. To address this, this paper introduces a novel...

    Shuguang Zhao, Ronghao Tao, Fengde Jia in Signal, Image and Video Processing (2024)

  3. No Access

    Article

    Using improved YOLO V5s to recognize tomatoes in a continuous working environment

    In the continuous working environment of the picking robots, factors such as illumination change, camera hardware, the movement of the picking robots, and image background interference have a great impact on t...

    Guohua Gao, Ciyin Shuai, Shuangyou Wang, Tao Ding in Signal, Image and Video Processing (2024)

  4. No Access

    Article

    Shuff-BiseNet: a dual-branch segmentation network for pavement cracks

    In order to accurately obtain the shape and size of pavement cracks, analyze the severity of pavement cracks, avoid deterioration of the situation, and take timely measures, we proposed a dual-branch structure...

    Haiqun Wang, Bingnan Wang, Tao Zhao in Signal, Image and Video Processing (2024)

  5. No Access

    Article

    Research on image caption generation method based on multi-modal pre-training model and text mixup optimization

    In recent years, multi-modal pre-training models have demonstrated remarkable cross-modal representation capabilities, catalyzing the rapid evolution of multi-modal downstream tasks, particularly in image capt...

    **g-Tao Sun, Xuan Min in Signal, Image and Video Processing (2024)

  6. No Access

    Article

    YOLO-MTG: a lightweight YOLO model for multi-target garbage detection

    With wide adoption of deep learning technology in AI, intelligent garbage detection has become a hot research topic. However, existing datasets currently used for garbage detection rarely involves multi-catego...

    Zhongyi **a, Houkui Zhou, Huimin Yu, Haoji Hu in Signal, Image and Video Processing (2024)

  7. No Access

    Article

    An effective masked transformer network for image denoising

    The rising popularity of employing deep learning networks for image denoising can be observed over the past decade. Typically, their exceptional performance is rooted in their ability to learn the map** from...

    Shao** Xu, Nan **ao, Wuyong Tao, Changfei Zhou in Signal, Image and Video Processing (2024)

  8. No Access

    Article

    Human risky behaviour recognition during ladder climbing based on multi-modal feature fusion and adaptive graph convolutional network

    Human falls during ladder climbing are typically instantaneous, making the timely and accurate determination of security risks during ladder climbing a challenging engineering issue. A skeleton-based behaviour...

    Wenrui Zhu, Donghui Shi, Rui Cheng, Ruifeng Huang in Signal, Image and Video Processing (2024)

  9. No Access

    Article

    Boosting image denoising effect via low-level noise injection

    In the past decade, supervised denoising models trained on large datasets have demonstrated impressive performance in image denoising due to their superior denoising effect. However, these models lack flexibil...

    Jian **ao, **aohui Cheng, Shao** Xu, Wuyong Tao in Signal, Image and Video Processing (2024)

  10. No Access

    Article

    Particle recognition and shape parameter detection based on deep learning

    The size and shape parameters of sand particles are closely related to their geophysical and geomechanical properties. It is challenging to accurately identify sand particles and calculate their shape paramete...

    Xuan Li, Zhou Yang, **nyu Tao, **aojie Wang in Signal, Image and Video Processing (2024)

  11. No Access

    Chapter and Conference Paper

    MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification

    Video super-resolution is a pivotal task that involves the recovery of high-resolution video frames from their low-resolution counterparts, possessing a multitude of applications in real-world scenarios. Withi...

    Zixuan Hong, Weipeng Cao, Zhiwu Xu, Zhenru Chen, ** Tao, Zhong Ming in MultiMedia Modeling (2024)

  12. No Access

    Chapter and Conference Paper

    A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue

    The task of Video-Grounded Dialogue involves develo** a multimodal chatbot capable of answering sequential questions from humans regarding video content, audio, captions and dialog history. Although existing...

    Feifei Xu, Wang Zhou, Tao Sun, Jiahao Lu, Ziheng Yu, Guangzhen Li in MultiMedia Modeling (2024)

  13. No Access

    Chapter and Conference Paper

    High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification

    Reversible data hiding in encrypted images (RDHEI) can simultaneously achieve secure transmission of images and secret storage of embedded additional data, which can be used for cloud storage and privacy prote...

    Tao Zhang, Ju Zhang, Yicheng Zou, Yu Zhang in MultiMedia Modeling (2024)

  14. No Access

    Article

    An attention-erasing stripe pyramid network for face forgery detection

    Face forgery detection aims to distinguish between real and fake facial images or videos by identifying manipulated or forged visual media. The main challenge in face forgery detection is achieving high model ...

    Zhenwu Hu, Qianyue Duan, PeiYu Zhang, Huanjie Tao in Signal, Image and Video Processing (2023)

  15. No Access

    Article

    Polarization image fusion based on grouped densely connected network

    The degree of linear polarization has detailed features such as contour, texture, and roughness of the object, while the intensity image (S0) receives reflected and transmitted light, which contains rich backg...

    **n Chen, Shenglai Zhen, Tao Lv, Benli Yu in Signal, Image and Video Processing (2023)

  16. No Access

    Article

    MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification

    Pedestrian re-identification is highly dependent on discriminative features that enable images to encapsulate an arbitrary combination of multiple scales by different spatial scales. However, current models di...

    Keyu Pan, Yishi Zhao, Tao Wang, Shihong Yao in Signal, Image and Video Processing (2023)

  17. No Access

    Article

    Learning discriminative features for person re-identification via multi-spectral channel attention

    Person re-identification (Re-ID) aims to match a particular person captured by different cameras, which has great potential in video surveillance. However, Re-ID is still challenging due to occlusions, misalig...

    Qianyue Duan, Zhenwu Hu, Minghao Lu, Huanjie Tao in Signal, Image and Video Processing (2023)

  18. No Access

    Article

    Severe motion blurred silkworm pupae image restoration in sex discrimination

    The task of sex determination of silkworm pupae is usually accomplished using machine learning technology. However, the captured image included much blur because of live silkworm pupae’s writhing, which makes ...

    Guangying Qiu, Qingying Li, Dan Tao, Housheng Su in Signal, Image and Video Processing (2023)

  19. No Access

    Article

    Multi-feature aggregation network for salient object detection

    As the network deepens, the boundary information and small object will be lost, which result in blurred edges and incomplete salient detection. In this paper, we propose a multi-feature aggregated network to s...

    Hu Huang, ** Liu, Yanzhao Wang, Tongchi Zhou in Signal, Image and Video Processing (2023)

  20. No Access

    Article

    Fake license plate recognition in surveillance videos

    Fake license plate (FLP) recognition aims to identify modified, defaced or forged license plates in traffic videos and images. The recognition result, that is, the vehicle with an FLP, is important information...

    Wei Pan, **n Zhou, Tao Zhou, Yuanyuan Chen in Signal, Image and Video Processing (2023)

previous disabled Page of 5