Search Results - Springer

Article

A new virtual interpolation technology with range as object

Virtual interpolation technology can be applied to direction-of-arrival (DOA) estimation as a preprocessing technique to achieve the DOA estimation for any array. In order to solve the angle-sensitive problem ...

Tao Li, Yunxiu Yang, Wendong Chen, Qin Shu in Signal, Image and Video Processing (2024)

Article

DML-YOLOv8-SAR image object detection algorithm

Given the challenges posed by noise and varying target scales in SAR images, conventional convolutional neural networks often underperform in SAR image detection. To address this, this paper introduces a novel...

Shuguang Zhao, Ronghao Tao, Fengde Jia in Signal, Image and Video Processing (2024)

Article

Using improved YOLO V5s to recognize tomatoes in a continuous working environment

In the continuous working environment of the picking robots, factors such as illumination change, camera hardware, the movement of the picking robots, and image background interference have a great impact on t...

Guohua Gao, Ciyin Shuai, Shuangyou Wang, Tao Ding in Signal, Image and Video Processing (2024)

Article

Shuff-BiseNet: a dual-branch segmentation network for pavement cracks

In order to accurately obtain the shape and size of pavement cracks, analyze the severity of pavement cracks, avoid deterioration of the situation, and take timely measures, we proposed a dual-branch structure...

Haiqun Wang, Bingnan Wang, Tao Zhao in Signal, Image and Video Processing (2024)

Article

Research on image caption generation method based on multi-modal pre-training model and text mixup optimization

In recent years, multi-modal pre-training models have demonstrated remarkable cross-modal representation capabilities, catalyzing the rapid evolution of multi-modal downstream tasks, particularly in image capt...

**g-Tao Sun, Xuan Min in Signal, Image and Video Processing (2024)

Article

YOLO-MTG: a lightweight YOLO model for multi-target garbage detection

With wide adoption of deep learning technology in AI, intelligent garbage detection has become a hot research topic. However, existing datasets currently used for garbage detection rarely involves multi-catego...

Zhongyi **a, Houkui Zhou, Huimin Yu, Haoji Hu… in Signal, Image and Video Processing (2024)

Article

An effective masked transformer network for image denoising

The rising popularity of employing deep learning networks for image denoising can be observed over the past decade. Typically, their exceptional performance is rooted in their ability to learn the map** from...

Shao** Xu, Nan **ao, Wuyong Tao, Changfei Zhou… in Signal, Image and Video Processing (2024)

Article

Human risky behaviour recognition during ladder climbing based on multi-modal feature fusion and adaptive graph convolutional network

Human falls during ladder climbing are typically instantaneous, making the timely and accurate determination of security risks during ladder climbing a challenging engineering issue. A skeleton-based behaviour...

Wenrui Zhu, Donghui Shi, Rui Cheng, Ruifeng Huang… in Signal, Image and Video Processing (2024)

Article

Boosting image denoising effect via low-level noise injection

In the past decade, supervised denoising models trained on large datasets have demonstrated impressive performance in image denoising due to their superior denoising effect. However, these models lack flexibil...

Jian **ao, **aohui Cheng, Shao** Xu, Wuyong Tao… in Signal, Image and Video Processing (2024)

Article

Particle recognition and shape parameter detection based on deep learning

The size and shape parameters of sand particles are closely related to their geophysical and geomechanical properties. It is challenging to accurately identify sand particles and calculate their shape paramete...

Xuan Li, Zhou Yang, **nyu Tao, **aojie Wang… in Signal, Image and Video Processing (2024)

Chapter and Conference Paper

MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification

Video super-resolution is a pivotal task that involves the recovery of high-resolution video frames from their low-resolution counterparts, possessing a multitude of applications in real-world scenarios. Withi...

Zixuan Hong, Weipeng Cao, Zhiwu Xu, Zhenru Chen, ** Tao, Zhong Ming… in MultiMedia Modeling (2024)

Chapter and Conference Paper

A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue

The task of Video-Grounded Dialogue involves develo** a multimodal chatbot capable of answering sequential questions from humans regarding video content, audio, captions and dialog history. Although existing...

Feifei Xu, Wang Zhou, Tao Sun, Jiahao Lu, Ziheng Yu, Guangzhen Li in MultiMedia Modeling (2024)

Chapter and Conference Paper

High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification

Reversible data hiding in encrypted images (RDHEI) can simultaneously achieve secure transmission of images and secret storage of embedded additional data, which can be used for cloud storage and privacy prote...

Tao Zhang, Ju Zhang, Yicheng Zou, Yu Zhang in MultiMedia Modeling (2024)

Article

An attention-erasing stripe pyramid network for face forgery detection

Face forgery detection aims to distinguish between real and fake facial images or videos by identifying manipulated or forged visual media. The main challenge in face forgery detection is achieving high model ...

Zhenwu Hu, Qianyue Duan, PeiYu Zhang, Huanjie Tao in Signal, Image and Video Processing (2023)

Article

Polarization image fusion based on grouped densely connected network

The degree of linear polarization has detailed features such as contour, texture, and roughness of the object, while the intensity image (S0) receives reflected and transmitted light, which contains rich backg...

**n Chen, Shenglai Zhen, Tao Lv, Benli Yu in Signal, Image and Video Processing (2023)

Article

MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification

Pedestrian re-identification is highly dependent on discriminative features that enable images to encapsulate an arbitrary combination of multiple scales by different spatial scales. However, current models di...

Keyu Pan, Yishi Zhao, Tao Wang, Shihong Yao in Signal, Image and Video Processing (2023)

Article

Learning discriminative features for person re-identification via multi-spectral channel attention

Person re-identification (Re-ID) aims to match a particular person captured by different cameras, which has great potential in video surveillance. However, Re-ID is still challenging due to occlusions, misalig...

Qianyue Duan, Zhenwu Hu, Minghao Lu, Huanjie Tao in Signal, Image and Video Processing (2023)

Article

Severe motion blurred silkworm pupae image restoration in sex discrimination

The task of sex determination of silkworm pupae is usually accomplished using machine learning technology. However, the captured image included much blur because of live silkworm pupae’s writhing, which makes ...

Guangying Qiu, Qingying Li, Dan Tao, Housheng Su… in Signal, Image and Video Processing (2023)

Article

Multi-feature aggregation network for salient object detection

As the network deepens, the boundary information and small object will be lost, which result in blurred edges and incomplete salient detection. In this paper, we propose a multi-feature aggregated network to s...

Hu Huang, ** Liu, Yanzhao Wang, Tongchi Zhou… in Signal, Image and Video Processing (2023)

Article

Fake license plate recognition in surveillance videos

Fake license plate (FLP) recognition aims to identify modified, defaced or forged license plates in traffic videos and images. The recognition result, that is, the vehicle with an FLP, is important information...

Wei Pan, **n Zhou, Tao Zhou, Yuanyuan Chen in Signal, Image and Video Processing (2023)

93 Result(s)

A new virtual interpolation technology with range as object

DML-YOLOv8-SAR image object detection algorithm

Using improved YOLO V5s to recognize tomatoes in a continuous working environment

Shuff-BiseNet: a dual-branch segmentation network for pavement cracks

Research on image caption generation method based on multi-modal pre-training model and text mixup optimization

YOLO-MTG: a lightweight YOLO model for multi-target garbage detection

An effective masked transformer network for image denoising

Human risky behaviour recognition during ladder climbing based on multi-modal feature fusion and adaptive graph convolutional network

Boosting image denoising effect via low-level noise injection

Particle recognition and shape parameter detection based on deep learning

MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification

A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue

High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification

An attention-erasing stripe pyramid network for face forgery detection

Polarization image fusion based on grouped densely connected network

MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification

Learning discriminative features for person re-identification via multi-spectral channel attention

Severe motion blurred silkworm pupae image restoration in sex discrimination

Multi-feature aggregation network for salient object detection

Fake license plate recognition in surveillance videos

Our Content

Other Sites

Help & Contacts