Search Results - Springer

Article

Towards Task Sampler Learning for Meta-Learning

Meta-learning aims to learn general knowledge with diverse training tasks conducted from limited data, and then transfer it to new tasks. It is commonly believed that increasing task diversity will enhance the...

Jingyao Wang, Wenwen Qiang, Xingzhe Su… in International Journal of Computer Vision (2024)

Article

Beyond Learned Metadata-Based Raw Image Reconstruction

While raw images possess distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels, they are not widely adopted by general users due to their substantial storage requirements. ...

Yufei Wang, Yi Yu, Wenhan Yang, Lanqing Guo… in International Journal of Computer Vision (2024)

Article

GST-YOLO: a lightweight visual detection algorithm for underwater garbage detection

Underwater cleaning work primarily relies on human labor, but applying computer vision technology to Autonomous Underwater Vehicles can enhance cleaning efficiency. Considering that existing vision detection a...

Longyi Jiang, Fanghua Liu, Junwei Lv, Binghua Liu… in Journal of Real-Time Image Processing (2024)

Article

Efficiently adapting large pre-trained models for real-time violence recognition in smart city surveillance

Recently, the concept of smart cities has gained prominence, aiming to enhance urban efficiency, safety, and quality of life through advanced technologies. A critical component of this infrastructure is the ex...

Xiaohui Ren, Wenze Fan, Yinghao Wang in Journal of Real-Time Image Processing (2024)

Article

Towards Generalized UAV Object Detection: A Novel Perspective from Frequency Domain Disentanglement

When deploying unmanned aerial vehicle (UAV) object detection networks to complex, real-world scenes, generalization ability is often reduced due to domain shift. While most existing domain-generalized object ...

Kunyu Wang, Xueyang Fu, Chengjie Ge… in International Journal of Computer Vision (2024)

Article

OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition

In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...

Keyan Chen, Xiaolong Jiang, Haochen Wang… in International Journal of Computer Vision (2024)

Article

Diff-Font: Diffusion Model for Robust One-Shot Font Generation

Font generation presents a significant challenge due to the intricate details needed, especially for languages with complex ideograms and numerous characters, such as Chinese and Korean. Although various few-s...

Haibin He, Xinyuan Chen, Chaoyue Wang… in International Journal of Computer Vision (2024)

Article

YOLOv8n-LSLW: a lightweight method for real-time detection of wild fishing behavior

With the development of the electric power industry, the laying of transmission lines covers various waters, which poses a great threat to the life safety of fishermen who intrude into high-voltage areas. To a...

Pengcheng Yan, Wenchang Wang, Guodong Li… in Journal of Real-Time Image Processing (2024)

Article

Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-Wise Pseudo Labeling

The Audio-Visual Video Parsing task aims to identify and temporally localize the events that occur in either or both the audio and visual streams of audible videos. It often performs in a weakly-supervised man...

Jinxing Zhou, Dan Guo, Yiran Zhong, Meng Wang in International Journal of Computer Vision (2024)

Article

Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement

The attention mechanism has been proven effective on various visual tasks in recent years. In the semantic segmentation task, the attention mechanism is applied in various methods, including the case of both c...

Zheng Yuan, Jie Zhang, Yude Wang, Shiguang Shan… in International Journal of Computer Vision (2024)

Article

Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation

In recent years, many researchers have exploited multiple depth estimation architectures to produce high-quality depth maps from a single image. For monocular depth estimation, abundant multiscale features can...

Han Chen, Yongxiong Wang in Machine Vision and Applications (2024)

Article

ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...

Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu… in International Journal of Computer Vision (2024)

Article

Open-Set Single-Domain Generalization for Robust Face Anti-Spoofing

Face anti-spoofing is a critical component of face recognition technology. However, it suffers from poor generalizability for cross-scenario target domains due to the simultaneous presence of unseen domains an...

Fangling Jiang, Qi Li, Weining Wang, Min Ren… in International Journal of Computer Vision (2024)

Article

Learning Generalizable Mixed-Precision Quantization via Attribution Imitation

In this paper, we propose a generalizable mixed-precision quantization (GMPQ) method for efficient inference. Conventional methods require the consistency of datasets for bitwidth search and model deployment t...

Ziwei Wang, Han Xiao, Jie Zhou, Jiwen Lu in International Journal of Computer Vision (2024)

Article

Generate Transferable Adversarial Physical Camouflages via Triplet Attention Suppression

Deep learning models are vulnerable to adversarial examples. As one of the most threatening types for practical deep learning systems, physical adversarial examples have received extensive attention in recent ...

Jiakai Wang, Xianglong Liu, Zixin Yin… in International Journal of Computer Vision (2024)

Article

Monitoring of Egg Growing in Video by the Improved DeepLabv3+ Network Model

The paper proposes the noninvasive image egg growing monitoring method based on an illumination and transfer learning. During the egg growing, the size of egg air cell is increased. The segmentation is perform...

Fengyang Gu, Hui Zhu, Haiyang Wang, Yanbo Zhang… in Pattern Recognition and Image Analysis (2024)

Article

CLIP-guided Prototype Modulating for Few-shot Action Recognition

Learning from large-scale contrastive language-image pre-training like CLIP has shown remarkable success in a wide range of downstream tasks recently, but it is still under-explored on the challenging few-shot...

Xiang Wang, Shiwei Zhang, Jun Cen, Changxin Gao… in International Journal of Computer Vision (2024)

Article

Logit Normalization for Long-Tail Object Detection

Real-world data with skewed distributions poses a serious challenge to existing object detectors. The unbalanced label distribution leads to a bias towards dominate labels, resulting in the worse detection per...

Liang Zhao, Yao Teng, Limin Wang in International Journal of Computer Vision (2024)

Article

Delving into Identify-Emphasize Paradigm for Combating Unknown Bias

Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still ...

Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He… in International Journal of Computer Vision (2024)

Article

OV-VIS: Open-Vocabulary Video Instance Segmentation

Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...

Haochen Wang, Cilin Yan, Keyan Chen… in International Journal of Computer Vision (2024)

5,281 Result(s)
