![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
5,281 Result(s)
-
Article
Towards Task Sampler Learning for Meta-Learning
Meta-learning aims to learn general knowledge with diverse training tasks conducted from limited data, and then transfer it to new tasks. It is commonly believed that increasing task diversity will enhance the...
-
Article
Beyond Learned Metadata-Based Raw Image Reconstruction
While raw images possess distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels, they are not widely adopted by general users due to their substantial storage requirements. ...
-
Article
GST-YOLO: a lightweight visual detection algorithm for underwater garbage detection
Underwater cleaning work primarily relies on human labor, but applying computer vision technology to Autonomous Underwater Vehicles can enhance cleaning efficiency. Considering that existing vision detection a...
-
Article
Efficiently adapting large pre-trained models for real-time violence recognition in smart city surveillance
Recently, the concept of smart cities has gained prominence, aiming to enhance urban efficiency, safety, and quality of life through advanced technologies. A critical component of this infrastructure is the ex...
-
Article
Towards Generalized UAV Object Detection: A Novel Perspective from Frequency Domain Disentanglement
When deploying unmanned aerial vehicle (UAV) object detection networks to complex, real-world scenes, generalization ability is often reduced due to domain shift. While most existing domain-generalized object ...
-
Article
OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition
In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...
-
Article
Diff-Font: Diffusion Model for Robust One-Shot Font Generation
Font generation presents a significant challenge due to the intricate details needed, especially for languages with complex ideograms and numerous characters, such as Chinese and Korean. Although various few-s...
-
Article
YOLOv8n-LSLW: a lightweight method for real-time detection of wild fishing behavior
With the development of the electric power industry, the laying of transmission lines covers various waters, which poses a great threat to the life safety of fishermen who intrude into high-voltage areas. To a...
-
Article
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-Wise Pseudo Labeling
The Audio-Visual Video Parsing task aims to identify and temporally localize the events that occur in either or both the audio and visual streams of audible videos. It often performs in a weakly-supervised man...
-
Article
Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement
The attention mechanism has been proven effective on various visual tasks in recent years. In the semantic segmentation task, the attention mechanism is applied in various methods, including the case of both c...
-
Article
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation
In recent years, many researchers have exploited multiple depth estimation architectures to produce high-quality depth maps from a single image. For monocular depth estimation, abundant multiscale features can...
-
Article
ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection
With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...
-
Article
Open-Set Single-Domain Generalization for Robust Face Anti-Spoofing
Face anti-spoofing is a critical component of face recognition technology. However, it suffers from poor generalizability for cross-scenario target domains due to the simultaneous presence of unseen domains an...
-
Article
Learning Generalizable Mixed-Precision Quantization via Attribution Imitation
In this paper, we propose a generalizable mixed-precision quantization (GMPQ) method for efficient inference. Conventional methods require the consistency of datasets for bitwidth search and model deployment t...
-
Article
Generate Transferable Adversarial Physical Camouflages via Triplet Attention Suppression
Deep learning models are vulnerable to adversarial examples. As one of the most threatening types for practical deep learning systems, physical adversarial examples have received extensive attention in recent ...
-
Article
Monitoring of Egg Growing in Video by the Improved DeepLabv3+ Network Model
The paper proposes the noninvasive image egg growing monitoring method based on an illumination and transfer learning. During the egg growing, the size of egg air cell is increased. The segmentation is perform...
-
Article
CLIP-guided Prototype Modulating for Few-shot Action Recognition
Learning from large-scale contrastive language-image pre-training like CLIP has shown remarkable success in a wide range of downstream tasks recently, but it is still under-explored on the challenging few-shot...
-
Article
Logit Normalization for Long-Tail Object Detection
Real-world data with skewed distributions poses a serious challenge to existing object detectors. The unbalanced label distribution leads to a bias towards dominate labels, resulting in the worse detection per...
-
Article
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias
Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still ...
-
Article
OV-VIS: Open-Vocabulary Video Instance Segmentation
Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...