  1. No Access

    Article

    Towards Task Sampler Learning for Meta-Learning

    Meta-learning aims to learn general knowledge with diverse training tasks conducted from limited data, and then transfer it to new tasks. It is commonly believed that increasing task diversity will enhance the...

    Jingyao Wang, Wenwen Qiang, Xingzhe Su in International Journal of Computer Vision (2024)

  2. No Access

    Article

    Beyond Learned Metadata-Based Raw Image Reconstruction

    While raw images possess distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels, they are not widely adopted by general users due to their substantial storage requirements. ...

    Yufei Wang, Yi Yu, Wenhan Yang, Lanqing Guo in International Journal of Computer Vision (2024)

  3. No Access

    Article

    GST-YOLO: a lightweight visual detection algorithm for underwater garbage detection

    Underwater cleaning work primarily relies on human labor, but applying computer vision technology to Autonomous Underwater Vehicles can enhance cleaning efficiency. Considering that existing vision detection a...

    Longyi Jiang, Fanghua Liu, Junwei Lv, Binghua Liu in Journal of Real-Time Image Processing (2024)

  4. No Access

    Article

    Efficiently adapting large pre-trained models for real-time violence recognition in smart city surveillance

    Recently, the concept of smart cities has gained prominence, aiming to enhance urban efficiency, safety, and quality of life through advanced technologies. A critical component of this infrastructure is the ex...

    Xiaohui Ren, Wenze Fan, Yinghao Wang in Journal of Real-Time Image Processing (2024)

  5. No Access

    Article

    Towards Generalized UAV Object Detection: A Novel Perspective from Frequency Domain Disentanglement

    When deploying unmanned aerial vehicle (UAV) object detection networks to complex, real-world scenes, generalization ability is often reduced due to domain shift. While most existing domain-generalized object ...

    Kunyu Wang, Xueyang Fu, Chengjie Ge in International Journal of Computer Vision (2024)

  6. No Access

    Article

    OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition

    In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...

    Keyan Chen, Xiaolong Jiang, Haochen Wang in International Journal of Computer Vision (2024)

  7. No Access

    Article

    Diff-Font: Diffusion Model for Robust One-Shot Font Generation

    Font generation presents a significant challenge due to the intricate details needed, especially for languages with complex ideograms and numerous characters, such as Chinese and Korean. Although various few-s...

    Haibin He, Xinyuan Chen, Chaoyue Wang in International Journal of Computer Vision (2024)

  8. No Access

    Article

    YOLOv8n-LSLW: a lightweight method for real-time detection of wild fishing behavior

    With the development of the electric power industry, the laying of transmission lines covers various waters, which poses a great threat to the life safety of fishermen who intrude into high-voltage areas. To a...

    Pengcheng Yan, Wenchang Wang, Guodong Li in Journal of Real-Time Image Processing (2024)

  9. No Access

    Article

    Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-Wise Pseudo Labeling

    The Audio-Visual Video Parsing task aims to identify and temporally localize the events that occur in either or both the audio and visual streams of audible videos. It is often performed in a weakly-supervised man...

    Jinxing Zhou, Dan Guo, Yiran Zhong, Meng Wang in International Journal of Computer Vision (2024)

  10. No Access

    Article

    Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement

    The attention mechanism has been proven effective on various visual tasks in recent years. In the semantic segmentation task, the attention mechanism is applied in various methods, including the case of both c...

    Zheng Yuan, Jie Zhang, Yude Wang, Shiguang Shan in International Journal of Computer Vision (2024)

  11. No Access

    Article

    Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation

    In recent years, many researchers have exploited multiple depth estimation architectures to produce high-quality depth maps from a single image. For monocular depth estimation, abundant multiscale features can...

    Han Chen, Yongxiong Wang in Machine Vision and Applications (2024)

  12. No Access

    Article

    ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

    With the rapid development of depth sensors, more and more RGB-D videos can be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object de...

    Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu in International Journal of Computer Vision (2024)

  13. No Access

    Article

    Open-Set Single-Domain Generalization for Robust Face Anti-Spoofing

    Face anti-spoofing is a critical component of face recognition technology. However, it suffers from poor generalizability for cross-scenario target domains due to the simultaneous presence of unseen domains an...

    Fangling Jiang, Qi Li, Weining Wang, Min Ren in International Journal of Computer Vision (2024)

  14. No Access

    Article

    Learning Generalizable Mixed-Precision Quantization via Attribution Imitation

    In this paper, we propose a generalizable mixed-precision quantization (GMPQ) method for efficient inference. Conventional methods require the consistency of datasets for bitwidth search and model deployment t...

    Ziwei Wang, Han Xiao, Jie Zhou, Jiwen Lu in International Journal of Computer Vision (2024)

  15. No Access

    Article

    Generate Transferable Adversarial Physical Camouflages via Triplet Attention Suppression

    Deep learning models are vulnerable to adversarial examples. As one of the most threatening types for practical deep learning systems, physical adversarial examples have received extensive attention in recent ...

    Jiakai Wang, Xianglong Liu, Zixin Yin in International Journal of Computer Vision (2024)

  16. No Access

    Article

    Monitoring of Egg Growing in Video by the Improved DeepLabv3+ Network Model

    The paper proposes a noninvasive image-based method for monitoring egg growth, based on illumination and transfer learning. As the egg grows, the size of the egg air cell increases. The segmentation is perform...

    Fengyang Gu, Hui Zhu, Haiyang Wang, Yanbo Zhang in Pattern Recognition and Image Analysis (2024)

  17. No Access

    Article

    CLIP-guided Prototype Modulating for Few-shot Action Recognition

    Learning from large-scale contrastive language-image pre-training like CLIP has shown remarkable success in a wide range of downstream tasks recently, but it is still under-explored on the challenging few-shot...

    Xiang Wang, Shiwei Zhang, Jun Cen, Changxin Gao in International Journal of Computer Vision (2024)

  18. No Access

    Article

    Logit Normalization for Long-Tail Object Detection

    Real-world data with skewed distributions poses a serious challenge to existing object detectors. The unbalanced label distribution leads to a bias towards dominant labels, resulting in worse detection per...

    Liang Zhao, Yao Teng, Limin Wang in International Journal of Computer Vision (2024)

  19. No Access

    Article

    Delving into Identify-Emphasize Paradigm for Combating Unknown Bias

    Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still ...

    Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He in International Journal of Computer Vision (2024)

  20. No Access

    Article

    OV-VIS: Open-Vocabulary Video Instance Segmentation

    Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...

    Haochen Wang, Cilin Yan, Keyan Chen in International Journal of Computer Vision (2024)
