Search Results - Springer

Sort By Newest First Oldest First

Article

CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing

Domain generalization based Face Anti-spoofing (FAS) aims to enhance its ability to work in unseen domains. Existing methods endeavor to extract a discriminative common space through the alignment of distribut...

Keyao Wang, Guosheng Zhang, Haixiao Yue… in International Journal of Computer Vision (2024)
Article

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition

Modern deep neural networks are prone to learn domain-dependent shortcuts and thus usually suffer from severe performance degradation when tested in unseen target domains due to their poor ability of out-of-di...

Xun Yang, Tianyu Chang, Tianzhu Zhang… in International Journal of Computer Vision (2024)
Article

VLG: General Video Recognition with Web Textual Knowledge

Video recognition (action recognition) in an open world is quite challenging, as we need to handle different settings such as closed-set, long-tail, few-shot, and open-set. The majority of existing works often...

**tao Lin, Zhaoyang Liu, Wenhai Wang, Wayne Wu… in International Journal of Computer Vision (2024)
Article

EEA-Net: edge-enhanced assistance network for infrared small target detection

With the development of deep learning, the performance of infrared small target detection (IRSTD) has been significantly improved. A precise shape of the target edge is crucial for segmenting small infrared ta...

Chen Wang, **aopeng Hu, **ang Gao, Haoyu Wei, Jiawei Tao… in Machine Vision and Applications (2024)
Article

Open Access

Temporally Consistent Enhancement of Low-Light Videos via Spatial-Temporal Compatible Learning

Temporal inconsistency is the annoying artifact that has been commonly introduced in low-light video enhancement, but current methods tend to overlook the significance of utilizing both data-centric clues and ...

Lingyu Zhu, Wenhan Yang, Baoliang Chen… in International Journal of Computer Vision (2024)

Download PDF (4231 KB) View Article
Article

WATCHER: Wavelet-Guided Texture-Content Hierarchical Relation Learning for Deepfake Detection

Breathtaking advances in face forgery techniques produce visually untraceable deepfake videos, thus potential malicious abuse of these techniques has sparked great concerns. Existing deepfake detectors primari...

Yuan Wang, Chen Chen, Ning Zhang, **yuan Hu in International Journal of Computer Vision (2024)
Article

ManiCLIP: Multi-attribute Face Manipulation from Text

In this paper we present a novel multi-attribute face manipulation method based on textual descriptions. Previous text-based image editing methods either require test-time optimization for each individual imag...

Hao Wang, Guosheng Lin, Ana García del Molino… in International Journal of Computer Vision (2024)
Article

GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions

Image restoration in adverse weather conditions is a difficult task in computer vision. In this paper, we propose a novel transformer-based framework called GridFormer which serves as a backbone for image rest...

Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo… in International Journal of Computer Vision (2024)
Article

Fast detection of face masks in public places using QARepVGG-YOLOv7

The COVID-19 pandemic has resulted in substantial global losses. In the post-epidemic era, public health needs still advocate the correct use of medical masks in confined spaces such as hospitals and indoors. ...

Chuying Guan, Jiaxuan Jiang, Zhong Wang in Journal of Real-Time Image Processing (2024)
Article

Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging

Video snapshot compressive imaging (SCI) uses a low-speed 2D detector to capture high-speed scene, where the dynamic scene is modulated by different masks and then compressed into a snapshot measurement. Follo...

Miao Cao, Lishun Wang, Mingyu Zhu, **n Yuan in International Journal of Computer Vision (2024)
Article

An Adaptive Correlation Filtering Method for Text-Based Person Search

Text-based person search aims to align person images with natural language descriptions, which can be widely used in video surveillance field, such as missing person searching and suspect tracking. In this tas...

Mengyang Sun, Wei Suo, Peng Wang, Kai Niu… in International Journal of Computer Vision (2024)
Article

Benchmarking Object Detection Robustness against Real-World Corruptions

With the rapid recent development, deep learning based object detection techniques have been applied to various real-world software systems, especially in safety-critical applications like autonomous driving. ...

Jiawei Liu, Zhijie Wang, Lei Ma, Chunrong Fang… in International Journal of Computer Vision (2024)
Article

PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation

Tremendous breakthroughs have been developed in Semi-Supervised Semantic Segmentation (S4) through contrastive learning. However, due to limited annotations, the guidance on unlabeled images is generated by th...

Haoyu **e, Changqi Wang, Jian Zhao, Yang Liu… in International Journal of Computer Vision (2024)
Article

Thermal infrared action recognition with two-stream shift Graph Convolutional Network

The extensive deployment of camera-based IoT devices in our society is heightening the vulnerability of citizens’ sensitive information and individual data privacy. In this context, thermal imaging techniques ...

Jishi Liu, Huanyu Wang, Junnian Wang, Dalin He… in Machine Vision and Applications (2024)
Article

Adaptive Discriminative Regularization for Visual Classification

How to improve discriminative feature learning is central in classification. Existing works address this problem by explicitly increasing inter-class separability and intra-class compactness by constructing po...

Qingsong Zhao, Yi Wang, Shuguang Dou, Chen Gong… in International Journal of Computer Vision (2024)
Article

Hardware architecture optimization for high-frequency zeroing and LFNST in H.266/VVC based on FPGA

To reduce the hardware implementation resource consumption of the two-dimensional transform component in H.266 VVC, a unified hardware structure is proposed that supports full-size Discrete Cosine Transform (D...

Junxiang Zhang, Qinghua Sheng, Rui Pan… in Journal of Real-Time Image Processing (2024)
Article

An Empirical Study on Multi-domain Robust Semantic Segmentation

How to effectively leverage the plentiful existing datasets to train a robust and high-performance model is of great significance for many practical applications. However, a model trained on a naive merge of d...

Yajie Liu, Pu Ge, Qingjie Liu, Shichao Fan… in International Journal of Computer Vision (2024)
Article

High-speed hardware accelerator based on brightness improved by Light-DehazeNet

Due to the increasing demand for artificial intelligence technology in today’s society, the entire industrial production system is undergoing a transformative process related to automation, reliability, and ro...

Peiyi Teng, Gaoming Du, Zhenmin Li, **aolei Wang… in Journal of Real-Time Image Processing (2024)
Article

Yolo-global: a real-time target detector for mineral particles

Recently, deep learning methodologies have achieved significant advancements in mineral automatic sorting and anomaly detection. However, the limited features of minerals transported in the form of small parti...

Zihao Wang, Dong Zhou, Chengjun Guo, Ruihao Zhou in Journal of Real-Time Image Processing (2024)
Article

Open Access

Meet JEANIE: A Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment

Video sequences exhibit significant nuisance variations (undesired effects) of speed of actions, temporal locations, and subjects’ poses, leading to temporal-viewpoint misalignment when comparing two sets of f...

Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon… in International Journal of Computer Vision (2024)

Download PDF (3785 KB) View Article

5,281 Result(s)

CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition

VLG: General Video Recognition with Web Textual Knowledge

EEA-Net: edge-enhanced assistance network for infrared small target detection

Temporally Consistent Enhancement of Low-Light Videos via Spatial-Temporal Compatible Learning

WATCHER: Wavelet-Guided Texture-Content Hierarchical Relation Learning for Deepfake Detection

ManiCLIP: Multi-attribute Face Manipulation from Text

GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions

Fast detection of face masks in public places using QARepVGG-YOLOv7

Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging

An Adaptive Correlation Filtering Method for Text-Based Person Search

Benchmarking Object Detection Robustness against Real-World Corruptions

PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation

Thermal infrared action recognition with two-stream shift Graph Convolutional Network

Adaptive Discriminative Regularization for Visual Classification

Hardware architecture optimization for high-frequency zeroing and LFNST in H.266/VVC based on FPGA

An Empirical Study on Multi-domain Robust Semantic Segmentation

High-speed hardware accelerator based on brightness improved by Light-DehazeNet

Yolo-global: a real-time target detector for mineral particles

Meet JEANIE: A Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment

Our Content

Other Sites

Help & Contacts