![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
5,281 Result(s)
-
Article
CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing
Domain generalization based Face Anti-spoofing (FAS) aims to enhance its ability to work in unseen domains. Existing methods endeavor to extract a discriminative common space through the alignment of distribut...
-
Article
Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition
Modern deep neural networks are prone to learn domain-dependent shortcuts and thus usually suffer from severe performance degradation when tested in unseen target domains due to their poor ability of out-of-di...
-
Article
VLG: General Video Recognition with Web Textual Knowledge
Video recognition (action recognition) in an open world is quite challenging, as we need to handle different settings such as closed-set, long-tail, few-shot, and open-set. The majority of existing works often...
-
Article
EEA-Net: edge-enhanced assistance network for infrared small target detection
With the development of deep learning, the performance of infrared small target detection (IRSTD) has been significantly improved. A precise shape of the target edge is crucial for segmenting small infrared ta...
-
Article
Open AccessTemporally Consistent Enhancement of Low-Light Videos via Spatial-Temporal Compatible Learning
Temporal inconsistency is the annoying artifact that has been commonly introduced in low-light video enhancement, but current methods tend to overlook the significance of utilizing both data-centric clues and ...
-
Article
WATCHER: Wavelet-Guided Texture-Content Hierarchical Relation Learning for Deepfake Detection
Breathtaking advances in face forgery techniques produce visually untraceable deepfake videos, thus potential malicious abuse of these techniques has sparked great concerns. Existing deepfake detectors primari...
-
Article
ManiCLIP: Multi-attribute Face Manipulation from Text
In this paper we present a novel multi-attribute face manipulation method based on textual descriptions. Previous text-based image editing methods either require test-time optimization for each individual imag...
-
Article
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
Image restoration in adverse weather conditions is a difficult task in computer vision. In this paper, we propose a novel transformer-based framework called GridFormer which serves as a backbone for image rest...
-
Article
Fast detection of face masks in public places using QARepVGG-YOLOv7
The COVID-19 pandemic has resulted in substantial global losses. In the post-epidemic era, public health needs still advocate the correct use of medical masks in confined spaces such as hospitals and indoors. ...
-
Article
Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging
Video snapshot compressive imaging (SCI) uses a low-speed 2D detector to capture high-speed scene, where the dynamic scene is modulated by different masks and then compressed into a snapshot measurement. Follo...
-
Article
An Adaptive Correlation Filtering Method for Text-Based Person Search
Text-based person search aims to align person images with natural language descriptions, which can be widely used in video surveillance field, such as missing person searching and suspect tracking. In this tas...
-
Article
Benchmarking Object Detection Robustness against Real-World Corruptions
With the rapid recent development, deep learning based object detection techniques have been applied to various real-world software systems, especially in safety-critical applications like autonomous driving. ...
-
Article
PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation
Tremendous breakthroughs have been developed in Semi-Supervised Semantic Segmentation (S4) through contrastive learning. However, due to limited annotations, the guidance on unlabeled images is generated by th...
-
Article
Thermal infrared action recognition with two-stream shift Graph Convolutional Network
The extensive deployment of camera-based IoT devices in our society is heightening the vulnerability of citizens’ sensitive information and individual data privacy. In this context, thermal imaging techniques ...
-
Article
Adaptive Discriminative Regularization for Visual Classification
How to improve discriminative feature learning is central in classification. Existing works address this problem by explicitly increasing inter-class separability and intra-class compactness by constructing po...
-
Article
Hardware architecture optimization for high-frequency zeroing and LFNST in H.266/VVC based on FPGA
To reduce the hardware implementation resource consumption of the two-dimensional transform component in H.266 VVC, a unified hardware structure is proposed that supports full-size Discrete Cosine Transform (D...
-
Article
An Empirical Study on Multi-domain Robust Semantic Segmentation
How to effectively leverage the plentiful existing datasets to train a robust and high-performance model is of great significance for many practical applications. However, a model trained on a naive merge of d...
-
Article
High-speed hardware accelerator based on brightness improved by Light-DehazeNet
Due to the increasing demand for artificial intelligence technology in today’s society, the entire industrial production system is undergoing a transformative process related to automation, reliability, and ro...
-
Article
Yolo-global: a real-time target detector for mineral particles
Recently, deep learning methodologies have achieved significant advancements in mineral automatic sorting and anomaly detection. However, the limited features of minerals transported in the form of small parti...
-
Article
Open AccessMeet JEANIE: A Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment
Video sequences exhibit significant nuisance variations (undesired effects) of speed of actions, temporal locations, and subjects’ poses, leading to temporal-viewpoint misalignment when comparing two sets of f...