-
Article
Dynamic context-driven progressive image inpainting with auxiliary generative units
Image inpainting aims to restore missing or damaged regions of an image with plausible visual content. Most existing methods face challenges when dealing with large-hole images, such as structural disto...
-
Article
Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes
The robustness of dense visual SLAM is still a challenging problem in dynamic environments. In this paper, we propose a novel keyframe-based dense visual SLAM to handle a highly dynamic environment by using an...
-
Article
Facial expression recognition based on local–global information reasoning and spatial distribution of landmark features
In the field of facial expression recognition (FER), two main trends exist: data-driven FER and feature-driven FER. The former focused on the data problems (e.g., sample imbalance and multimodal fu...
-
Article
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Large-scale contrastive vision-language pretraining has shown significant progress in visual representation learning. Unlike traditional visual systems trained by a fixed set of discrete labels, a new paradigm...
-
Article
Key-frame reference selection for error resilient video coding using low-delay hierarchical coding structure
Low-delay hierarchical coding structure (LD-HCS), as one of the most crucial components in the High Efficiency Video Coding (HEVC) standard, substantially complicates the temporal reference relationship and ac...
-
Article
V\(^2\)MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition
Fine-grained 3D shape recognition (FGSR) is crucial for real-world applications. Existing methods face challenges in achieving high accuracy for FGSR due to high similarity within sub-categories and low dissim...
-
Article
MapReduce-based distributed tensor clustering algorithm
Cluster analysis is one of the most fundamental methods in data mining, and it has been widely used in economics, social sciences and computer science. However, with the rapid development of Internet technolog...
-
Article
A new non-convex sparse optimization method for image restoration
In the field of image processing, the total variational model is an effective prior model. In order to better eliminate impulse noise, an effective method is to use
-
Article
Open Access
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor
-
Article
Crowded pose-guided multi-task learning for instance-level human parsing
Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic l...
-
Article
TRCA-Net: stronger U structured network for human image segmentation
Human image segmentation has been a practical and active research topic due to its wide range of potential applications. There are some previous studies on manual, semi-automatic and automatic segmentation meth...
-
Article
C3N: content-constrained convolutional network for mural image completion
Ancient murals, suffering from severe diseases, usually exhibit the absence or distortion of local areas. The damaged murals severely impaired people’s visual appreciation and satisfaction in the digital conse...
-
Article
Visible-infrared person re-identification model based on feature consistency and modal indistinguishability
Visible-infrared person re-identification (VI-ReID) is used to search person images across cameras under different modalities, which can address the limitation of visible-based ReID in dark environments. Intra...
-
Article
HF-SRGR: a new hybrid feature-driven social relation graph reasoning model
Social relations and interactions between persons form the foundation of human society. Effective recognition of social relationships has great potential for understanding and improving people’s psychology and...
-
Article
Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation
Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search for an optimal network structure that can run...
-
Article
Open Access
An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
Still image human action recognition (HAR) is a challenging problem owing to limited sources of information and large intra-class and small inter-class variations, which require highly discriminative features....
-
Article
Hierarchical learning with backtracking algorithm based on the Visual Confusion Label Tree for large-scale image classification
In this paper, a hierarchical learning algorithm based on the Bayesian Neural Network classifier with backtracking is proposed to support large-scale image classification, where a Visual Confusion Label Tree i...
-
Article
Embedded real-time infrared and visible image fusion for UAV surveillance
Infrared and visible image fusion is a beneficial processing task for Unmanned Aerial Vehicle (UAV) surveillance, which can improve visibility by combining the advantages of the infrared camera and the visible...
-
Article
A dedicated hardware accelerator for real-time acceleration of YOLOv2
In recent years, dedicated hardware accelerators for the acceleration of the convolutional neural network (CNN) have been extensively studied. Although many studies have presented efficient designs on FPGAs fo...
-
Article
Prescribed-time convergent and noise-tolerant Z-type neural dynamics for calculating time-dependent quadratic programming
Neural-dynamics methods for solving quadratic programming (QP) have been studied for decades. The main feature of a neural-dynamics solver is that it can generate a continuous path from the initial point, and ...