![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Article
Fast CU partition strategy based on texture and neighboring partition information for Versatile Video Coding Intra Coding
The next generation video coding standard, H.266/Versatile Video Coding (VVC), was released by the Joint Video Exploration Team (JVET) in July 2020. Unlike the previous generation standard H.265/High Efficienc...
-
Chapter and Conference Paper
Unsupervised Prototype Adapter for Vision-Language Models
Recently, large-scale pre-trained vision-language models (e.g. CLIP and ALIGN) have demonstrated remarkable effectiveness in acquiring transferable visual representations. To leverage the valuable knowledge encod...
-
Article
A viewpoint-guided prototype network for 3D shape classification
Multi-view learning methods have achieved remarkable results in 3D shape recognition. However, most of them focus on the visual feature extraction and feature aggregation, while viewpoints (spatial positions o...
-
Article
Block-correlation-based intra prediction for VVC
The new generation video coding standard Versatile Video Coding (VVC) has been officially released. Many novel technologies were utilized to improve the coding performance. In this paper, we propose an efficient ...
-
Article
Topological and geometrical joint learning for 3D graph data
Traditional convolutional neural networks (CNNs) are limited to be directly applied to 3D graph data due to their inherent grid structure. And most of graph-based learning methods use local-to-global hierarchi...
-
Article
Geometric machine learning: research and applications
Over the last decade, deep learning has revolutionized many traditional machine learning tasks, ranging from computer vision to natural language processing. Although deep learning has achieved excellent perfor...
-
Article
Engineering-oriented bridge multiple-damage detection with damage integrity using modified faster region-based convolutional neural network
A bridge damage detector with preserving integrity based on modified Faster region-based convolutional neural network (R-CNN) is proposed for multiple damage types. The methodologies of dataset collection, dam...
-
Article
Cross-modal multi-relationship aware reasoning for image-text matching
Cross-modal image-text matching has attracted considerable interest in both computer vision and natural language processing communities. The main issue of image-text matching is to learn the compact cross-moda...
-
Article
Ensemble diversified learning for image classification with noisy labels
In this work, we develop a new approach for learning a deep neural network for image classification with noisy labels using ensemble diversified learning. We first partition the training set into multiple subs...
-
Article
vSocial: a cloud-based system for social virtual reality learning environment applications in special education
Virtual Learning Environments (VLEs) are spaces designed to educate student groups remotely via online platforms. Although traditional VLEs have shown promise in educating students, they offer limited immersio...
-
Article
An improved R-λ rate control model based on joint spatial-temporal domain information and HVS characteristics
With the popularization of smart terminals and multimedia technologies, the video coding standard — H.264/Advanced Video Coding (AVC) and H.265/High Efficiency Video Coding (HEVC) have been unable to meet the nee...
-
Article
CLDA: an adversarial unsupervised domain adaptation method with classifier-level adaptation
Domain adaptation is an active and important research field in transfer learning. Unsupervised domain adaptation, which is better in line with real-world scenarios than supervised and semi-supervised domain ad...
-
Article
An experimental study of relative total variation and probabilistic collaborative representation for iris recognition
Iris images collected under different conditions often suffer from specular reflections, cast shadows, motion blur, defocus blur, occlusion caused by eyelashes and eyelids, eyeglasses, hair and other artifacts...
-
Article
Zero-shot recognition with latent visual attributes learning
Zero-shot learning (ZSL) aims to recognize novel object categories by means of transferring knowledge extracted from the seen categories (source domain) to the unseen categories (target domain). Recently, most...
-
Article
Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding
In order to meet the emerging demands of high-fidelity video services, a new video coding standard — High Efficiency Video Coding (HEVC) is developed to improve the compression performance of high definition (HD)...
-
Article
Robust distributed video coding for wireless multimedia sensor networks
Coding complexity and error-resilience are the two key factors for video streaming in Wireless Multimedia Sensor Networks (WMSNs). Towards this objective, this paper proposes a Robust Distributed Video Coding ...
-
Article
A fast inter-prediction algorithm for HEVC based on temporal and spatial correlation
In HEVC, the structure of coding unit (CU) and prediction unit (PU) is defined, which brings about higher coding efficiency than H.264/AVC. However, the rate distortion (RD) cost calculations of all depths of ...
-
Chapter and Conference Paper
Rate-Distortion Control with Delay Bound Constraint for Video Streaming over Multi-Hop Networks
We develop a relatively accurate and robust R-D control algorithm in the H.264/AVC to achieve the target bit rate. More specifically, we first present an efficient bandwidth resource allocation framework to ob...
-
Reference Work Entry In depth
Wireless Video
-
Reference Work Entry In depth
Wireless Video
Definition:Wireless video refers to transporting video signals over mobile wireless links.