Search Results - Springer

Article

Fast CU partition strategy based on texture and neighboring partition information for Versatile Video Coding Intra Coding

The next generation video coding standard, H.266/Versatile Video Coding (VVC), was released by the Joint Video Exploration Team (JVET) in July 2020. Unlike the previous generation standard H.265/High Efficienc...

Ruolan Yang, Xiaohai He, Shuhua Xiong, Zeming Zhao… in Multimedia Tools and Applications (2024)

Chapter and Conference Paper

Unsupervised Prototype Adapter for Vision-Language Models

Recently, large-scale pre-trained vision-language models (e.g. CLIP and ALIGN) have demonstrated remarkable effectiveness in acquiring transferable visual representations. To leverage the valuable knowledge encod...

Yi Zhang, Ce Zhang, Xueting Hu, Zhihai He in Pattern Recognition and Computer Vision (2024)

Article

A viewpoint-guided prototype network for 3D shape classification

Multi-view learning methods have achieved remarkable results in 3D shape recognition. However, most of them focus on the visual feature extraction and feature aggregation, while viewpoints (spatial positions o...

Li Han, **hai He, Feng Dou, Huiwen Ma, **nyang **e, Wanwen Yang in Multimedia Systems (2023)

Article

Block-correlation-based intra prediction for VVC

The new generation video coding standard Versatile Video Coding (VVC) has been officially released. Many novel technologies were utilized to improve the coding performance. In this paper, we propose an efficient ...

Dan Luo, Shuhua Xiong, Xiaohai He, Honggang Chen… in Multimedia Tools and Applications (2023)

Article

Topological and geometrical joint learning for 3D graph data

Traditional convolutional neural networks (CNNs) are limited to be directly applied to 3D graph data due to their inherent grid structure. And most of graph-based learning methods use local-to-global hierarchi...

Li Han, Pengyan Lan, Xue Shi, **aomin Wang, **hai He… in Multimedia Tools and Applications (2023)

Article

Geometric machine learning: research and applications

Over the last decade, deep learning has revolutionized many traditional machine learning tasks, ranging from computer vision to natural language processing. Although deep learning has achieved excellent perfor...

Wenming Cao, Canta Zheng, Zhiyue Yan, Zhihai He… in Multimedia Tools and Applications (2022)

Article

Engineering-oriented bridge multiple-damage detection with damage integrity using modified faster region-based convolutional neural network

A bridge damage detector with preserving integrity based on modified Faster region-based convolutional neural network (R-CNN) is proposed for multiple damage types. The methodologies of dataset collection, dam...

Licun Yu, Shuanhai He, **aosong Liu, Ming Ma… in Multimedia Tools and Applications (2022)

Article

Cross-modal multi-relationship aware reasoning for image-text matching

Cross-modal image-text matching has attracted considerable interest in both computer vision and natural language processing communities. The main issue of image-text matching is to learn the compact cross-moda...

** Zhang, Xiaohai He, Linbo Qing, Lu** Liu… in Multimedia Tools and Applications (2022)

Article

Ensemble diversified learning for image classification with noisy labels

In this work, we develop a new approach for learning a deep neural network for image classification with noisy labels using ensemble diversified learning. We first partition the training set into multiple subs...

Ahmed Ahmed, Hayder Yousif, Zhihai He in Multimedia Tools and Applications (2021)

Article

vSocial: a cloud-based system for social virtual reality learning environment applications in special education

Virtual Learning Environments (VLEs) are spaces designed to educate student groups remotely via online platforms. Although traditional VLEs have shown promise in educating students, they offer limited immersio...

Sai Shreya Nuguri, Prasad Calyam, Roland Oruche… in Multimedia Tools and Applications (2021)

Article

An improved R-λ rate control model based on joint spatial-temporal domain information and HVS characteristics

With the popularization of smart terminals and multimedia technologies, the video coding standard — H.264/Advanced Video Coding (AVC) and H.265/High Efficiency Video Coding (HEVC) have been unable to meet the nee...

Zeming Zhao, Shuhua Xiong, Weiheng Sun, Xiaohai He… in Multimedia Tools and Applications (2021)

Article

CLDA: an adversarial unsupervised domain adaptation method with classifier-level adaptation

Domain adaptation is an active and important research field in transfer learning. Unsupervised domain adaptation, which is better in line with real-world scenarios than supervised and semi-supervised domain ad...

Zhihai He, Bo Yang, Chaoxian Chen, Qilin Mu, Zesong Li in Multimedia Tools and Applications (2020)

Article

An experimental study of relative total variation and probabilistic collaborative representation for iris recognition

Iris images collected under different conditions often suffer from specular reflections, cast shadows, motion blur, defocus blur, occlusion caused by eyelashes and eyelids, eyeglasses, hair and other artifacts...

Pradeep Karn, XiaoHai He, ** Zhang, Yanteng Zhang in Multimedia Tools and Applications (2020)

Article

Zero-shot recognition with latent visual attributes learning

Zero-shot learning (ZSL) aims to recognize novel object categories by means of transferring knowledge extracted from the seen categories (source domain) to the unseen categories (target domain). Recently, most...

Yurui **e, Xiaohai He, **g Zhang, **aodong Luo in Multimedia Tools and Applications (2020)

Article

Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding

In order to meet the emerging demands of high-fidelity video services, a new video coding standard — High Efficiency Video Coding (HEVC) is developed to improve the compression performance of high definition (HD)...

Yuyun Ye, Xiaohai He, Qizhi Teng, Linbo Qing… in Multimedia Tools and Applications (2018)

Article

Robust distributed video coding for wireless multimedia sensor networks

Coding complexity and error-resilience are the two key factors for video streaming in Wireless Multimedia Sensor Networks (WMSNs). Towards this objective, this paper proposes a Robust Distributed Video Coding ...

Hong Yang, Linbo Qing, Xiaohai He, **anfeng Ou… in Multimedia Tools and Applications (2018)

Article

A fast inter-prediction algorithm for HEVC based on temporal and spatial correlation

In HEVC, the structure of coding unit (CU) and prediction unit (PU) is defined, which brings about higher coding efficiency than H.264/AVC. However, the rate distortion (RD) cost calculations of all depths of ...

Guoyun Zhong, Xiaohai He, Linbo Qing, Yuan Li in Multimedia Tools and Applications (2015)

Chapter and Conference Paper

Rate-Distortion Control with Delay Bound Constraint for Video Streaming over Multi-Hop Networks

We develop a relatively accurate and robust R-D control algorithm in the H.264/AVC to achieve the target bit rate. More specifically, we first present an efficient bandwidth resource allocation framework to ob...

Yunsheng Zhang, Yongfei Zhang, Shixin Sun… in Advances in Multimedia Information Process… (2010)

Reference Work Entry In depth

Wireless Video

Zhihai He, Chang Wen Chen in Encyclopedia of Multimedia (2008)

Reference Work Entry In depth

Wireless Video

Definition: Wireless video refers to transporting video signals over mobile wireless links.

Zhihai He, Chang Wen Chen in Encyclopedia of Multimedia (2006)

22 Result(s)
