Search Results - Springer

Article

Dilation-erosion for single-frame supervised temporal action localization

To balance the annotation labor and the granularity of supervision, single-frame annotation has been introduced in temporal action localization. It provides a rough temporal location for an action but implicit...

Bin Wang, Yan Song, Fanming Wang, Yang Zhao… in Multimedia Tools and Applications (2024)

Article

Supervised Learning Strategy for Spiking Neurons Based on Their Segmental Running Characteristics

Supervised learning of spiking neurons is an effective simulation method to explore the learning mechanism of real neurons. Desired output spike trains are often used as supervised signals to control the synap...

**ngjian Gu, **n Shu, **g Yang, Yan Xu, Haiyan Jiang… in Neural Processing Letters (2023)

Article

A simple yet effective image stitching with computational suture zone

Image stitching is the process of combining two or more photographic images with spatially overlap** areas into a wider-view panorama accommodating the full-scale information. It suffers from ghosting or obv...

Jiachao Zhang, Yang Gao, Yi Xu, Yunbin Huang, Yanming Yu… in The Visual Computer (2023)

Article

BCMask: a finer leaf instance segmentation with bilayer convolution mask

Whether in natural scenes or laboratory environments, leaf instance segmentation is still a challenging task in high-throughput plant phenotypic research. Because compared with normal instance objects, leaves ...

**ngjian Gu, Yongjie Zhu, Shougang Ren, **angbo Shu in Multimedia Systems (2023)

Article

Wavelet-Attention CNN for image classification

The feature learning methods based on convolutional neural network (CNN) have successfully produced tremendous achievements in image classification tasks. However, the inherent noise and some other factors may...

**angyu Zhao, Peng Huang, **angbo Shu in Multimedia Systems (2022)

Article

Skip-attention encoder–decoder framework for human motion prediction

Human motion prediction aims to automatically predict the future motion sequence based on an observed human motion sequence. In this paper, we propose a novel skip-attention encoder–decoder (SAED) framework to...

Ruipeng Zhang, **angbo Shu, Rui Yan, Jiachao Zhang, Yan Song in Multimedia Systems (2022)

Chapter and Conference Paper

Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection

Temporal action detection usually relies on huge tagging costs to achieve significant performance. Semi-supervised learning, where only a small amount of data are annotated in the training set, can help reduce...

Lin Wang, Yan Song, Rui Yan, **angbo Shu in MultiMedia Modeling (2022)

Chapter and Conference Paper

Cross Fusion for Egocentric Interactive Action Recognition

The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes and multiple types of activities, hinder the action recognition methods of third-person vision f...

Haiyu Jiang, Yan Song, Jiang He, **angbo Shu in MultiMedia Modeling (2020)

Chapter and Conference Paper

A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment

To ameliorate the problems of disorder, sparseness, and floating occur for 3D LiDAR point cloud in the road environment, we propose a novel deep CNN architecture for real-time point cloud features extraction. ...

Duyao Fan, Yazhou Yao, Yunfei Cai, **angbo Shu… in Pattern Recognition and Computer Vision (2020)

Chapter and Conference Paper

Social Adaptive Module for Weakly-Supervised Group Activity Recognition

This paper presents a new task named weakly-supervised group activity recognition (GAR) which differs from conventional GAR tasks in that only video-level labels are available, yet the important persons within...

Rui Yan, Lingxi **e, **hui Tang, **angbo Shu, Qi Tian in Computer Vision – ECCV 2020 (2020)

Article

Image annotation refinement via 2P-KNN based group sparse reconstruction

Image annotation aims at predicting labels that can accurately describe the semantic information of images. In the past few years, many methods have been proposed to solve the image annotation problem. However...

Qian Ji, Liyan Zhang, **angbo Shu, **hui Tang in Multimedia Tools and Applications (2019)

Chapter and Conference Paper

Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning

Temporal action localization in untrimmed long videos is an important yet challenging problem. The temporal ambiguity and the intra-class variations of temporal structure of actions make existing methods far f...

Minglei Yang, Yan Song, **angbo Shu, **hui Tang in MultiMedia Modeling (2019)

Article

A content-based recommendation algorithm for learning resources

Automatic multimedia learning resources recommendation has become an increasingly relevant problem: it allows students to discover new learning resources that match their tastes, and enables the e-learning sys...

Jiangbo Shu, **aoxuan Shen, Hai Liu, Baolin Yi, Zhaoli Zhang in Multimedia Systems (2018)

Article

A Feature Selection Method for Projection Twin Support Vector Machine

In this paper, we propose a novel feature selection method which can suppress the input features during the process of model construction automatically. The main idea is to obtain better performance and sparse...

A. Rui Yan, B. Qiaolin Ye, C. Liyan Zhang, D. Ning Ye… in Neural Processing Letters (2018)

Chapter and Conference Paper

Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment

In this work, we propose a new image retrieval scheme by identifying better visual representations and fusing multiple similarities based on multiple features. For visual representation, we propose a new coars...

Hui Liu, Zechao Li, **angbo Shu in Internet Multimedia Computing and Service (2018)

Chapter and Conference Paper

Global and Local C3D Ensemble System for First Person Interactive Action Recognition

Action recognition in first person videos is different from that in third person videos. In this paper, we aim to recognize interactive actions in first person videos. First person interactive actions contain ...

Lingling Fa, Yan Song, **angbo Shu in MultiMedia Modeling (2018)

Chapter and Conference Paper

Mini Neural Networks for Effective and Efficient Mobile Album Organization

In this paper, we present an auto mobile album organization system, which can automatically classify daily photos in mobile devices into six daily categories, e.g., Baby, Food, Party, Scenery, Selfie, and Sport. ...

Lingling Fa, Lifei Zhang, **angbo Shu… in Advances in Multimedia Information Process… (2018)

Chapter and Conference Paper

Recovering Overlap** Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation

Monaural musical sound separation attempts to isolate one or more instrument sources from a mono-channel polyphonic mixture. The primary challenge is to accurately separate pitched musical sounds where their p...

Yukai Gong, **angbo Shu, **hui Tang in Advances in Multimedia Information Process… (2018)

Article

KDE based outlier detection on distributed data streams in multimedia network

Multimedia networks hold the promise of facilitating large-scale, real-time data processing in complex environments. Their foreseeable applications will help protect and monitor military, environmental, safety...

Zhigao Zheng, Hwa-Young Jeong, Tao Huang, Jiangbo Shu in Multimedia Tools and Applications (2017)

Chapter and Conference Paper

Computational Face Reader

The long-history Chinese anthroposcopy has demonstrated the often satisfying capabilities to tell the characteristics (mostly exaggerated as fortune) of a person by reading his/her face, i.e. understanding the...

**angbo Shu, Liyan Zhang, **hui Tang, Guo-Sen **e, Shuicheng Yan in MultiMedia Modeling (2016)

21 Result(s)

Dilation-erosion for single-frame supervised temporal action localization

Supervised Learning Strategy for Spiking Neurons Based on Their Segmental Running Characteristics

A simple yet effective image stitching with computational suture zone

BCMask: a finer leaf instance segmentation with bilayer convolution mask

Wavelet-Attention CNN for image classification

Skip-attention encoder–decoder framework for human motion prediction

Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection

Cross Fusion for Egocentric Interactive Action Recognition

A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment

Social Adaptive Module for Weakly-Supervised Group Activity Recognition

Image annotation refinement via 2P-KNN based group sparse reconstruction

Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning

A content-based recommendation algorithm for learning resources

A Feature Selection Method for Projection Twin Support Vector Machine

Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment

Global and Local C3D Ensemble System for First Person Interactive Action Recognition

Mini Neural Networks for Effective and Efficient Mobile Album Organization

Recovering Overlap** Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation

KDE based outlier detection on distributed data streams in multimedia network

Computational Face Reader

Our Content

Other Sites

Help & Contacts