Skip to main content

previous disabled Page of 2
and
  1. No Access

    Article

    Dilation-erosion for single-frame supervised temporal action localization

    To balance the annotation labor and the granularity of supervision, single-frame annotation has been introduced in temporal action localization. It provides a rough temporal location for an action but implicit...

    Bin Wang, Yan Song, Fanming Wang, Yang Zhao in Multimedia Tools and Applications (2024)

  2. No Access

    Article

    Supervised Learning Strategy for Spiking Neurons Based on Their Segmental Running Characteristics

    Supervised learning of spiking neurons is an effective simulation method to explore the learning mechanism of real neurons. Desired output spike trains are often used as supervised signals to control the synap...

    **ngjian Gu, **n Shu, **g Yang, Yan Xu, Haiyan Jiang in Neural Processing Letters (2023)

  3. No Access

    Article

    A simple yet effective image stitching with computational suture zone

    Image stitching is the process of combining two or more photographic images with spatially overlap** areas into a wider-view panorama accommodating the full-scale information. It suffers from ghosting or obv...

    Jiachao Zhang, Yang Gao, Yi Xu, Yunbin Huang, Yanming Yu in The Visual Computer (2023)

  4. No Access

    Article

    BCMask: a finer leaf instance segmentation with bilayer convolution mask

    Whether in natural scenes or laboratory environments, leaf instance segmentation is still a challenging task in high-throughput plant phenotypic research. Because compared with normal instance objects, leaves ...

    **ngjian Gu, Yongjie Zhu, Shougang Ren, **angbo Shu in Multimedia Systems (2023)

  5. No Access

    Article

    Wavelet-Attention CNN for image classification

    The feature learning methods based on convolutional neural network (CNN) have successfully produced tremendous achievements in image classification tasks. However, the inherent noise and some other factors may...

    **angyu Zhao, Peng Huang, **angbo Shu in Multimedia Systems (2022)

  6. No Access

    Article

    Skip-attention encoder–decoder framework for human motion prediction

    Human motion prediction aims to automatically predict the future motion sequence based on an observed human motion sequence. In this paper, we propose a novel skip-attention encoder–decoder (SAED) framework to...

    Ruipeng Zhang, **angbo Shu, Rui Yan, Jiachao Zhang, Yan Song in Multimedia Systems (2022)

  7. No Access

    Chapter and Conference Paper

    Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection

    Temporal action detection usually relies on huge tagging costs to achieve significant performance. Semi-supervised learning, where only a small amount of data are annotated in the training set, can help reduce...

    Lin Wang, Yan Song, Rui Yan, **angbo Shu in MultiMedia Modeling (2022)

  8. No Access

    Chapter and Conference Paper

    Cross Fusion for Egocentric Interactive Action Recognition

    The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes and multiple types of activities, hinder the action recognition methods of third-person vision f...

    Haiyu Jiang, Yan Song, Jiang He, **angbo Shu in MultiMedia Modeling (2020)

  9. No Access

    Chapter and Conference Paper

    A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment

    To ameliorate the problems of disorder, sparseness, and floating occur for 3D LiDAR point cloud in the road environment, we propose a novel deep CNN architecture for real-time point cloud features extraction. ...

    Duyao Fan, Yazhou Yao, Yunfei Cai, **angbo Shu in Pattern Recognition and Computer Vision (2020)

  10. No Access

    Chapter and Conference Paper

    Social Adaptive Module for Weakly-Supervised Group Activity Recognition

    This paper presents a new task named weakly-supervised group activity recognition (GAR) which differs from conventional GAR tasks in that only video-level labels are available, yet the important persons within...

    Rui Yan, Lingxi **e, **hui Tang, **angbo Shu, Qi Tian in Computer Vision – ECCV 2020 (2020)

  11. No Access

    Article

    Image annotation refinement via 2P-KNN based group sparse reconstruction

    Image annotation aims at predicting labels that can accurately describe the semantic information of images. In the past few years, many methods have been proposed to solve the image annotation problem. However...

    Qian Ji, Liyan Zhang, **angbo Shu, **hui Tang in Multimedia Tools and Applications (2019)

  12. No Access

    Chapter and Conference Paper

    Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning

    Temporal action localization in untrimmed long videos is an important yet challenging problem. The temporal ambiguity and the intra-class variations of temporal structure of actions make existing methods far f...

    Minglei Yang, Yan Song, **angbo Shu, **hui Tang in MultiMedia Modeling (2019)

  13. No Access

    Article

    A content-based recommendation algorithm for learning resources

    Automatic multimedia learning resources recommendation has become an increasingly relevant problem: it allows students to discover new learning resources that match their tastes, and enables the e-learning sys...

    Jiangbo Shu, **aoxuan Shen, Hai Liu, Baolin Yi, Zhaoli Zhang in Multimedia Systems (2018)

  14. No Access

    Article

    A Feature Selection Method for Projection Twin Support Vector Machine

    In this paper, we propose a novel feature selection method which can suppress the input features during the process of model construction automatically. The main idea is to obtain better performance and sparse...

    A. Rui Yan, B. Qiaolin Ye, C. Liyan Zhang, D. Ning Ye in Neural Processing Letters (2018)

  15. No Access

    Chapter and Conference Paper

    Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment

    In this work, we propose a new image retrieval scheme by identifying better visual representations and fusing multiple similarities based on multiple features. For visual representation, we propose a new coars...

    Hui Liu, Zechao Li, **angbo Shu in Internet Multimedia Computing and Service (2018)

  16. No Access

    Chapter and Conference Paper

    Global and Local C3D Ensemble System for First Person Interactive Action Recognition

    Action recognition in first person videos is different from that in third person videos. In this paper, we aim to recognize interactive actions in first person videos. First person interactive actions contain ...

    Lingling Fa, Yan Song, **angbo Shu in MultiMedia Modeling (2018)

  17. No Access

    Chapter and Conference Paper

    Mini Neural Networks for Effective and Efficient Mobile Album Organization

    In this paper, we present an auto mobile album organization system, which can automatically classify daily photos in mobile devices into six daily categories, e.g., Baby, Food, Party, Scenery, Selfie, and Sport. ...

    Lingling Fa, Lifei Zhang, **angbo Shu in Advances in Multimedia Information Process… (2018)

  18. No Access

    Chapter and Conference Paper

    Recovering Overlap** Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation

    Monaural musical sound separation attempts to isolate one or more instrument sources from a mono-channel polyphonic mixture. The primary challenge is to accurately separate pitched musical sounds where their p...

    Yukai Gong, **angbo Shu, **hui Tang in Advances in Multimedia Information Process… (2018)

  19. No Access

    Article

    KDE based outlier detection on distributed data streams in multimedia network

    Multimedia networks hold the promise of facilitating large-scale, real-time data processing in complex environments. Their foreseeable applications will help protect and monitor military, environmental, safety...

    Zhigao Zheng, Hwa-Young Jeong, Tao Huang, Jiangbo Shu in Multimedia Tools and Applications (2017)

  20. No Access

    Chapter and Conference Paper

    Computational Face Reader

    The long-history Chinese anthroposcopy has demonstrated the often satisfying capabilities to tell the characteristics (mostly exaggerated as fortune) of a person by reading his/her face, i.e. understanding the...

    **angbo Shu, Liyan Zhang, **hui Tang, Guo-Sen **e, Shuicheng Yan in MultiMedia Modeling (2016)

previous disabled Page of 2