![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Chapter and Conference Paper
Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection
Temporal action detection usually relies on huge tagging costs to achieve significant performance. Semi-supervised learning, where only a small amount of data are annotated in the training set, can help reduce...
-
Chapter and Conference Paper
Cross Fusion for Egocentric Interactive Action Recognition
The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes and multiple types of activities, hinder the action recognition methods of third-person vision f...
-
Chapter and Conference Paper
A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment
To ameliorate the problems of disorder, sparseness, and floating occur for 3D LiDAR point cloud in the road environment, we propose a novel deep CNN architecture for real-time point cloud features extraction. ...
-
Chapter and Conference Paper
Social Adaptive Module for Weakly-Supervised Group Activity Recognition
This paper presents a new task named weakly-supervised group activity recognition (GAR) which differs from conventional GAR tasks in that only video-level labels are available, yet the important persons within...
-
Chapter and Conference Paper
Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning
Temporal action localization in untrimmed long videos is an important yet challenging problem. The temporal ambiguity and the intra-class variations of temporal structure of actions make existing methods far f...
-
Chapter and Conference Paper
Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment
In this work, we propose a new image retrieval scheme by identifying better visual representations and fusing multiple similarities based on multiple features. For visual representation, we propose a new coars...
-
Chapter and Conference Paper
Global and Local C3D Ensemble System for First Person Interactive Action Recognition
Action recognition in first person videos is different from that in third person videos. In this paper, we aim to recognize interactive actions in first person videos. First person interactive actions contain ...
-
Chapter and Conference Paper
Mini Neural Networks for Effective and Efficient Mobile Album Organization
In this paper, we present an auto mobile album organization system, which can automatically classify daily photos in mobile devices into six daily categories, e.g., Baby, Food, Party, Scenery, Selfie, and Sport. ...
-
Chapter and Conference Paper
Recovering Overlap** Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation
Monaural musical sound separation attempts to isolate one or more instrument sources from a mono-channel polyphonic mixture. The primary challenge is to accurately separate pitched musical sounds where their p...