Page
%P
-
Chapter and Conference Paper
Weakly-Supervised Temporal Action Localization with Multi-Head Cross-Modal Attention
Weakly-supervised temporal action localization seeks to localize temporal boundaries of actions while concurrently identifying their categories using only video-level category labels during training. Among the...