Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

    Recent years have witnessed a trend of applying context frames to boost the performance of object detection as video object detection. Existing methods usually aggregate features at one stroke to enhance the f...

    Han Wang, Jun Tang, **aodong Liu, Shanyan Guan, Rong **e in Computer Vision – ECCV 2022 (2022)

  2. No Access

    Chapter and Conference Paper

    PNO: Personalized Network Optimization for Human Pose and Shape Reconstruction

    Most previous human pose and shape reconstruction methods focus on the generalization ability and learn a prior of the general pose and shape, however the personalized features are often ignored. We argue that...

    Zhijie Cao, Min Wang, Shanyan Guan in Artificial Neural Networks and Machine Lea… (2021)

  3. No Access

    Chapter and Conference Paper

    Multi-person/Group Interactive Video Generation

    Human motion generation from caption is a fast-growing and promising technique. Recent methods employ the latest hidden states of a recurrent neural network (RNN) to encode the skeletons, which can only addres...

    Zhan Wang, Tai** Yao, Huawei Wei in Advances in Multimedia Information Process… (2018)