![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Chapter and Conference Paper
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
Recent years have witnessed a trend of applying context frames to boost the performance of object detection as video object detection. Existing methods usually aggregate features at one stroke to enhance the f...
-
Chapter and Conference Paper
PNO: Personalized Network Optimization for Human Pose and Shape Reconstruction
Most previous human pose and shape reconstruction methods focus on the generalization ability and learn a prior of the general pose and shape, however the personalized features are often ignored. We argue that...
-
Chapter and Conference Paper
Multi-person/Group Interactive Video Generation
Human motion generation from caption is a fast-growing and promising technique. Recent methods employ the latest hidden states of a recurrent neural network (RNN) to encode the skeletons, which can only addres...