![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Chapter and Conference Paper
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
We propose bootstrapped masked autoencoders (BootMAE), a new approach for vision BERT pretraining. BootMAE improves the original masked autoencoders (MAE) with two core designs: 1) momentum encoder that provid...
-
Chapter and Conference Paper
Talking Face Video Generation with Editable Expression
In rencent years, the convolutional neural network have been proved to be a great success in generating talking face. Existing methods have combined a single face image with speech to generate talking face vid...
-
Chapter and Conference Paper
Towards More Powerful Multi-column Convolutional Network for Crowd Counting
Scale variation has always been one of the most challenging problems for crowd counting. By using multi-column convolutions with different receptive fields to deal with different scales in the scene, the multi...
-
Chapter and Conference Paper
Learning from Rankings with Multi-level Features for No-Reference Image Quality Assessment
Deep neural networks for image quality assessment have been suffering from a lack of training data for a long time, as it is expensive and laborious to collect sufficient subjective mean opinion scores (MOS). ...