Search Results - Springer

Chapter and Conference Paper

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

We propose bootstrapped masked autoencoders (BootMAE), a new approach for vision BERT pretraining. BootMAE improves the original masked autoencoders (MAE) with two core designs: 1) momentum encoder that provid...

**aoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen… in Computer Vision – ECCV 2022 (2022)

Chapter and Conference Paper

Talking Face Video Generation with Editable Expression

In rencent years, the convolutional neural network have been proved to be a great success in generating talking face. Existing methods have combined a single face image with speech to generate talking face vid...

Luchuan Song, Bin Liu, Nenghai Yu in Image and Graphics (2021)

Chapter and Conference Paper

Towards More Powerful Multi-column Convolutional Network for Crowd Counting

Scale variation has always been one of the most challenging problems for crowd counting. By using multi-column convolutions with different receptive fields to deal with different scales in the scene, the multi...

Jiabin Zhang, Qi Chu, Weihai Li, Bin Liu, Weiming Zhang, Nenghai Yu in Image and Graphics (2021)

Chapter and Conference Paper

Learning from Rankings with Multi-level Features for No-Reference Image Quality Assessment

Deep neural networks for image quality assessment have been suffering from a lack of training data for a long time, as it is expensive and laborious to collect sufficient subjective mean opinion scores (MOS). ...

Yiqun Li, Lan Ma, Dahai Yu in Pattern Recognition and Computer Vision (2020)

4 Result(s)

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

Talking Face Video Generation with Editable Expression

Towards More Powerful Multi-column Convolutional Network for Crowd Counting

Learning from Rankings with Multi-level Features for No-Reference Image Quality Assessment

Our Content

Other Sites

Help & Contacts