Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    Bootstrapped Masked Autoencoders for Vision BERT Pretraining

    We propose bootstrapped masked autoencoders (BootMAE), a new approach for vision BERT pretraining. BootMAE improves the original masked autoencoders (MAE) with two core designs: 1) momentum encoder that provid...

    **aoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen in Computer Vision – ECCV 2022 (2022)

  2. No Access

    Chapter and Conference Paper

    Talking Face Video Generation with Editable Expression

    In rencent years, the convolutional neural network have been proved to be a great success in generating talking face. Existing methods have combined a single face image with speech to generate talking face vid...

    Luchuan Song, Bin Liu, Nenghai Yu in Image and Graphics (2021)

  3. No Access

    Chapter and Conference Paper

    Towards More Powerful Multi-column Convolutional Network for Crowd Counting

    Scale variation has always been one of the most challenging problems for crowd counting. By using multi-column convolutions with different receptive fields to deal with different scales in the scene, the multi...

    Jiabin Zhang, Qi Chu, Weihai Li, Bin Liu, Weiming Zhang, Nenghai Yu in Image and Graphics (2021)

  4. No Access

    Chapter and Conference Paper

    Learning from Rankings with Multi-level Features for No-Reference Image Quality Assessment

    Deep neural networks for image quality assessment have been suffering from a lack of training data for a long time, as it is expensive and laborious to collect sufficient subjective mean opinion scores (MOS). ...

    Yiqun Li, Lan Ma, Dahai Yu in Pattern Recognition and Computer Vision (2020)