Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    Exploiting Unlabeled Data with Vision and Language Models for Object Detection

    Building robust and generic object detection frameworks requires scaling to larger label spaces and bigger training datasets. However, it is prohibitively costly to acquire annotations for thousands of categor...

    Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao in Computer Vision – ECCV 2022 (2022)

  2. No Access

    Chapter and Conference Paper

    Single-Stream Multi-level Alignment for Vision-Language Pretraining

    Self-supervised vision-language pretraining from pure images and text with a contrastive loss is effective, but ignores fine-grained alignment due to a dual-stream architecture that aligns image and text repre...

    Zaid Khan, B. G. Vijay Kumar, **ang Yu, Samuel Schulter in Computer Vision – ECCV 2022 (2022)

  3. Chapter and Conference Paper

    Bayesian Semantic Instance Segmentation in Open Set World

    This paper addresses the semantic instance segmentation task in the open-set conditions, where input images can contain known and unknown object classes. The training process of existing semantic instance segm...

    Trung Pham, B. G. Vijay Kumar, Thanh-Toan Do in Computer Vision – ECCV 2018 (2018)

  4. Chapter and Conference Paper

    Multi-modal Cycle-Consistent Generalized Zero-Shot Learning

    In generalized zero shot learning (GZSL), the set of classes are split into seen and unseen classes, where training relies on the semantic features of the seen and unseen classes and the visual representations...

    Rafael Felix, B. G. Vijay Kumar, Ian Reid, Gustavo Carneiro in Computer Vision – ECCV 2018 (2018)