  1. Article

    View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

    Recognition of human poses and actions is crucial for autonomous systems to interact smoothly with people. However, cameras generally capture human poses in 2D as images and videos, which can have significant ...

    Ting Liu, Jennifer J. Sun, Long Zhao in International Journal of Computer Vision (2022)

  2. Chapter and Conference Paper

    k-means Mask Transformer

    The rise of transformers in vision tasks not only advances network backbone designs, but also starts a brand-new page to achieve end-to-end image recognition (e.g., object detection and panoptic segmentation). Or...

    Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins in Computer Vision – ECCV 2022 (2022)

  3. Chapter and Conference Paper

    Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

    Convolution exploits locality for efficiency at a cost of missing long range context. Self-attention has been adopted to augment CNNs with non-local interactions. Recent works prove it possible to stack self-a...

    Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam in Computer Vision – ECCV 2020 (2020)

  4. Chapter and Conference Paper

    Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

    In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorizatio...

    Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui in Computer Vision – ECCV 2020 (2020)

  5. Chapter and Conference Paper

    View-Invariant Probabilistic Embedding for Human Pose

    Depictions of similar human body configurations can vary with changing viewpoints. Using only 2D information, we would like to enable vision algorithms to recognize similarity in human body poses across multip...

    Jennifer J. Sun, Jiaping Zhao, Liang-Chieh Chen in Computer Vision – ECCV 2020 (2020)

  6. Chapter and Conference Paper

    Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

    Spatial pyramid pooling module or encoder-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing ...

    Liang-Chieh Chen, Yukun Zhu, George Papandreou in Computer Vision – ECCV 2018 (2018)

  7. Chapter and Conference Paper

    Large-Scale Object Classification Using Label Relation Graphs

    In this paper we study how to perform object classification in a principled way that exploits the rich structure of real world labels. We develop a new model that allows encoding of flexible relations between ...

    Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy in Computer Vision – ECCV 2014 (2014)