Search Results - Springer

Sort By Newest First Oldest First

Chapter and Conference Paper

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing research on audiovisual fine-grained categorization. While our community has made great strides in fine-grained visual categorizati...

Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam… in Computer Vision – ECCV 2022 (2022)
Article

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

Recognition of human poses and actions is crucial for autonomous systems to interact smoothly with people. However, cameras generally capture human poses in 2D as images and videos, which can have significant ...

Ting Liu, Jennifer J. Sun, Long Zhao… in International Journal of Computer Vision (2022)
Chapter and Conference Paper

k-means Mask Transformer

The rise of transformers in vision tasks not only advances network backbone designs, but also starts a brand-new page to achieve end-to-end image recognition (e.g., object detection and panoptic segmentation). Or...

Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins… in Computer Vision – ECCV 2022 (2022)
Chapter and Conference Paper

Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing

While recent face anti-spoofing methods perform well under the intra-domain setups, an effective approach needs to account for much larger appearance variations of images acquired in complex scenes with differ...

Hsin-** Huang, Deqing Sun, Yaojie Liu, Wen-Sheng Chu… in Computer Vision – ECCV 2022 (2022)
Article

Sixteen facial expressions occur in similar contexts worldwide

Understanding the degree to which human facial expressions co-vary with specific social contexts across cultures is central to the theory that emotions enable adaptive responses to important challenges and opp...

Alan S. Cowen, Dacher Keltner, Florian Schroff, Brendan Jou, Hartwig Adam… in Nature (2021)
Chapter and Conference Paper

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

Convolution exploits locality for efficiency at a cost of missing long range context. Self-attention has been adopted to augment CNNs with non-local interactions. Recent works prove it possible to stack self-a...

Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam… in Computer Vision – ECCV 2020 (2020)
Chapter and Conference Paper

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorizatio...

Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui… in Computer Vision – ECCV 2020 (2020)
Chapter and Conference Paper

Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

Supervised learning in large discriminative models is a mainstay for modern computer vision. Such an approach necessitates investing in large-scale human-annotated datasets for achieving state-of-the-art resul...

Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng… in Computer Vision – ECCV 2020 (2020)
Chapter and Conference Paper

View-Invariant Probabilistic Embedding for Human Pose

Depictions of similar human body configurations can vary with changing viewpoints. Using only 2D information, we would like to enable vision algorithms to recognize similarity in human body poses across multip...

Jennifer J. Sun, Jia** Zhao, Liang-Chieh Chen… in Computer Vision – ECCV 2020 (2020)
Chapter and Conference Paper

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications

This work proposes an algorithm, called NetAdapt, that automatically adapts a pre-trained deep neural network to a mobile platform given a resource budget. While many existing algorithms simplify networks based o...

Tien-Ju Yang, Andrew Howard, Bo Chen, **ao Zhang, Alec Go… in Computer Vision – ECCV 2018 (2018)

Download PDF (643 KB) View Chapter
Chapter and Conference Paper

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing ...

Liang-Chieh Chen, Yukun Zhu, George Papandreou… in Computer Vision – ECCV 2018 (2018)

Download PDF (1673 KB) View Chapter
Chapter and Conference Paper

Large-Scale Object Classification Using Label Relation Graphs

In this paper we study how to perform object classification in a principled way that exploits the rich structure of real world labels. We develop a new model that allows encoding of flexible relations between ...

Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy… in Computer Vision – ECCV 2014 (2014)

Download PDF (727 KB)

12 Result(s)

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

k-means Mask Transformer

Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing

Sixteen facial expressions occur in similar contexts worldwide

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

View-Invariant Probabilistic Embedding for Human Pose

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Large-Scale Object Classification Using Label Relation Graphs

Our Content

Other Sites

Help & Contacts