Skip to main content

previous disabled Page of 3
and
  1. No Access

    Chapter and Conference Paper

    Descriptive Attributes for Language-Based Object Keypoint Detection

    Multimodal vision and language (VL) models have recently shown strong performance in phrase grounding and object detection for both zero-shot and finetuned cases. We adapt a VL model (GLIP) for keypoint detect...

    Jerod Weinman, Serge Belongie, Stella Frank in Computer Vision Systems (2023)

  2. No Access

    Chapter and Conference Paper

    Text-Driven Stylization of Video Objects

    We tackle the task of stylizing video objects in an intuitive and semantic manner following a user-specified text prompt. This is a challenging task as the resulting video must satisfy multiple properties: (1)...

    Sebastian Loeschcke, Serge Belongie, Sagie Benaim in Computer Vision – ECCV 2022 Workshops (2023)

  3. No Access

    Chapter and Conference Paper

    SITTA: Single Image Texture Translation for Data Augmentation

    Recent advances in data augmentation enable one to translate images by learning the map** between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on ...

    Boyi Li, Yin Cui, Tsung-Yi Lin, Serge Belongie in Computer Vision – ECCV 2022 Workshops (2023)

  4. Article

    Open Access

    Occluded Video Instance Segmentation: A Benchmark

    Can our video understanding systems perceive objects when a heavy occlusion exists in a scene? To answer this question, we collect a large-scale dataset called OVIS for occluded video instance segmentation, th...

    Jiyang Qi, Yan Gao, Yao Hu, **nggang Wang in International Journal of Computer Vision (2022)

  5. No Access

    Chapter and Conference Paper

    Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

    We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing research on audiovisual fine-grained categorization. While our community has made great strides in fine-grained visual categorizati...

    Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam in Computer Vision – ECCV 2022 (2022)

  6. No Access

    Chapter and Conference Paper

    Visual Prompt Tuning

    The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, i.e., full fine-tuning. This paper introduces Visual Prompt Tuning (VPT) as an efficient and effective alte...

    Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie in Computer Vision – ECCV 2022 (2022)

  7. No Access

    Chapter and Conference Paper

    On Label Granularity and Object Localization

    Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granul...

    Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang in Computer Vision – ECCV 2022 (2022)

  8. No Access

    Article

    Convolutional Networks with Adaptive Inference Graphs

    Do convolutional networks really need a fixed feed-forward structure? What if, after identifying the high-level concept of an image, a network could move directly to a layer that can distinguish fine-grained d...

    Andreas Veit, Serge Belongie in International Journal of Computer Vision (2020)

  9. No Access

    Chapter and Conference Paper

    A Metric Learning Reality Check

    Deep metric learning papers from the past four years have consistently claimed great advances in accuracy, often more than doubling the performance of decade-old methods. In this paper, we take a closer look a...

    Kevin Musgrave, Serge Belongie, Ser-Nam Lim in Computer Vision – ECCV 2020 (2020)

  10. No Access

    Chapter and Conference Paper

    Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

    In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorizatio...

    Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui in Computer Vision – ECCV 2020 (2020)

  11. No Access

    Chapter and Conference Paper

    Learning Gradient Fields for Shape Generation

    In this work, we propose a novel technique to generate shapes from point cloud data. A point cloud can be viewed as samples from a distribution of 3D points whose density is concentrated near the surface of th...

    Ruo** Cai, Guandao Yang, Hadar Averbuch-Elor, Zekun Hao in Computer Vision – ECCV 2020 (2020)

  12. Chapter and Conference Paper

    Deep Fundamental Matrix Estimation Without Correspondences

    Estimating fundamental matrices is a classic problem in computer vision. Traditional methods rely heavily on the correctness of estimated key-point correspondences, which can be noisy and unreliable. As a resu...

    Omid Poursaeed, Guandao Yang, Aditya Prakash in Computer Vision – ECCV 2018 Workshops (2019)

  13. No Access

    Chapter

    Crowd Research: Open and Scalable University Laboratories

    Research experiences today are limited to a privileged few at select universities. Providing open access to research experiences would enable global upward mobility and increased diversity in the scientific wo...

    Rajan Vaish, Snehalkumar (Neil) S. Gaikwad, Geza Kovacs in Design Thinking Research (2019)

  14. No Access

    Article

    Vision-based real estate price estimation

    Since the advent of online real estate database companies like Zillow, Trulia and Redfin, the problem of automatic estimation of market values for houses has received considerable attention. Several real estat...

    Omid Poursaeed, Tomáš Matera, Serge Belongie in Machine Vision and Applications (2018)

  15. Chapter and Conference Paper

    Learning Single-View 3D Reconstruction with Limited Pose Supervision

    It is expensive to label images with 3D structure or precise camera pose. Yet, this is precisely the kind of annotation required to train single-view 3D reconstruction models. In contrast, unlabeled images or ...

    Guandao Yang, Yin Cui, Serge Belongie, Bharath Hariharan in Computer Vision – ECCV 2018 (2018)

  16. Chapter and Conference Paper

    Multimodal Unsupervised Image-to-Image Translation

    Unsupervised image-to-image translation is an important and challenging problem in computer vision. Given an image in the source domain, the goal is to learn the conditional distribution of corresponding image...

    Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz in Computer Vision – ECCV 2018 (2018)

  17. Chapter and Conference Paper

    Convolutional Networks with Adaptive Inference Graphs

    Do convolutional networks really need a fixed feed-forward structure? What if, after identifying the high-level concept of an image, a network could move directly to a layer that can distinguish fine-grained d...

    Andreas Veit, Serge Belongie in Computer Vision – ECCV 2018 (2018)

  18. No Access

    Chapter

    Cross-View Image Geo-localization

    The recent availability of large amounts of geo-tagged imagery has inspired a number of data-driven solutions to the image geo-localization problem. Existing approaches predict the location of a query image by...

    Tsung-Yi Lin, Serge Belongie, James Hays in Large-Scale Visual Geo-Localization (2016)

  19. No Access

    Chapter and Conference Paper

    Discriminative Regions: A Substrate for Analyzing Life-Logging Image Sequences

    Life-logging devices are becoming ubiquitous, yet still processing and extracting information from the vast amount of data that is being captured is a very challenging task. We propose a method to find discrim...

    Mohammad Moghimi, Jacqueline Kerr, Eileen Johnson, Suneeta Godbole in MultiMedia Modeling (2015)

  20. Article

    Editorial: Special Issue on Active and Interactive Methods in Computer Vision

    Kristen Grauman, Serge Belongie in International Journal of Computer Vision (2014)

previous disabled Page of 3