-
Chapter and Conference Paper
Deep Disentangled Representations for Volumetric Reconstruction
We introduce a convolutional neural network for inferring a compact disentangled graphical description of objects from 2D images that can be used for volumetric reconstruction. The network comprises an encoder...
-
Chapter and Conference Paper
Overcoming Occlusion with Inverse Graphics
Scene understanding tasks such as the prediction of object pose, shape, appearance and illumination are hampered by the occlusions often found in images. We propose a vision-as-inverse-graphics approach to han...
-
Chapter and Conference Paper
Relating Things and Stuff by High-Order Potential Modeling
In the last few years, substantially different approaches have been adopted for segmenting and detecting “things” (object categories that have a well defined shape such as people and cars) and “stuff” (object ...
-
Chapter and Conference Paper
Learning to Efficiently Detect Repeatable Interest Points in Depth Data
Interest point (IP) detection is an important component of many computer vision methods. While there are a number of methods for detecting IPs in RGB images, modalities such as depth images and range scans hav...
-
Chapter and Conference Paper
Indoor Segmentation and Support Inference from RGBD Images
We present an approach to interpret the major surfaces, objects, and support relations of an indoor scene from an RGBD image. Most existing work ignores physical interactions or is applied only to tidy rooms a...
-
Chapter and Conference Paper
A Convex Discrete-Continuous Approach for Markov Random Fields
We propose an extension of the well-known LP relaxation for Markov random fields to explicitly allow continuous label spaces. Unlike conventional continuous formulations of labelling problems which assume that...
-
Chapter and Conference Paper
Large-Lexicon Attribute-Consistent Text Recognition in Natural Images
This paper proposes a new model for the task of word recognition in natural images that simultaneously models visual and lexicon consistency of words in a single probabilistic model. Our approach combines loca...
-
Chapter and Conference Paper
Latent Hough Transform for Object Detection
Hough transform based methods for object detection work by allowing image features to vote for the location of the object. While this representation allows for parts observed in different training instances to...
-
Chapter and Conference Paper
Energy Minimization under Constraints on Label Counts
Many computer vision problems such as object segmentation or reconstruction can be formulated in terms of labeling a set of pixels or voxels. In certain scenarios, we may know the number of pixels or voxels wh...
-
Chapter and Conference Paper
TriangleFlow: Optical Flow with Triangulation-Based Higher-Order Likelihoods
We use a simple yet powerful higher-order conditional random field (CRF) to model optical flow. It consists of a standard photo-consistency cost and a prior on affine motions both modeled in terms of higher-or...
-
Chapter and Conference Paper
Geometric Image Parsing in Man-Made Environments
We present a new parsing framework for the line-based geometric analysis of a single image coming from a man-made environment. This parsing framework models the scene as a composition of geometric primitives s...
-
Chapter and Conference Paper
Graph Cut Based Inference with Co-occurrence Statistics
Markov and Conditional random fields (crfs) used in computer vision typically model only local interactions between variables, as this is computationally tractable. In this paper we consider a class of global pot...