-
Article
FLAVR: flow-free architecture for fast video frame interpolation
Many modern frame interpolation approaches rely on explicit bidirectional optical flows between adjacent frames, thus are sensitive to the accuracy of underlying flow estimation in handling occlusions while ad...
-
Chapter and Conference Paper
Single-Stream Multi-level Alignment for Vision-Language Pretraining
Self-supervised vision-language pretraining from pure images and text with a contrastive loss is effective, but ignores fine-grained alignment due to a dual-stream architecture that aligns image and text repre...
-
Chapter and Conference Paper
Improving Face Recognition by Clustering Unlabeled Faces in the Wild
While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotati...
-
Chapter and Conference Paper
Learning to Look around Objects for Top-View Representations of Outdoor Scenes
Given a single RGB image of a complex outdoor road scene in the perspective view, we address the novel problem of estimating an occlusion-reasoned semantic scene layout in the top-view. This challenging proble...
-
Chapter and Conference Paper
Hierarchical Metric Learning and Matching for 2D and 3D Geometric Correspondences
Interest point descriptors have fueled progress on almost every problem in computer vision. Recent advances in deep neural networks have enabled task-specific learned descriptors that outperform hand-crafted d...
-
Chapter and Conference Paper
Deep Deformation Network for Object Landmark Localization
We propose a novel cascaded framework, namely deep deformation network (DDN), for localizing landmarks in non-rigid objects. The hallmarks of DDN are its incorporation of geometric constraints within a convolu...
-
Chapter and Conference Paper
A 4D Light-Field Dataset and CNN Architectures for Material Recognition
We introduce a new light-field dataset of materials, and take advantage of the recent success of deep learning to perform material recognition on the 4D light-field. Our dataset contains 12 material categories...
-
Chapter and Conference Paper
On Shape and Material Recovery from Motion
We present a framework for the joint recovery of the shape and reflectance of an object with dichromatic BRDF, using motion cues. We show that four (small or differential) motions of the object, or three motio...
-
Article
Open AccessGlobally Optimal Algorithms for Stratified Autocalibration
We present practical algorithms for stratified autocalibration with theoretical guarantees of global optimality. Given a projective reconstruction, we first upgrade it to affine by estimating the position of t...
-
Chapter and Conference Paper
A Dual Theory of Inverse and Forward Light Transport
Inverse light transport seeks to undo global illumination effects, such as interreflections, that pervade images of most scenes. This paper presents the theoretical and computational foundations for inverse li...