277 Result(s)
-
Chapter and Conference Paper
Question-Guided Hybrid Convolution for Visual Question Answering
In this paper, we propose a novel Question-Guided Hybrid Convolution (QGHC) network for Visual Question Answering (VQA). Most state-of-the-art VQA methods fuse the high-level textual and visual features from t...
-
Chapter and Conference Paper
Progressive Neural Architecture Search
We propose a new method for learning the structure of convolutional neural networks (CNNs) that is more efficient than recent state-of-the-art methods based on reinforcement learning and evolutionary algorithm...
-
Chapter and Conference Paper
A Modulation Module for Multi-task Learning with Applications in Image Retrieval
Multi-task learning has been widely adopted in many computer vision tasks to improve overall computation efficiency or boost the performance of individual tasks, under the assumption that those tasks are corre...
-
Chapter and Conference Paper
ReenactGAN: Learning to Reenact Faces via Boundary Transfer
We present a novel learning-based framework for face reenactment. The proposed method, known as ReenactGAN, is capable of transferring facial movements and expressions from an arbitrary person’s monocular vide...
-
Chapter and Conference Paper
C-WSL: Count-Guided Weakly Supervised Localization
We introduce count-guided weakly supervised localization (C-WSL), an approach that uses per-class object count as a new form of supervision to improve weakly supervised localization (WSL). C-WSL uses a simple ...
-
Chapter and Conference Paper
Multi-fiber Networks for Video Recognition
In this paper, we aim to reduce the computational cost of spatio-temporal deep neural networks, making them run as fast as their 2D counterparts while preserving state-of-the-art accuracy on video recognition ...
-
Chapter and Conference Paper
Neural Graph Matching Networks for Fewshot 3D Action Recognition
We propose Neural Graph Matching (NGM) Networks, a novel framework that can learn to recognize a previously unseen 3D action class with only a few examples. We achieve this by leveraging the inherent structure o...
-
Chapter and Conference Paper
Semi-dense 3D Reconstruction with a Stereo Event Camera
Event cameras are bio-inspired sensors that offer several advantages, such as low latency, high-speed and high dynamic range, to tackle challenging scenarios in computer vision. This paper presents a solution ...
-
Chapter and Conference Paper
Factorizable Net: An Efficient Subgraph-Based Framework for Scene Graph Generation
Generating scene graphs to describe the object interactions inside an image has gained increasing interest in recent years. However, most of the previous methods use complicated structures with slow inference speed or ...
-
Chapter and Conference Paper
Affine-Gradient Based Local Binary Pattern Descriptor for Texture Classification
We present a novel Affine-Gradient based Local Binary Pattern (AGLBP) descriptor for texture classification. It is very hard to describe complicated texture using a single type of information, such as Local Binary ...
-
Chapter and Conference Paper
A Quality Evaluation Scheme to 3D Printing Objects Using Stereovision Measurement
The paper presents a comprehensive evaluation method for shape consistency using three-dimensional scanning, reverse engineering and post-processing. The complete evaluation scheme includes data collection,...
-
Chapter and Conference Paper
Enhancing 3D Facial Expression Recognition by Exaggerating Geometry Characteristics
This paper studies exaggerated facial shapes in addition to original facial shapes to assist 3D Facial Expression Recognition (FER). We propose a Poisson equation based approach to exaggerate facial shape char...
-
Chapter and Conference Paper
4D ISIP: 4D Implicit Surface Interest Point Detection
In this paper, we propose a new method to detect 4D spatiotemporal interest points, called 4D-ISIP (4-dimensional implicit surface interest point). We implicitly represent the 3D scene by a 3D volume which has a tr...
-
Chapter and Conference Paper
Combining Object-Based Attention and Attributes for Image Captioning
Image captioning has been a hot topic in computer vision and natural language processing. Recently, researchers have proposed many models for image captioning which can be classified into two classes: visual a...
-
Chapter and Conference Paper
Age Estimation by Refining Label Distribution in Deep CNN
This paper proposes an age estimation algorithm by refining the label distribution in a deep learning framework. There are two tasks during the training period of our algorithm. The first one finds the optimal...
-
Chapter and Conference Paper
Realtime Human-UAV Interaction Using Deep Learning
In this paper, we propose realtime human gesture identification for controlling a micro UAV in a GPS-denied environment. Exploiting the breakthrough of deep convolutional networks in computer vision, we develop...
-
Chapter and Conference Paper
Multi-modal Image Registration Based on Modified-SURF and Consensus Inliers Recovery
Multi-modal image registration has received significant research attention in past decades. In this paper, we propose a solution for rigid multi-modal image registration, which focuses on handling gradient...
-
Chapter and Conference Paper
An Image Segmentation Method Based on Asynchronous Multiscale Similarity Measure
Image segmentation, a basic operation in computer vision, is widely used in object detection, feature extraction and so on. In order to improve the effects and speed of image segmentation, an asynchronous pr...
-
Chapter and Conference Paper
Coarse and Fine: A New Method for Gender Classification in the Wild
As one of the most important soft biometrics, gender has substantial applications in various areas such as demography and human-computer interaction. Successful gender estimation of face images taken under rea...
-
Chapter and Conference Paper
Multi-template Matching Algorithm Based on Adaptive Fusion
A target recognition method based on adaptive fusion of multiple matching results is proposed, in order to take advantage of the gray information and feature information in forward-looking infrared (FLIR) tar...