-
Chapter and Conference Paper
PE-MED: Prompt Enhancement for Interactive Medical Image Segmentation
Interactive medical image segmentation refers to the accurate segmentation of the target of interest through interaction (e.g., click) between the user and the image. It has been widely studied in recent years...
-
Chapter and Conference Paper
MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification
Video super-resolution is a pivotal task that involves the recovery of high-resolution video frames from their low-resolution counterparts, possessing a multitude of applications in real-world scenarios. Withi...
-
Chapter and Conference Paper
A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue
The task of Video-Grounded Dialogue involves developing a multimodal chatbot capable of answering sequential questions from humans regarding video content, audio, captions and dialog history. Although existing...
-
Chapter and Conference Paper
High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification
Reversible data hiding in encrypted images (RDHEI) can simultaneously achieve secure transmission of images and secret storage of embedded additional data, which can be used for cloud storage and privacy prote...
-
Chapter and Conference Paper
DHP: A Joint Video Download and Dynamic Bitrate Adaptation Algorithm for Short Video Streaming
With the development of multimedia technology and the upgrading of mobile terminal equipment, short video platforms and applications are becoming more and more popular. Compared with traditional long video, sh...
-
Chapter and Conference Paper
Self-supervised Multi-object Tracking with Cycle-Consistency
Multi-object tracking is a challenging video task that requires both locating the objects in the frames and associating the objects among the frames, which usually utilizes the tracking-by-detection paradigm. ...
-
Chapter and Conference Paper
CV4Code: Sourcecode Understanding via Visual Code Representations
We present CV4Code ...
-
Chapter and Conference Paper
Spotlights: Probing Shapes from Spherical Viewpoints
Recent years have witnessed the surge of learned representations that directly build upon point clouds. Inspired by spherical multi-view scanners, we propose a novel sampling model called Spotlights to represent ...
-
Chapter and Conference Paper
Self-attention Convolution for Sparse to Dense Depth Completion
Depth completion from a sparse set of depth measurements and a single RGB image has been shown to be an effective method for generating high-quality depth images. However, traditional convolutional neural netw...
-
Chapter and Conference Paper
A Structured Feature Learning Model for Clothing Keypoints Localization
Visual fashion analysis has attracted much attention in recent years. In particular, as a fundamental technology, clothing keypoints localization has great application potential. However, most of researchers...
-
Chapter and Conference Paper
Modeling and Simulation of Soft Tissue Deformation
A stable and accurate deformable model to simulate the deformation of soft tissues is a challenging area of research. This paper describes a soft tissue simulation method that can deform multiple organs synchr...
-
Chapter and Conference Paper
An Objectionable Image Detection Method Based on Movement Invariants and Clustering
The spread of objectionable content over the mobile Internet reflects badly on both users and businesses. To cope with this situation, we have proposed a relatively effective and efficient meth...
-
Chapter and Conference Paper
3D Shape Analysis for Liver-Gallbladder Anatomical Structure Retrieval
Anatomical structure is important for medical education and disease diagnosis. In the application of surgical simulation, different anatomical structures can be retrieved to create a variety of surgical scenario...
-
Chapter and Conference Paper
Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation
We consider the problem of classifying surgical gestures and skill level in robotic surgical tasks. Prior work in this area models gestures as states of a hidden Markov model (HMM) whose observations are discr...
-
Chapter and Conference Paper
In Defence of Negative Mining for Annotating Weakly Labelled Data
We propose a novel approach to annotating weakly labelled data. In contrast to many existing approaches that perform annotation by seeking clusters of self-similar exemplars (minimising intra-class variance), ...
-
Chapter and Conference Paper
Age Invariant Face Verification with Relative Craniofacial Growth Model
Age-separated facial images usually have significant changes in both shape and texture. Although many face recognition algorithms have been proposed in the last two decades, the problem of recognizing facial i...
-
Chapter and Conference Paper
Dual-Force Metric Learning for Robust Distracter-Resistant Tracker
In this paper, we propose a robust distracter-resistant tracking approach by learning a discriminative metric that adaptively learns the importance of features on-the-fly. The proposed metric is elaborately de...
-
Chapter and Conference Paper
A Unifying Theory of Active Discovery and Learning
For learning problems where human supervision is expensive, active query selection methods are often exploited to maximise the return of each supervision. Two problems where this has been successfully applied ...
-
Chapter and Conference Paper
Gait Recognition by Ranking
The advantage of gait over other biometrics such as face or fingerprint is that it can operate from a distance and without subject cooperation. However, this also makes gait subject to changes in various covar...
-
Chapter and Conference Paper
Automated Identification of Thoracolumbar Vertebrae Using Orthogonal Matching Pursuit
A reliable detection and definitive labeling of vertebrae can be difficult due to factors such as the limited imaging coverage and various vertebral anomalies. In this paper, we investigate the problem of iden...