Search Results
-
ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection
With the rapid development of depth sensors, more and more RGB-D videos can be obtained. Identifying the foreground in RGB-D videos is a fundamental...
-
RGB oralscan video-based orthodontic treatment monitoring
Orthodontic treatment monitoring involves using current images and previous 3D models to estimate the relative position of individual teeth before...
-
Effective and efficient approach for gesture detection in video through monocular RGB frames
Detecting gestures is a difficult operation, especially when the context is dynamic or noisy. Several approaches use a bounding box for the same,...
-
Video person re-identification based on RGB triple pyramid model
In order to solve the difficult problem of pedestrian motion extraction in video, in this paper, we propose a novel video action information...
-
Non-contact heart rate measurement using low-cost RGB camera under complex light conditions
A non-contact heart rate measurement method using low-cost RGB video is proposed in this study. Only an RGB video of a human wrist is required as...
-
UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection
Multi-modality images with complementary cues can significantly improve the performance of salient object detection (SOD) methods in challenging...
-
VT-BPAN: vision transformer-based bilinear pooling and attention network fusion of RGB and skeleton features for human action recognition
Recent generation Microsoft Kinect Camera captures a series of multimodal signals that provide RGB video, depth sequences, and skeleton information,...
-
InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction from Multi-view RGB-D Images
Humans constantly interact with objects to accomplish tasks. To understand such interactions, computers need to reconstruct these in 3D from images...
-
An RGB-D sensor-based instrument for sitting balance assessment
Sitting balance is an important aspect of overall motor control, particularly for individuals who are not able to stand. Typical clinical assessment...
-
Adaptive Multi-Source Predictor for Zero-Shot Video Object Segmentation
Static and moving objects often occur in real-life videos. Most video object segmentation methods only focus on extracting and exploiting motion cues...
-
RGB Guided ToF Imaging System: A Survey of Deep Learning-Based Methods
Integrating an RGB camera into a ToF imaging system has become a significant technique for perceiving the real world. The RGB guided ToF imaging...
-
Specificity-preserving RGB-D saliency detection
Salient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion...
-
Deep learning-based RGB-thermal image denoising: review and applications
Recently, vision-based detection (VD) technology has been well-developed, and its general-purpose object detection algorithms have been applied in...
-
CMDCF: an effective cross-modal dense cooperative fusion network for RGB-D SOD
The success of vision transformer demonstrates that the transformer structure is also suitable for various vision tasks, including high-level...
-
Interactive context-aware network for RGB-T salient object detection
Salient object detection (SOD) focuses on distinguishing the most conspicuous objects in the scene. However, most related works are based on RGB...
-
Unsupervised RGB-T object tracking with attentional multi-modal feature fusion
RGB-T tracking means that given the object position in the first frame, the tracker is trained to predict the position of the object in consecutive...
-
Pyramid contract-based network for RGB-T salient object detection
RGB-Thermal (RGB-T) salient object detection (SOD) aims at utilizing RGB and thermal infrared data to segment the most visually attractive object(s)...
-
3D gesture segmentation for word-level Arabic sign language using large-scale RGB video sequences and autoencoder convolutional networks
Sign languages use hands, body movements, and facial expressions to deliver a message. Developing a communication environment for the deaf community...
-
A RGB-D feature fusion network for occluded object 6D pose estimation
6D pose estimation using RGB-D data has been widely utilized in various scenarios, with keypoint-based methods receiving significant attention due to...
-
A Novel Edge-Inspired Depth Quality Evaluation Network for RGB-D Salient Object Detection
Recently, pairs of RGB images and depth images, denoted as RGB-D images, have been introduced to improve the performance of salient object...