Search Page | SpringerLink

ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental...

Junhao Lin, Lei Zhu, ... Liansheng Wang in International Journal of Computer Vision

Article 04 June 2024

RGB oralscan video-based orthodontic treatment monitoring

Orthodontic treatment monitoring involves using current images and previous 3D models to estimate the relative position of individual teeth before...

Yan Tian, Hanshi Fu, ... Ruili Wang in Science China Information Sciences

Article 27 December 2023

Effective and efficient approach for gesture detection in video through monocular RGB frames

Detecting gestures is a difficult operation, especially when the context is dynamic or noisy. Several approaches use a bounding box for the same,...

Rameez Shamalik, Sanjay Koli in Multimedia Tools and Applications

Article 14 November 2022

Video person re-identification based on RGB triple pyramid model

In order to solve the difficult problem of pedestrian motion extraction in video, in this paper, we propose a novel video action information...

Dan Wei, Ziyang Wang, Yi** Luo in The Visual Computer

Article 19 January 2022

Non-contact heart rate measurement using low-cost RGB camera under complex light conditions

A non-contact heart rate measurement method using low-cost RGB video is proposed in this study. Only an RGB video of a human wrist is required as...

Haipeng Wang, Shuai Zhang in Multimedia Tools and Applications

Article 15 April 2024

UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection

Multi-modality images with complementary cues can significantly improve the performance of salient object detection (SOD) methods in challenging...

Lina Gao, ** Fu, ... Bing Liu in The Visual Computer

Article 25 April 2023

VT-BPAN: vision transformer-based bilinear pooling and attention network fusion of RGB and skeleton features for human action recognition

Recent generation Microsoft Kinect Camera captures a series of multimodal signals that provide RGB video, depth sequences, and skeleton information,...

Yaohui Sun, Weiyao Xu, ... Ju Gao in Multimedia Tools and Applications

Article 11 December 2023

InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction from Multi-view RGB-D Images

Humans constantly interact with objects to accomplish tasks. To understand such interactions, computers need to reconstruct these in 3D from images...

Yinghao Huang, Omid Taheri, ... Dimitrios Tzionas in International Journal of Computer Vision

Article Open access 06 February 2024

An RGB-D sensor-based instrument for sitting balance assessment

Sitting balance is an important aspect of overall motor control, particularly for individuals who are not able to stand. Typical clinical assessment...

Kristin A. Bartlett, Jorge D. Camba in Multimedia Tools and Applications

Article 09 February 2023

Adaptive Multi-Source Predictor for Zero-Shot Video Object Segmentation

Static and moving objects often occur in real-life videos. Most video object segmentation methods only focus on extracting and exploiting motion cues...

**aoqi Zhao, Shijie Chang, ... Huchuan Lu in International Journal of Computer Vision

Article 07 March 2024

RGB Guided ToF Imaging System: A Survey of Deep Learning-Based Methods

Integrating an RGB camera into a ToF imaging system has become a significant technique for perceiving the real world. The RGB guided ToF imaging...

**n Qiao, Matteo Poggi, ... Stefano Mattoccia in International Journal of Computer Vision

Article 29 May 2024

Specificity-preserving RGB-D saliency detection

Salient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion...

Tao Zhou, Deng-** Fan, ... Huazhu Fu in Computational Visual Media

Article Open access 03 January 2023

Deep learning-based RGB-thermal image denoising: review and applications

Recently, vision-based detection (VD) technology has been well-developed, and its general-purpose object detection algorithms have been applied in...

Yuan Yu, Boon Giin Lee, ... Wan-Young Chung in Multimedia Tools and Applications

Article 29 June 2023

CMDCF: an effective cross-modal dense cooperative fusion network for RGB-D SOD

The success of vision transformer demonstrates that the transformer structure is also suitable for various vision tasks, including high-level...

**ngZhao Jia, Wen**u Zhao, ... YanJun Peng in Neural Computing and Applications

Article 07 May 2024

Interactive context-aware network for RGB-T salient object detection

Salient object detection (SOD) focuses on distinguishing the most conspicuous objects in the scene. However, most related works are based on RGB...

Yuxuan Wang, Feng Dong, ... Jianren Chen in Multimedia Tools and Applications

Article 08 February 2024

Unsupervised RGB-T object tracking with attentional multi-modal feature fusion

RGB-T tracking means that given the object position in the first frame, the tracker is trained to predict the position of the object in consecutive...

Shenglan Li, Rui Yao, ... Zhiwen Shao in Multimedia Tools and Applications

Article 02 February 2023

Pyramid contract-based network for RGB-T salient object detection

RGB-Thermal (RGB-T) salient object detection (SOD) aims at utilizing RGB and thermal infrared data to segment the most visually attractive object(s)...

Ranwan Wu, Hongbo Bi, ... Zhigang Liu in Multimedia Tools and Applications

Article 04 August 2023

3D gesture segmentation for word-level Arabic sign language using large-scale RGB video sequences and autoencoder convolutional networks

Sign languages use hands, body movements, and facial expressions to deliver a message. Develo** a communication environment for the deaf community...

Abdelbasset Boukdir, Mohamed Benaddy, ... Mustapha Kardouchi in Signal, Image and Video Processing

Article 23 February 2022

A RGB-D feature fusion network for occluded object 6D pose estimation

6D pose estimation using RGB-D data has been widely utilized in various scenarios, with keypoint-based methods receiving significant attention due to...

Yiwei Song, Chunhui Tang in Signal, Image and Video Processing

Article 13 June 2024

A Novel Edge-Inspired Depth Quality Evaluation Network for RGB-D Salient Object Detection

Recently, the pair of RGB images and depth images, which is denoted as RGB-D images, are introduced to improve the performances of salient object...

Kun Xu, Jichang Guo in Journal of Grid Computing

Article 04 July 2023

Search

Filters

Search Results

Search

Navigation