Search Results
-
TextREC: A Dataset for Referring Expression Comprehension with Reading Comprehension
Referring expression comprehension (REC) aims at locating a specific object within a scene given a natural language expression. Although referring...
-
Learnable scene prior for point cloud semantic segmentation
In this paper, we propose a Geo-SceneEncoder framework to handle point cloud scene semantic segmentation, including a SceneEncoder to learn a scene...
-
Overview of indoor scene recognition and representation methods based on multimodal knowledge graphs
This paper provides a comprehensive overview of multi-modal knowledge graph technology and a three-layer framework for scene recognition. Integrating...
-
Examination of fire scene reconstructions using virtual reality to enhance forensic decision-making. A case study in Scotland.
When attending a crime scene, first responders are responsible for identifying areas of potential interest for subsequent forensic examination. This...
-
PESTD: a large-scale Persian-English scene text dataset
Extracting text from natural scene images has become a vital issue. The uncertainty of size, color, background, and alignment of the characters make...
-
Moving vehicle tracking and scene understanding: A hybrid approach
In this paper, we present a novel deep learning method for detecting and tracking vehicles within the context of autonomous driving, particularly...
-
Scene text understanding: recapitulating the past decade
Computational perception has indeed been dramatically modified and reformed from handcrafted feature-based techniques to the advent of deep learning....
-
Knowledge enhancement and scene understanding for knowledge-based visual question answering
Knowledge-based visual question answering calls for not only paying attention to the visual content of images but also the support of relevant...
-
Neuro-Symbolic Reasoning for Multimodal Referring Expression Comprehension in HMI Systems
Conventional Human–Machine Interaction (HMI) interfaces have predominantly relied on GUI and voice commands. However, natural human communication...
-
Audio-visual scene recognition using attention-based graph convolutional model
Scene recognition aims to automatically comprehend scenes, and is widely utilized in various fields such as autonomous driving, intelligent security,...
-
A global-local feature adaptive fusion network for image scene classification
Convolutional neural networks (CNN) have been widely used in image scene classification and have achieved remarkable progress. However, because the...
-
Theories and Models in Graph Comprehension
Graph comprehension is the act of deriving meaning from graphs, an activity grounded in visuospatial reasoning that develops through a combination of...
-
Analysis and design framework for the development of indoor scene understanding assistive solutions for the person with visual impairment/blindness
This paper discusses the challenges of the current state of computer vision-based indoor scene understanding assistive solutions for the person with...
-
Evaluation of user response by using visual cues designed to direct the viewer’s attention to the main scene in an immersive environment
Today the visualization of 360-degree videos has become a means to live immersive experiences. However, an important challenge to overcome is how to...
-
Brain-based CALL in flipped higher education GE courses held through LMS: Boosting vocabulary learning and reading comprehension
The thriving technology penetration in all aspects of today’s life and deficiency of traditional pedagogies necessitate wise adoption of modern...
-
Social Perception and Scene Awareness in Human-Robot Interaction
This paper introduces various aspects of social perception skills and scene awareness for interactive robots. The low-level audio-visual perceptual...
-
High level visual scene classification using background knowledge of objects
This paper introduces a novel and simple approach of high-level scene classification. Knowing that objects are the essence of any given scene, the...
-
Classification of Indoor–Outdoor Scene Using Deep Learning Techniques
Scene classification is a process in which a computer’s visualizations of a scene are mapped to segments. Then, the machine applies deep learning to...
-
LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments
We present LaLaLoc++, a method for floor plan localisation in unvisited environments through latent representations of room layout. We perform...
-
A Preliminary Study on the Possibility of Scene Captioning Model Integration as an Improvement in Assisted Navigation for Visually Impaired Users
This research introduces a new approach to augment image captioning for visually impaired individuals by integrating depth data with RGB images. An...