Search Page | SpringerLink

TextREC: A Dataset for Referring Expression Comprehension with Reading Comprehension

Referring expression comprehension (REC) aims at locating a specific object within a scene given a natural language expression. Although referring...

Chenyang Gao, Biao Yang, ... **ang Bai in Document Analysis and Recognition - ICDAR 2023

Conference paper 2023

Learnable scene prior for point cloud semantic segmentation

In this paper, we propose a Geo-SceneEncoder framework to handle point cloud scene semantic segmentation, including a SceneEncoder to learn a scene...

Yuanhao Chai, **gyu Gong, ... Lizhuang Ma in The Visual Computer

Article 08 April 2024

Overview of indoor scene recognition and representation methods based on multimodal knowledge graphs

This paper provides a comprehensive overview of multi-modal knowledge graph technology and a three-layer framework for scene recognition. Integrating...

Jianxin Li, Guannan Si, ... Fengyu Zhou in Applied Intelligence

Article 23 December 2023

Examination of fire scene reconstructions using virtual reality to enhance forensic decision-making. A case study in Scotland.

When attending a crime scene, first responders are responsible for identifying areas of potential interest for subsequent forensic examination. This...

Vincenzo Rinaldi, Karen Ann Robertson, ... Niamh Nic Daeid in Virtual Reality

Article Open access 01 March 2024

PESTD: a large-scale Persian-English scene text dataset

Extracting text from natural scene images has become a vital issue. The uncertainty of size, color, background, and alignment of the characters make...

Atefeh Ranjkesh Rashtehroudi, Alireza Akoushideh, Asadollah Shahbahrami in Multimedia Tools and Applications

Article 25 March 2023

Moving vehicle tracking and scene understanding: A hybrid approach

In this paper, we present a novel deep learning method for detecting and tracking vehicles within the context of autonomous driving, particularly...

**aoxu Liu, Wei Qi Yan, Nikola Kasabov in Multimedia Tools and Applications

Article 13 November 2023

Scene text understanding: recapitulating the past decade

Computational perception has indeed been dramatically modified and reformed from handcrafted feature-based techniques to the advent of deep learning....

Mridul Ghosh, Himadri Mukherjee, ... Kaushik Roy in Artificial Intelligence Review

Article 18 June 2023

Knowledge enhancement and scene understanding for knowledge-based visual question answering

Knowledge-based visual question answering calls for not only paying attention to the visual content of images but also the support of relevant...

Zhenqiang Su, Gang Gou in Knowledge and Information Systems

Article 14 December 2023

Neuro-Symbolic Reasoning for Multimodal Referring Expression Comprehension in HMI Systems

Conventional Human–Machine Interaction (HMI) interfaces have predominantly relied on GUI and voice commands. However, natural human communication...

Aman Jain, Anirudh Reddy Kondapally, ... Hitomi Yanaka in New Generation Computing

Article Open access 15 February 2024

Audio-visual scene recognition using attention-based graph convolutional model

Scene recognition aims to automatically comprehend scenes, and is widely utilized in various fields such as autonomous driving, intelligent security,...

Ziqi Wang, Yikai Wu, ... and Jordi Gonzàlez in Multimedia Tools and Applications

Article 18 June 2024

A global-local feature adaptive fusion network for image scene classification

Convolutional neural networks (CNN) have been widely used in image scene classification and have achieved remarkable progress. However, because the...

Guangrui Lv, Lili Dong, ... Wenhai Xu in Multimedia Tools and Applications

Article 10 June 2023

Theories and Models in Graph Comprehension

Graph comprehension is the act of deriving meaning from graphs, an activity grounded in visuospatial reasoning that develops through a combination of...

Amy Rae Fox in Visualization Psychology

Chapter 2023

Analysis and design framework for the development of indoor scene understanding assistive solutions for the person with visual impairment/blindness

This paper discusses the challenges of the current state of computer vision-based indoor scene understanding assistive solutions for the person with...

Moeen Valipoor, Angélica de Antonio, Julián Cabrera in Multimedia Systems

Article Open access 18 May 2024

Evaluation of user response by using visual cues designed to direct the viewer’s attention to the main scene in an immersive environment

Today the visualization of 360-degree videos has become a means to live immersive experiences.. However, an important challenge to overcome is how to...

Galo Ortega-Alvarez, Carlos Matheus-Chacin, ... Adrian Ruiz-Arroyo in Multimedia Tools and Applications

Article Open access 08 June 2022

Brain-based CALL in flipped higher education GE courses held through LMS: Boosting vocabulary learning and reading comprehension

The thriving technology penetration in all aspects of today’s life and deficiency of traditional pedagogies necessitate wise adoption of modern...

Nasrin Abdolmaleki, Zari Saeedi in International Journal of Educational Technology in Higher Education

Article Open access 07 February 2024

Social Perception and Scene Awareness in Human-Robot Interaction

This paper introduces various aspects of social perception skills and scene awareness for interactive robots. The low-level audio-visual perceptual...

Sarwar Paplu, Prabesh Khadka, ... Karsten Berns in Social Robotics

Conference paper 2024

High level visual scene classification using background knowledge of objects

This paper introduces a novel and simple approach of high-level scene classification. Knowing that objects are the essence of any given scene, the...

Lamine Benrais, Nadia Baha in Multimedia Tools and Applications

Article 18 November 2021

Classification of Indoor–Outdoor Scene Using Deep Learning Techniques

Scene classification is a process in which a computer’s visualizations of a scene are mapped to segments. Then, the machine applies deep learning to...

Bagesh Kumar, Harshit Gupta, ... O. P. Vyas in Machine Learning, Image Processing, Network Security and Data Sciences

Conference paper 2023

LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments

We present LaLaLoc++, a method for floor plan localisation in unvisited environments through latent representations of room layout. We perform...

Henry Howard-Jenkins, Victor Adrian Prisacariu in Computer Vision – ECCV 2022

Conference paper 2022

A Preliminary Study on the Possibility of Scene Captioning Model Integration as an Improvement in Assisted Navigation for Visually Impaired Users

This research introduces a new approach to augment image captioning for visually impaired individuals by integrating depth data with RGB images. An...

Atiqul Islam, Mark Kit Tsun Tee, ... Kazumasa Chong Foh-Zin in Methods and Applications for Modeling and Simulation of Complex Systems

Conference paper 2024

Search

Filters

Search Results

Search

Navigation