Search Page | SpringerLink

A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment

This paper proposes a new neural network architecture for estimating the four degrees of freedom poses of vehicles from monocular images in an...

Tomasz Nowak, Piotr Skrzypczyński in Neural Information Processing

Conference paper 2024

Deep-learning based system for effective and automatic blood vessel segmentation from Retinal fundus images

The segmentation of blood vessels through color fundus images is a difficult and time-consuming task that requires experienced clinicians. Recently,...

Law Kumar Singh, Munish Khanna, ... Rekha Singh in Multimedia Tools and Applications

Article 01 June 2023

Neural attention for image captioning: review of outstanding methods

Image captioning is the task of automatically generating sentences that describe an input image in the best way possible. The most successful...

Zanyar Zohourianshahzadi, Jugal K. Kalita in Artificial Intelligence Review

Article 29 November 2021

Using the New YoLo Models in Detecting Small-Sized Objects in the Case of Rice Grains on Branche

Identifying a small-sized object is of interest to many studies, especially the rice grain on the branch. Due to its significance to the evaluation...

Khang Nguyen Quoc, Anh Nguyen Quynh, ... Luyl-Da Quach in Data Science and Artificial Intelligence

Conference paper 2023

The Geometry Enhanced Deep Implicit Function Based 3D Reconstruction for Objects in a Real-Scene Image

For the 3D reconstruction of objects in a real scene, the state-of-the-art scheme is to detect and identify the target by a classic deep neural...

Haiwei Mei, Chenxing Wang in PRICAI 2022: Trends in Artificial Intelligence

Conference paper 2022

A comprehensive survey on human pose estimation approaches

The human pose estimation is a significant issue that has been taken into consideration in the computer vision network for recent decades. It is a...

Shradha Dubey, Manish Dixit in Multimedia Systems

Article 16 August 2022

Computational Image Analysis Techniques, Programming Languages and Software Platforms Used in Cancer Research: A Sco** Review

Background: Cancer-related research, as indicated by the number of entries in Medline, the National Library of Medicine of the USA, has dominated the...

Youssef Arafat, Constantino Carlos Reyes-Aldasoro in Medical Image Understanding and Analysis

Conference paper 2022

Video super-resolution based on deep learning: a comprehensive survey

Video super-resolution (VSR) is reconstructing high-resolution videos from low resolution ones. Recently, the VSR methods based on deep neural...

Hongying Liu, Zhubo Ruan, ... Radu Timofte in Artificial Intelligence Review

Article 01 April 2022

Self-Enhanced Attention for Image Captioning

Image captioning, which involves automatically generating textual descriptions based on the content of images, has garnered increasing attention from...

Qingyu Sun, Juan Zhang, ... Yongbin Gao in Neural Processing Letters

Article Open access 01 April 2024

Syntax Tree Constrained Graph Network for Visual Question Answering

Visual Question Answering (VQA) aims to automatically answer natural language questions related to given image content. Existing VQA methods...

**angrui Su, Qi Zhang, ... Liang Hu in Neural Information Processing

Conference paper 2024

Engineering the Future: A Deep Dive into Remote Inspection and Reality Capture for Railway Infrastructure Digitalization

The growing importance of the railway sector demands efficient inspection and maintenance. Traditional methods are labor-intensive and costly,...

Rafael Cabral, Diogo Ribeiro, Anna Rakoczy in Digital Railway Infrastructure

Chapter 2024

MEDAS: an open-source platform as a service to help break the walls between medicine and informatics

In the past decade, deep learning (DL) has achieved unprecedented success in numerous fields, such as computer vision and healthcare. Particularly,...

Liang Zhang, Johann Li, ... Björn W. Schuller in Neural Computing and Applications

Article 16 January 2022

Rapid Seismic Risk Assessment of Bridges Using UAV Aerial Photogrammetric Survey

In this paper a framework for the rapid seismic risk assessment of bridges using aerial surveys using Unmanned Aerial Systems is presented. The...

Vincenzo Barrile, Gabriele Candela, ... Giuliana Bilotta in Geomatics for Green and Digital Transition

Conference paper 2022

GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-Aware Supervision

We present a novel end-to-end framework named as GSNet (Geometric and Scene-aware Network), which jointly estimates 6DoF poses and reconstructs...

Lei Ke, Shichao Li, ... Chi-Keung Tang in Computer Vision – ECCV 2020

Conference paper 2020

Local self-attention in transformer for visual question answering

Visual Question Answering (VQA) is a multimodal task that requires models to understand both textual and visual information. Various VQA models have...

**ang Shen, Dezhi Han, ... Gaofeng Luo in Applied Intelligence

Article 15 December 2022

Transformer based Multitask Learning for Image Captioning and Object Detection

In several real-world scenarios like autonomous navigation and mobility, to obtain a better visual understanding of the surroundings, image...

Debolena Basak, P. K. Srijith, Maunendra Sankar Desarkar in Advances in Knowledge Discovery and Data Mining

Conference paper 2024

Automatic oral cancer detection and classification using modified local texture descriptor and machine learning algorithms

Oral cancer is the most extensive universal problem, with 10.3 million deaths in 2020 by the World Health Organization (WHO). It is the most common...

Vijaya Yaduvanshi, R. Murugan, Tripti Goel in Multimedia Tools and Applications

Article 10 April 2024

A Shield Machine Segment Position Recognition Algorithm Based on Improved Voxel and Seed Filling

In response to the problems of low execution efficiency and poor real-time performance of traditional point cloud clustering algorithms when there is...

Pei Zhang, Lijie Jiang, ... Honglei Zhang in Intelligent Robotics and Applications

Conference paper 2023

3D Object Detection for Autonomous Driving: A Comprehensive Survey

Autonomous driving, in recent years, has been receiving increasing attention for its potential to relieve drivers’ burdens and improve the safety of...

Jiageng Mao, Shaoshuai Shi, ... Hongsheng Li in International Journal of Computer Vision

Article 27 April 2023

InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction

Humans constantly interact with daily objects to accomplish tasks. To understand such interactions, computers need to reconstruct these from cameras...

Yinghao Huang, Omid Taheri, ... Dimitrios Tzionas in Pattern Recognition

Conference paper 2022

Search

Filters

Search Results

Search

Navigation