Search Page | SpringerLink

Research on the algorithm of helmet-wearing detection based on the optimized yolov4

At construction sites, wearing hard hats is an important and effective measure to protect workers from accidental injury. In order to remind workers...

Lingpeng Zeng, Xuliang Duan, ... Minjiang Deng in The Visual Computer

Article 19 May 2022

A Real World Dataset for Multi-view 3D Reconstruction

We present a dataset of 998 3D models of everyday tabletop objects along with their 847,000 real world RGB and depth images. Accurate annotation of...

Rakesh Shrestha, Siqi Hu, ... ** Tan in Computer Vision – ECCV 2022

Conference paper 2022

Light-Weight Multi-view Topology Consistent Facial Geometry and Reflectance Capture

We present a light-weight multi-view capture system with different lighting conditions to generate a topology consistent facial geometry and...

Penglei Ji, Hanchao Li, ... **nguo Liu in Advances in Computer Graphics

Conference paper 2021

Approximate Differentiable Rendering with Algebraic Surfaces

Differentiable renderers provide a direct mathematical link between an object’s 3D representation and images of that object. In this work, we develop...

Leonid Keselman, Martial Hebert in Computer Vision – ECCV 2022

Conference paper 2022

SAFaD: A System for Automatic Fall Detection on Surveillance Imagery

In this work we introduce SAFaD: A System for Automatic Fall Detection on Surveillance Imagery. Our system heavily relies in an intermediate...

Borja Perez-Lopez, Francisco Gomez-Donoso, Miguel Cazorla in ROBOT2022: Fifth Iberian Robotics Conference

Conference paper 2023

LocaliseBot: Multi-view 3D Object Localisation with Differentiable Rendering for Robot Gras**

Robot grasp typically follows five stages: object detection, object localisation, object pose estimation, grasp pose estimation, and grasp planning....

Sujal Vijayaraghavan, Redwan Alqasemi, ... Sudeep Sarkar in Computer Vision – ECCV 2022 Workshops

Conference paper 2023

A Cross-Modal Face Reconstruction Method for Service on Blockchains

The current blockchain systems are suffering the low scalability. In order to improve the scalability and enable the storage of more critical facial...

Zhijie Tan, ** Li in Service Science

Conference paper 2023

A review of 3D object detection based on autonomous driving

3D object detection is a popular research direction in recent years, which plays an important role in the fields of automatic driving, intelligent...

Huijuan Wang, **nyue Chen, ... Peng Liu in The Visual Computer

Article 14 June 2024

Identification of Bird’s Nest Hazard Level of Transmission Line Based on Improved Yolov5 and Location Constraints

Bird’s nest is a common defect in transmission line, which seriously affects the safe and stable operation of the line. This paper presents a method...

Yang Wu, Qunsheng Zeng, ... Jiajie Chen in Pattern Recognition and Computer Vision

Conference paper 2022

Augmentation dataset of a two-dimensional neural network model for use in the car parts segmentation and car classification of three dimensions

In this study, three-dimensional (3D) spatial data, two-dimensional (2D) texture information, and automatic marking processes were used for the...

Chuen-Horng Lin, Chia-Ching Yu, Huan-Yu Chen in The Journal of Supercomputing

Article 14 June 2022

BENet: bi-directional enhanced network for image captioning

Transformer-based models have been used in image captioning to generate a natural language text for describing a given image accurately. In this...

Peixin Yan, Zuoyong Li, ... **nrong Cao in Multimedia Systems

Article 29 January 2024

Objects Can Move: 3D Change Detection by Geometric Transformation Consistency

AR/VR applications and robots need to know when the scene has changed. An example is when objects are moved, added, or removed from the scene. We...

Aikaterini Adam, Torsten Sattler, ... Tomas Pajdla in Computer Vision – ECCV 2022

Conference paper 2022

Re-Thinking Text Clustering for Images with Text

Text-VQA refers to the set of problems that reason about the text present in an image to answer specific questions regarding the image content....

Shwet Kamal Mishra, Soham Joshi, Viswanath Gopalakrishnan in Document Analysis and Recognition - ICDAR 2023

Conference paper 2023

Fabric defect detection algorithm based on residual energy distribution and Gabor feature fusion

Gabor filter is a time-frequency combined analysis method, which is suitable for detecting local anomalies in periodic textures. Gabor-based methods...

Wenning Qin, Haoran Wen, Feng Li in The Visual Computer

Article 28 October 2022

An image storage duplication detection method using recurrent learning for smart application services

Smart and intelligent application services rely on textual and visualization information for meeting user demands. Regardless of the textual data,...

S. Usharani, K. Dhanalakshmi in The Journal of Supercomputing

Article 24 February 2023

Visual Mesh: Real-Time Object Detection Using Constant Sample Density

This paper proposes an enhancement of convolutional neural networks for object detection in resource-constrained robotics through a geometric input...

Trent Houliston, Stephan K. Chalup in RoboCup 2018: Robot World Cup XXII

Conference paper 2019

Introduction to Deep Learning

Deep learning (DL) has made a major impact on data science in the last decade. This chapter introduces the basic concepts of this field. It includes...

Lihi Shiloh-Perl, Raja Giryes in Machine Learning for Data Science Handbook

Chapter 2023

Geometrically-Aware Dual Transformer Encoding Visual and Textual Features for Image Captioning

When describing pictures from the point of view of human observers, the tendency is to prioritize eye-catching objects, link them to corresponding...

Yu-Ling Chang, Hao-Shang Ma, ... Jen-Wei Huang in Advances in Knowledge Discovery and Data Mining

Conference paper 2024

Detection of inclusion by using 3D laser scanner in composite prepreg manufacturing technique using convolutional neural networks

Among different manufacturing techniques available for composite aircraft structures, prepreg-based manual layup is widely used. During the...

M. J. Augustin, Vandana Ramesh, ... M. Ramesh Kumar in Machine Vision and Applications

Article 21 September 2021

3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking

Markerless methods for animal posture tracking have been rapidly develo** recently, but frameworks and benchmarks for tracking large animal groups...

Urs Waldmann, Alex Hoi Hang Chan, ... Fumihiro Kano in International Journal of Computer Vision

Article Open access 07 May 2024

Search

Filters

Search Results

Search

Navigation