Search Page | SpringerLink

Hyperbolic Deep Learning in Computer Vision: A Survey

Deep representation learning is a ubiquitous part of modern computer vision. While Euclidean space has been the de facto standard manifold for...

Pascal Mettes, Mina Ghadimi Atigh, ... Serena Yeung in International Journal of Computer Vision

Article Open access 26 March 2024

Simulating vision impairment in virtual reality: a comparison of visual task performance with real and simulated tunnel vision

In this work, we explore the potential and limitations of simulating gaze-contingent tunnel vision conditions using Virtual Reality (VR) with...

Alexander Neugebauer, Nora Castner, ... Siegfried Wahl in Virtual Reality

Article Open access 16 April 2024

Computer Vision Overview

Computer vision is an information subject/discipline that uses computers to realize the functions of human vision system (HVS). This book mainly...

Yu-** Zhang in 3-D Computer Vision

Chapter 2023

Prompt learning in computer vision: a survey

Prompt learning has attracted broad attention in computer vision since the large pre-trained vision-language models (VLMs) exploded. Based on the...

Yiming Lei, **gqi Li, ... Hongming Shan in Frontiers of Information Technology & Electronic Engineering

Article 01 January 2024

Review of vision-based reinforcement learning for drone navigation

In recent years, Unmanned aerial vehicles (UAVs) have witnessed a surge in popularity and implementation for both civilian and military usage. UAVs...

Anas Aburaya, Hazlina Selamat, Mohd Taufiq Muslim in International Journal of Intelligent Robotics and Applications

Article 28 June 2024

Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation

Data augmentation (DA) is an effective way to improve the performance of deep networks. Unfortunately, current methods are mostly developed for...

Namhyuk Ahn, Jaejun Yoo, Kyung-Ah Sohn in International Journal of Computer Vision

Article 05 January 2024

BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision

Single object tracking (SOT) is a fundamental problem in computer vision, with a wide range of applications, including autonomous driving, augmented...

**n Zhao, Shiyu Hu, ... Jiadong Li in International Journal of Computer Vision

Article 02 December 2023

Vision Transformers with Hierarchical Attention

This paper tackles the high computational/space complexity associated with multi-head self-attention (MHSA) in vanilla vision transformers. To this...

Yun Liu, Yu-Huan Wu, ... Luc Van Gool in Machine Intelligence Research

Article Open access 19 April 2024

A Vision Enhancement and Feature Fusion Multiscale Detection Network

In the field of object detection, there is often a high level of occlusion in real scenes, which can very easily interfere with the accuracy of the...

Chengwu Qian, Jiangbo Qian, ... Caiming Zhong in Neural Processing Letters

Article Open access 07 February 2024

Deep learning and computer vision approach - a vision transformer based classification of fruits and vegetable diseases (DLCVA-FVDC)

As technology progresses, automation gains importance. The automation might be in a large-scale industry with more employees and heavy capital...

Deepak N. A. in Multimedia Tools and Applications

Article 06 March 2024

Universal Object Detection with Large Vision Model

Over the past few years, there has been growing interest in develo** a broad, universal, and general-purpose computer vision system. Such systems...

Feng Lin, Wenze Hu, ... **aoyu Wang in International Journal of Computer Vision

Article 07 November 2023

Fast and intelligent measurement of concrete aggregate volume based on monocular vision map**

In order to prevent the abnormal appearance of gravel aggregate material level in the mixing plant, improve the safety of the concrete mixing plant...

Yingjie Liu, Shuang Yue, ... Linjian Shangguan in Journal of Real-Time Image Processing

Article 30 August 2023

Computer Vision

In recent years, one of the most transformative subfields of machine learning has been computer vision. With substantial breakthroughs in the early...

Blaž Škrlj in From Unimodal to Multimodal Machine Learning

Chapter 2024

Vision transformer models for mobile/edge devices: a survey

With the rapidly growing demand for high-performance deep learning vision models on mobile and edge devices, this paper emphasizes the importance of...

Seung Il Lee, Kwanghyun Koo, ... Hyun Kim in Multimedia Systems

Article 01 April 2024

EDFIDepth: enriched multi-path vision transformer feature interaction networks for monocular depth estimation

Monocular depth estimation (MDE) aims to predict pixel-level dense depth maps from a single RGB image. Some recent approaches mainly rely on...

Chenxing **a, Mengge Zhang, ... **ngzhu Liang in The Journal of Supercomputing

Article 05 June 2024

An Outlook into the Future of Egocentric Vision

What will the future be? We wonder! In this survey, we explore the gap between current research in egocentric vision and the ever-anticipated future,...

Chiara Plizzari, Gabriele Goletto, ... Tatiana Tommasi in International Journal of Computer Vision

Article Open access 28 May 2024

Masked Vision-language Transformer in Fashion

We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation. Technically, we simply utilize the vision...

Ge-Peng Ji, Mingchen Zhuge, ... Luc Van Gool in Machine Intelligence Research

Article Open access 27 February 2023

Underwater image enhancement using lightweight vision transformer

Deep learning-based models have recently shown a strong potential in Underwater Image Enhancement (UIE) that are satisfying and have the right colors...

Muneeba Daud, Hammad Afzal, Khawir Mahmood in Multimedia Tools and Applications

Article 19 February 2024

When details are difficult to portray: enriching vision videos

The creation of a shared understanding of the project vision of all relevant stakeholders is vital to the requirements engineering process. One way...

Lukas Nagel, Melanie Schmedes, ... Kurt Schneider in Requirements Engineering

Article Open access 05 September 2023

Bioinspired sensing-memory-computing integrated vision systems: biomimetic mechanisms, design principles, and applications

With the explosion of sensory data in the Internet of Things (IoT) era, conventional machine vision systems are becoming increasingly difficult to...

Yujie Huang, Yinlong Tan, ... Tian Jiang in Science China Information Sciences

Article 23 April 2024

Search

Filters

Search Results

Search

Navigation