Search
Search Results
-
Hyperbolic Deep Learning in Computer Vision: A Survey
Deep representation learning is a ubiquitous part of modern computer vision. While Euclidean space has been the de facto standard manifold for...
-
Simulating vision impairment in virtual reality: a comparison of visual task performance with real and simulated tunnel vision
In this work, we explore the potential and limitations of simulating gaze-contingent tunnel vision conditions using Virtual Reality (VR) with...
-
Computer Vision Overview
Computer vision is an information subject/discipline that uses computers to realize the functions of human vision system (HVS). This book mainly... -
Prompt learning in computer vision: a survey
Prompt learning has attracted broad attention in computer vision since the large pre-trained vision-language models (VLMs) exploded. Based on the...
-
Review of vision-based reinforcement learning for drone navigation
In recent years, Unmanned aerial vehicles (UAVs) have witnessed a surge in popularity and implementation for both civilian and military usage. UAVs...
-
Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation
Data augmentation (DA) is an effective way to improve the performance of deep networks. Unfortunately, current methods are mostly developed for...
-
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision
Single object tracking (SOT) is a fundamental problem in computer vision, with a wide range of applications, including autonomous driving, augmented...
-
Vision Transformers with Hierarchical Attention
This paper tackles the high computational/space complexity associated with multi-head self-attention (MHSA) in vanilla vision transformers. To this...
-
A Vision Enhancement and Feature Fusion Multiscale Detection Network
In the field of object detection, there is often a high level of occlusion in real scenes, which can very easily interfere with the accuracy of the...
-
Deep learning and computer vision approach - a vision transformer based classification of fruits and vegetable diseases (DLCVA-FVDC)
As technology progresses, automation gains importance. The automation might be in a large-scale industry with more employees and heavy capital...
-
Universal Object Detection with Large Vision Model
Over the past few years, there has been growing interest in develo** a broad, universal, and general-purpose computer vision system. Such systems...
-
Fast and intelligent measurement of concrete aggregate volume based on monocular vision map**
In order to prevent the abnormal appearance of gravel aggregate material level in the mixing plant, improve the safety of the concrete mixing plant...
-
Computer Vision
In recent years, one of the most transformative subfields of machine learning has been computer vision. With substantial breakthroughs in the early... -
Vision transformer models for mobile/edge devices: a survey
With the rapidly growing demand for high-performance deep learning vision models on mobile and edge devices, this paper emphasizes the importance of...
-
EDFIDepth: enriched multi-path vision transformer feature interaction networks for monocular depth estimation
Monocular depth estimation (MDE) aims to predict pixel-level dense depth maps from a single RGB image. Some recent approaches mainly rely on...
-
An Outlook into the Future of Egocentric Vision
What will the future be? We wonder! In this survey, we explore the gap between current research in egocentric vision and the ever-anticipated future,...
-
Masked Vision-language Transformer in Fashion
We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation. Technically, we simply utilize the vision...
-
Underwater image enhancement using lightweight vision transformer
Deep learning-based models have recently shown a strong potential in Underwater Image Enhancement (UIE) that are satisfying and have the right colors...
-
When details are difficult to portray: enriching vision videos
The creation of a shared understanding of the project vision of all relevant stakeholders is vital to the requirements engineering process. One way...
-
Bioinspired sensing-memory-computing integrated vision systems: biomimetic mechanisms, design principles, and applications
With the explosion of sensory data in the Internet of Things (IoT) era, conventional machine vision systems are becoming increasingly difficult to...