Search
Search Results
-
Impact of LiDAR point cloud compression on 3D object detection evaluated on the KITTI dataset
The rapid growth on the amount of generated 3D data, particularly in the form of Light Detection And Ranging (LiDAR) point clouds (PCs), poses very...
-
Remote expert viewing, laboratory tests or objective metrics: which one(s) to trust?
We present a study on the validity of quality assessment in the context of the development of visual media coding schemes. The work is motivated by...
-
Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression
The recent rise in interest in point clouds as an imaging modality has motivated standardization groups such as JPEG and MPEG to launch activities...
-
Adaptive bridge model for compressed domain point cloud classification
The recent adoption of deep learning-based models for the processing and coding of multimedia signals has brought noticeable gains in performance,...
-
Learning-based light field imaging: an overview
Conventional photography can only provide a two-dimensional image of the scene, whereas emerging imaging modalities such as light field enable the...
-
Cartoon copyright recognition method based on character personality action
Aiming at the problem of cartoon piracy and plagiarism, this paper proposes a method of cartoon copyright recognition based on character personality...
-
4AC-YOLOv5: an improved algorithm for small target face detection
In real scenes, small target faces often encounter various conditions, such as intricate background, occlusion and scale change, which leads to the...
-
Analysis of thermal videos for detection of lie during interrogation
The lie-detection tests are traditionally carried out by well-trained experts using polygraph machines. However, it is time-consuming, invasive, and,...
-
Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approach
This contribution presents the TOMIE framework (Tracking Of Multiple Industrial Entities), a framework for the continuous tracking of industrial...
-
Fast CU size decision and intra-prediction mode decision method for H.266/VVC
H.266/Versatile Video Coding (VVC) is the most recent video coding standard developed by the Joint Video Experts Team (JVET). The quad-tree with...
-
Assessment framework for deepfake detection in real-world situations
Detecting digital face manipulation in images and video has attracted extensive attention due to the potential risk to public trust. To counteract...
-
Edge-aware nonlinear diffusion-driven regularization model for despeckling synthetic aperture radar images
Speckle noise corrupts synthetic aperture radar (SAR) images and limits their applications in sensitive scientific and engineering fields. This...
-
Secure image transmission through LTE wireless communications systems
Secure transmission of images over wireless communications systems can be done using RSA, the most known and efficient cryptographic algorithm, and...
-
Multimodal few-shot classification without attribute embedding
Multimodal few-shot learning aims to exploit complementary information inherent in multiple modalities for vision tasks in low data scenarios. Most...
-
An optimized capsule neural networks for tomato leaf disease classification
Plant diseases have a significant impact on leaves, with each disease exhibiting specific spots characterized by unique colors and locations....
-
Multi-layer features template update object tracking algorithm based on SiamFC++
SiamFC++ only extracts the object feature of the first frame as a tracking template, and only uses the highest level feature maps in both the...
-
Handbook of Face Recognition
The history of computer-aided face recognition dates to the 1960s, yet the problem of automatic face recognition – a task that humans perform...
-
Considerations and Challenges
As with any new technology, continuous biometric authentication systems have a variety of considerations and challenges that must be addressed before... -
Anwendung von Wavelet-Zerlegung und maschinellem Lernen für die sEMG-Signalbasierte Gestenerkennung
Amputierte auf der ganzen Welt haben begrenzten Zugang zu hochwertigen intelligenten Prothesen. Die korrekte Erkennung von Gesten ist eine der... -
Einführung in nicht-invasive biomedizinische Signale für die Gesundheitsversorgung
Mit dem Fortschritt der medizinischen Wissenschaft wurden neue Gesundheitsmethoden eingeführt. Biomedizinische Signale haben uns einen tiefen...