-
Article
Research on water level measurement technology based on the residual length ratio of image characters
Aiming at the low efficiency and poor adaptability of traditional water level measurement methods, a water level measurement technology based on the residual length ratio of image characters is proposed in thi...
-
Article
Multi-granularity hypergraph-guided transformer learning framework for visual classification
Fine-grained single-label classification tasks aim to distinguish highly similar categories but often overlook inter-category relationships. Hierarchical multi-granularity visual classification strives to cate...
-
Article
OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition
In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...
-
Article
OV-VIS: Open-Vocabulary Video Instance Segmentation
Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...
-
Article
SCDet: decoupling discriminative representation for dark object detection via supervised contrastive learning
Despite the significant progress made in object detection algorithms, their potential to operate effectively under the low-light environment remains to be fully explored. Recent methods realize dark object det...
-
Article
Mapping of sand and gravel aggregate level height and volume measurement based on contour mapping generation
In order to prevent the abnormal appearance of sand and gravel aggregate level in the concrete mixing plant, and improve the safety of the concrete mixing plant system as well as the efficient and high-quality...
-
Article
Multitask learning for image translation and salient object detection from multimodal remote sensing images
This paper presents a novel and efficient multitask learning framework for image translation and saliency detection from remote sensing images, which mainly contains the image translation network-weight sharin...
-
Article
A hybrid style transfer with whale optimization algorithm model for textual adversarial attack
Deep learning has been widely used in various research fields. However, researchers have discovered that deep learning models are vulnerable to adversarial attacks. Existing word-level attacks can be seen as a...
-
Article
Reinforcement learning from constraints and focal entity shifting in conversational KGQA
The actual needs of users for information are often hidden in multiple question answering (QA) on the same topic. In order to generate answers to users’ current questions, a conversational QA system relies on ...
-
Article
ACX-UNet: a multi-scale lung parenchyma segmentation study with improved fusion of skip connection and circular cross-features extraction
Convolutional neural networks (CNN) are widely used in the field of computer-aided diagnosis of lung diseases. Its main tasks are segmentation of lung parenchyma, lung nodule detection and lesion analysis. Amo...
-
Chapter and Conference Paper
End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search
Streaming keyword spotting (KWS) is an important technique for voice assistant wake-up. While KWS with a preset fixed keyword has been well studied, test-time customizable keyword spotting in streaming mode re...
-
Chapter and Conference Paper
3RE-Net: Joint Loss-REcovery and Super-REsolution Neural Network for REal-Time Video
Real-time video over the Internet suffers from packet loss and low network bandwidth. The receiving side may receive down-sampled video with damaged frames. In this work, we are motivated to enhance the qualit...
-
Article
MVHANet: multi-view hierarchical aggregation network for skeleton-based hand gesture recognition
Skeleton-based gesture recognition (SHGR) is a very challenging task due to the complex articulated topology of hands. Previous works often learn hand characteristics from a single observation viewpoint. Howev...
-
Article
Open Access
Classification of barely visible impact damage in composite laminates using deep learning and pulsed thermographic inspection
With the increasingly comprehensive utilisation of Carbon Fibre-Reinforced Polymers (CFRP) in modern industry, defects detection and characterisation of these materials have become very important and draw sign...
-
Article
Toward visual quality enhancement of dehazing effect with improved Cycle-GAN
Image dehazing is a fundamental problem in computer vision. However, GT images for supervised dehazing network training are virtually impossible to obtain in the real world. Therefore, unsupervised image dehaz...
-
Article
Open Access
A machine learning-based clustering approach to diagnose multi-component degradation of aircraft fuel systems
Accurate fault diagnosis and prognosis can significantly reduce maintenance costs, increase the safety and availability of engineering systems that have become increasingly complex. It has been observed that v...
-
Chapter and Conference Paper
Preliminary Experiment for Measuring the Anxiety Level Using Heart Rate Variability
Anxiety is one of the most significant health issues. Generally, there are four levels of anxiety: mild anxiety, moderate anxiety, severe anxiety, and panic level anxiety.
-
Chapter and Conference Paper
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report
Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been pro...
-
Chapter and Conference Paper
Semantic Enhancement Framework for Robust Speech Recognition
Auto speech recognition (ASR) has been widely used in dialogue systems of various domains, performing as a crucial part of technology. Since the output of the ASR system will provide input to the subsequent sy...
-
Chapter and Conference Paper
A Fast Stain Normalization Network for Cervical Papanicolaou Images
The domain shift between different styles of stain images greatly challenges the generalization of computer-aided diagnosis (CAD) algorithms. To bridge the gap, color normalization is a prerequisite for most C...