![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
149 Result(s)
-
Article
A new virtual interpolation technology with range as object
Virtual interpolation technology can be applied to direction-of-arrival (DOA) estimation as a preprocessing technique to achieve the DOA estimation for any array. In order to solve the angle-sensitive problem ...
-
Article
A full-detection association tracker with confidence optimization for real-time multi-object tracking
Multi-object tracking (MOT) aims to obtain trajectories with unique identifiers for multiple objects in a video stream. In current approaches, confidence thresholds were frequently used to perform multi-stage ...
-
Article
DML-YOLOv8-SAR image object detection algorithm
Given the challenges posed by noise and varying target scales in SAR images, conventional convolutional neural networks often underperform in SAR image detection. To address this, this paper introduces a novel...
-
Article
Using improved YOLO V5s to recognize tomatoes in a continuous working environment
In the continuous working environment of the picking robots, factors such as illumination change, camera hardware, the movement of the picking robots, and image background interference have a great impact on t...
-
Article
A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation
Convolutional Neural Networks (CNNs) have been widely used in various fields due to their high accuracy and efficiency. The performance of CNNs is mainly affected by the computing capability, memory bandwidth,...
-
Article
Shuff-BiseNet: a dual-branch segmentation network for pavement cracks
In order to accurately obtain the shape and size of pavement cracks, analyze the severity of pavement cracks, avoid deterioration of the situation, and take timely measures, we proposed a dual-branch structure...
-
Article
Research on image caption generation method based on multi-modal pre-training model and text mixup optimization
In recent years, multi-modal pre-training models have demonstrated remarkable cross-modal representation capabilities, catalyzing the rapid evolution of multi-modal downstream tasks, particularly in image capt...
-
Article
YOLO-MTG: a lightweight YOLO model for multi-target garbage detection
With wide adoption of deep learning technology in AI, intelligent garbage detection has become a hot research topic. However, existing datasets currently used for garbage detection rarely involves multi-catego...
-
Article
An effective masked transformer network for image denoising
The rising popularity of employing deep learning networks for image denoising can be observed over the past decade. Typically, their exceptional performance is rooted in their ability to learn the map** from...
-
Article
Human risky behaviour recognition during ladder climbing based on multi-modal feature fusion and adaptive graph convolutional network
Human falls during ladder climbing are typically instantaneous, making the timely and accurate determination of security risks during ladder climbing a challenging engineering issue. A skeleton-based behaviour...
-
Article
Boosting image denoising effect via low-level noise injection
In the past decade, supervised denoising models trained on large datasets have demonstrated impressive performance in image denoising due to their superior denoising effect. However, these models lack flexibil...
-
Article
Accurate and real-time visual detection algorithm for environmental perception of USVS under all-weather conditions
Owing to the intricate and ever-changing nature of the marine environment, traditional marine survey methods are subject to numerous limitations. Unmanned surface vehicles (USVs) have gained significant popula...
-
Article
Particle recognition and shape parameter detection based on deep learning
The size and shape parameters of sand particles are closely related to their geophysical and geomechanical properties. It is challenging to accurately identify sand particles and calculate their shape paramete...
-
Chapter and Conference Paper
Semi-End-to-End Nested Named Entity Recognition from Speech
There are two approaches for Named Entity Recognition (NER) from speech: two-step pipeline and End-to-End (E2E). In the pipeline approach, cascading errors are inevitable. In the E2E approach, its annotation m...
-
Chapter and Conference Paper
IvyGPT: InteractiVe Chinese Pathway Language Model in Medical Domain
General large language models (LLMs) such as ChatGPT have shown remarkable success. However, such LLMs have not been widely adopted for medical purposes, due to poor accuracy and inability to provide medical a...
-
Chapter and Conference Paper
Detecting Software Vulnerabilities Based on Hierarchical Graph Attention Network
Detecting software vulnerabilities is a crucial part of software security. At present, the most commonly used methods are to train supervised classification or regression models from the source code to detect ...
-
Chapter and Conference Paper
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification
Metric learning aims to pull together the samples belonging to the same class and push apart those from different classes in embedding space. Existing methods may suffer from inadequate and low-quality sample ...
-
Chapter and Conference Paper
TST: Time-Sparse Transducer for Automatic Speech Recognition
End-to-end model, especially Recurrent Neural Network Transducer (RNN-T), has achieved great success in speech recognition. However, transducer requires a great memory footprint and computing time when process...
-
Chapter and Conference Paper
YueGraph: A Prototype for Yue Opera Lineage Review Based on Knowledge Graph
Yue opera, as one of the representatives of China’s intangible cultural heritage, embodies a profound regional history and folk art. This paper focuses on utilizing knowledge graphs to promote research and pre...
-
Chapter and Conference Paper
MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification
Video super-resolution is a pivotal task that involves the recovery of high-resolution video frames from their low-resolution counterparts, possessing a multitude of applications in real-world scenarios. Withi...