![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
1,503 Result(s)
-
Article
A new virtual interpolation technology with range as object
Virtual interpolation technology can be applied to direction-of-arrival (DOA) estimation as a preprocessing technique to achieve the DOA estimation for any array. In order to solve the angle-sensitive problem ...
-
Article
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers
Unsupervised cross-modal hashing (UCMH) has been commonly explored to support large-scale cross-modal retrieval of unlabeled data. Despite promising progress, most existing approaches are developed on convolut...
-
Article
Softmax-Free Linear Transformers
Vision transformers (ViTs) have pushed the state-of-the-art for visual perception tasks. The self-attention mechanism underpinning the strength of ViTs has a quadratic complexity in both computation and memory...
-
Article
Does Confusion Really Hurt Novel Class Discovery?
When sampling data of specific classes (i.e., known classes) for a scientific task, collectors may encounter unknown classes (i.e., novel classes). Since these novel classes might be valuable for future research,...
-
Article
A full-detection association tracker with confidence optimization for real-time multi-object tracking
Multi-object tracking (MOT) aims to obtain trajectories with unique identifiers for multiple objects in a video stream. In current approaches, confidence thresholds were frequently used to perform multi-stage ...
-
Article
DML-YOLOv8-SAR image object detection algorithm
Given the challenges posed by noise and varying target scales in SAR images, conventional convolutional neural networks often underperform in SAR image detection. To address this, this paper introduces a novel...
-
Article
Using improved YOLO V5s to recognize tomatoes in a continuous working environment
In the continuous working environment of the picking robots, factors such as illumination change, camera hardware, the movement of the picking robots, and image background interference have a great impact on t...
-
Article
Multi-scale deep echo state network for time series prediction
Echo state network (ESN) has widely attracted many researchers due to its training process without backpropagation. However, it is hard for single ESN to fit those complex and polytrophic situations. Under thi...
-
Article
Quasisynchronization of reaction-diffusion neural networks with time-varying delays by static/dynamic event-triggered control and its application to secure communication
This paper studies the quasisynchronization problems of reaction-diffusion neural networks (RDNNs) with time-varying delays via event-triggered control. Firstly, a static event-triggered mechanism and a dynami...
-
Article
Preference detection of the humanoid robot face based on EEG and eye movement
The face of a humanoid robot can affect the user experience, and the detection of face preference is particularly important. Preference detection belongs to a branch of emotion recognition that has received mu...
-
Article
CSGAT-Net: a conditional pedestrian trajectory prediction network based on scene semantic maps and spatiotemporal graph attention
Pedestrian behavior exhibits high levels of dynamism, and pedestrian trajectories are influenced not only by the pedestrians themselves, but also by interactions with surrounding objects. Efficiently understan...
-
Article
MFMANet: a multispectral pedestrian detection network using multi-resolution RGB feature reuse with multi-scale FIR attentions
In the realm of multispectral pedestrian detection, especially under challenging low-illumination, the existing methods, characterized by cross-modality feature interaction, lack generalization and are hard to...
-
Article
Diff-Font: Diffusion Model for Robust One-Shot Font Generation
Font generation presents a significant challenge due to the intricate details needed, especially for languages with complex ideograms and numerous characters, such as Chinese and Korean. Although various few-s...
-
Article
meTMQI: multi-task and exposure-prior learning for Tone-Mapped Quality Index
With limited dynamic range in consumer-level photographs and electronic displays, high dynamic range images can be rendered as the standard dynamic range image by tone map** algorithms. To quantify the disto...
-
Article
A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation
Convolutional Neural Networks (CNNs) have been widely used in various fields due to their high accuracy and efficiency. The performance of CNNs is mainly affected by the computing capability, memory bandwidth,...
-
Article
DEAR: a novel deep-level semantics feature reinforce framework for Infrared Small Object Segmentation
Infrared Small Object Segmentation (ISOS) faces challenges in isolating small and faint objects from infrared images due to their limited texture details and small spatial presence. Existing deep learning meth...
-
Article
Retraction Note: A streak detection approach for comprehensive two-dimensional gas chromatography based on image analysis
-
Article
Grounded Affordance from Exocentric View
Affordance grounding aims to locate objects’ “action possibilities” regions, an essential step toward embodied intelligence. Due to the diversity of interactive affordance, i.e., the uniqueness of different indiv...
-
Article
Shuff-BiseNet: a dual-branch segmentation network for pavement cracks
In order to accurately obtain the shape and size of pavement cracks, analyze the severity of pavement cracks, avoid deterioration of the situation, and take timely measures, we proposed a dual-branch structure...
-
Article
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias
Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still ...