![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Article
Research on water level measurement technology based on the residual length ratio of image characters
Aiming at the low efficiency and poor adaptability of traditional water level measurement methods, a water level measurement technology based on the residual length ratio of image characters is proposed in thi...
-
Article
OV-DAR: Open-Vocabulary Object Detection and Attributes Recognition
In this paper, we endeavor to localize all potential objects in an image and infer their visual categories, attributes, and shapes, even in instances where certain objects have not been encompassed in the mode...
-
Article
OV-VIS: Open-Vocabulary Video Instance Segmentation
Conventionally, the goal of Video Instance Segmentation (VIS) is to segment and categorize objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categor...
-
Article
Map** of sand and gravel aggregate level height and volume measurement based on contour map** generation
In order to prevent the abnormal appearance of sand and gravel aggregate level in the concrete mixing plant, and improve the safety of the concrete mixing plant system as well as the efficient and high-quality...
-
Article
Optimization analysis of football match prediction model based on neural network
How to build a football match prediction model and use scientific methods to solve the prediction problem has become a key point in the application of artificial intelligence in the sports industry. In this pa...
-
Chapter and Conference Paper
Stacked Sparse Autoencoder for Audio Object Coding
Compared with channel-based audio coding, the object-based audio coding has a definite advantage in meeting the user’s demands of personalized control. However, in the conventional Spatial Audio Object Coding ...
-
Chapter and Conference Paper
EMRM: Enhanced Multi-source Review-Based Model for Rating Prediction
Rating prediction, whose goal is to predict user preference for unconsumed items, has become one of the core tasks in recommendation systems. Recently, many deep learning-based methods have been applied to the...
-
Chapter and Conference Paper
Synthesizing Large-Scale Datasets for License Plate Detection and Recognition in the Wild
License Plate Detection and Recognition (LPDR) plays a key role in modern intelligent transportation systems. Recent state-of-the-art methods of LPDR are based on deep convolutional neural networks (DCNN), whi...
-
Chapter and Conference Paper
Multi-step Coding Structure of Spatial Audio Object Coding
The spatial audio object coding (SAOC) is an effective meth-od which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal ...
-
Chapter and Conference Paper
Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet
When using a loudspeaker triplet for virtual sound localization, the traditional conversion method will result in inaccurate localization. In this paper, we constructed a perceptual localization distortion mod...
-
Chapter and Conference Paper
HMM-Based Person Re-identification in Large-Scale Open Scenario
This paper aims to tackle person re-identification (person re-ID) in large-scale open scenario, which differs from the conventional person re-ID tasks but is significant for some real suspect investigation ca...
-
Chapter and Conference Paper
HRTF Representation with Convolutional Auto-encoder
The head-related transfer function (HRTF) can be considered as some kind of filter that describes how a sound from an arbitrary spatial direction transfers to the listener’s eardrums. HRTF can be used to synth...
-
Chapter and Conference Paper
Few-Shot Semantic Segmentation with Democratic Attention Networks
Few-shot segmentation has recently generated great popularity, addressing the challenging yet important problem of segmenting objects from unseen categories with scarce annotated support images. The crux of fe...
-
Chapter and Conference Paper
A Novel Ensemble Approach for Click-Through Rate Prediction Based on Factorization Machines and Gradient Boosting Decision Trees
Click-Through Rate (CTR) prediction is a significant technique in the field of computational advertising, its accuracy directly affects companies profits and user experience. Achieving great ability of general...
-
Chapter and Conference Paper
Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model
Speech intelligibility enhancement is extremely meaningful for successful speech communication in noisy environments. Several methods based on Lombard effect are used to increase intelligibility. In those meth...
-
Chapter and Conference Paper
The Analysis for Binaural Signal’s Characteristics of a Real Source and Corresponding Virtual Sound Image
3D Audio System could rebuild more realistic and immersive sound effects. The existing 3D audio reconstruction methods mainly consider the physical characteristics of sound filed, less take head’s effect on so...
-
Chapter and Conference Paper
Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks
Speech intelligibility is a significant factor for successful speech communication. To enhance the intelligibility, many methods have been proposed, mainly by operating the speech signal such as increasing the...
-
Chapter and Conference Paper
A Splicing Interpolation Method for Head-Related Transfer Function
We proposed a new head-related transfer function (HRTF) interpolation method based on splicing. The sound spreads from sound source to listener’s ears, the wave of head-related impulse response (HRIR) clearly ...
-
Chapter and Conference Paper
An Efficient Method Using the Parameterized HRTFs for 3D Audio Real-Time Rendering on Mobile Devices
3D audio real-time rendering is of great importance for virtual reality (VR) application, especially on mobile devices. However, the limited computational power makes it hard to implement fast generation of sp...
-
Chapter and Conference Paper
Head Related Transfer Function Interpolation Based on Aligning Operation
Head related transfer function (HRTF) is the main technique of binaural synthesis, which is used to reconstruct spatial sound image, and the HRTF data only can be obtained by measurement. A high resolution HRT...