Search Results
-
Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge
The vast amount of information stored in audio repositories makes it necessary to develop efficient and automatic methods to search on audio...
-
Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is...
-
Sound field reconstruction using neural processes with dynamic kernels
Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. In...
-
Automatic classification of the physical surface in sound uroflowmetry using machine learning methods
This work constitutes the first approach for automatically classifying the surface that the voiding flow impacts in non-invasive sound uroflowmetry...
-
Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources
Speech synthesis has made significant strides thanks to the transition from machine learning to deep learning models. Contemporary text-to-speech...
-
Vulnerability issues in Automatic Speaker Verification (ASV) systems
Claimed identities of speakers can be verified by means of automatic speaker verification (ASV) systems, also known as voice biometric systems....
-
Blind extraction of guitar effects through blind system inversion and neural guitar effect modeling
Audio effects are a ubiquitous tool in music production due to the interesting ways in which they can shape the sound of music. Guitar effects, the...
-
Sub-convolutional U-Net with transformer attention network for end-to-end single-channel speech enhancement
Recent advancements in deep learning-based speech enhancement models have extensively used attention mechanisms to achieve state-of-the-art methods...
-
Acoustical feature analysis and optimization for aesthetic recognition of Chinese traditional music
Chinese traditional music, a vital expression of Chinese cultural heritage, possesses both a profound emotional resonance and artistic allure. This...
-
Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder
Speech coding is a method to reduce the amount of data needed to represent speech signals by exploiting the statistical properties of the speech...
-
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density
Melody harmonization, which involves generating a chord progression that complements a user-provided melody, continues to pose a significant...
-
Neural electric bass guitar synthesis framework enabling attack-sustain-representation-based technique control
Musical instrument sound synthesis (MISS) often utilizes a text-to-speech framework because of its similarity to speech in terms of generating sounds...
-
Significance of relative phase features for shouted and normal speech classification
Shouted and normal speech classification plays an important role in many speech-related applications. The existing works are often based on...
-
Deep semantic learning for acoustic scene classification
Acoustic scene classification (ASC) is the process of identifying the acoustic environment or scene from which an audio signal is recorded. In this...
-
Online distributed waveform-synchronization for acoustic sensor networks with dynamic topology
Acoustic sensing by multiple devices connected in a wireless acoustic sensor network (WASN) creates new opportunities for multichannel signal...
-
Lightweight target speaker separation network based on joint training
Target speaker separation aims to separate the speech components of the target speaker from mixed speech and remove extraneous components such as...
-
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
The task of bandwidth extension addresses the generation of missing high frequencies of audio signals based on knowledge of the low-frequency part of...
-
Piano score rearrangement into multiple difficulty levels via notation-to-notation approach
Musical score rearrangement is an emerging area in symbolic music processing, which aims to transform a musical score into a different style. This...