Search
Search Results
-
Scanning dial: the instantaneous audio classification transformer
A number of remarkable accomplishments have been achieved in the field of audio classification using algorithms based on Transformers in recent...
-
Mel-Frequency-based Feature Analysis of Audio Signals in the Context of Holy Quran Recitation
Different sounds have various effects on human health, and by introducing the ones that are therapeutic, a healing environment can be created. This...
-
Shallow and deep feature fusion for digital audio tampering detection
Digital audio tampering detection can be used to verify the authenticity of digital audio. However, most current methods use standard electronic...
-
Predominant audio source separation in polyphonic music
Predominant source separation is the separation of one or more desired predominant signals, such as voice or leading instruments, from polyphonic...
-
Multi-rate modulation encoding via unsupervised learning for audio event detection
Technologies in healthcare, smart homes, security, ecology, and entertainment all deploy audio event detection (AED) in order to detect sound events...
-
Diagnosis of Parkinson's Disease Using Convolutional Neural Network-Based Audio Signal Processing on FPGA
This study proposes a new method for diagnosing Parkinson's disease using audio signals and FPGA-based convolutional neural networks. The proposed...
-
Transfer Learning for Speaker Verification with Short-Duration Audio
Speaker verification or identifying the legitimacy of a speaker's identification from their voice is a fundamental problem in speech processing and... -
Acoustic domain mismatch compensation in bird audio detection
Detecting bird calls in audio is an important task for automatic wildlife monitoring, as well as in citizen science and audio library management....
-
An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
The domain of spatial audio comprises methods for capturing, processing, and reproducing audio content that contains spatial information. Data-based...
-
Classification of audio signals using spectrogram surfaces and extrinsic distortion measures
Representation of one-dimensional (1D) signals as surfaces and higher-dimensional manifolds reveals geometric structures that can enhance assessment...
-
A 4\(\mu\)W Low-Power Audio Processor System for Real-Time Jaw Movements Recognition in Grazing Cattle
Precision livestock farming consists of technological tools and techniques to improve livestock management. Proper detection and classification of...
-
“Seeing Sound”: Audio Classification Using the Wigner-Ville Distribution and Convolutional Neural Networks
With big data becoming increasingly available, IoT hardware becoming widely adopted, and AI capabilities becoming more powerful, organizations are... -
Towards Analog Implementation of Spiking Neural Networks for Audio Signals
This publication presents a novel approach to the training and deploying Spiking Neural Networks on analog hardware platforms. We proposed a scheme... -
Classifying Audio Music Genres Using a Multilayer Sequential Model
In this paper, we discuss applying a neural network model to classify a dataset of type audio. We used a GTZAN dataset that includes different audio... -
Bi-level Acoustic Scene Classification Using Lightweight Deep Learning Model
Identifying a scene based on the environment in which the related audio is recorded is known as acoustic scene classification (ASC). In this paper, a...
-
Fault Detection and Classification in Automobile Engine Based on Its Audio Signature Using Support Vector Machine
Numerous attempts have been made in recent years for detecting various faults in an automobile engine. The aim of develo** this technique is to... -
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
The task of bandwidth extension addresses the generation of missing high frequencies of audio signals based on knowledge of the low-frequency part of...
-
DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection
Spoofed speeches are becoming a big threat to society due to advancements in artificial intelligence techniques. Therefore, there must be an...
-
Audio Content-Based Framework for Emotional Music Recognition
Music is a language of emotions and music emotional recognition has been addressed by different disciplines (e.g., psychology, cognitive science and... -
A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform
Time–frequency analysis (TFA) is a powerful tool for signal feature representation. In the time–frequency plane, the primary data properties are...