Search
Search Results
-
Emotional voice conversion using DBiLSTM-NN with MFCC and LogF0 features
Emotional voice conversion(EVC) aims to convert the speaker’s voice from one emotion state to another without changing the speaker and the voice...
-
WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion
Voice conversion (VC) is a task for changing the speech of a source speaker to the target voice while preserving linguistic information of the source...
-
MelMAE-VC: Extending Masked Autoencoders to Voice Conversion
Voice conversion is a technique that generates speeches with text contents identical to source speeches and timbre features similar to reference... -
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Voice conversion is a method that allows for the transformation of speaking style while maintaining the integrity of linguistic information. There... -
Improving Voice Style Conversion via Self-attention VAE with Feature Disentanglement
Voice conversion (VC) is a widely used technique in intelligent speech processing, that aims to modify the speaker’s information while preserving the... -
Boosting StarGANs for Voice Conversion with Contrastive Discriminator
Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been widely applied in many scenarios. However, the training of these... -
Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Voice conversion aims to convert source speech into a target voice using recordings of the target speaker as a reference. Newer models are producing... -
Zero-Shot Singing Voice Conversion Based on Timbre Space Modeling and Excitation Signal Control
In recent years, singing voice conversion technology has rapidly advanced and is capable of generating high-quality singing voices. However,... -
VC-AUG: Voice Conversion Based Data Augmentation for Text-Dependent Speaker Verification
In this paper, we focus on improving the performance of the text-dependent speaker verification system in the scenario of limited training data. The... -
Voice Conversion Using Learnable Similarity-Guided Masked Autoencoder
Voice conversion (VC) is an important voice forgery method that poses a serious threat to personal privacy protection, especially with remarkable... -
Analysis of Mandarin vs English Language for Emotional Voice Conversion
Emotional Voice Conversion (EVC) is a method to convert the emotional state of an utterance to another without changing the linguistic information... -
Battling voice spoofing: a review, comparative analysis, and generalizability evaluation of state-of-the-art voice spoofing counter measures
With the advent of automated speaker verification (ASV) systems comes an equal and opposite development: malicious actors may seek to use voice...
-
Voice conversion spoofing detection by exploring artifacts estimates
Automatic speaker verification or voice biometrics is an approach to verify the person’s claimed identity through his/her voice. Voice biometrics...
-
Voice spoofing countermeasure for voice replay attacks using deep learning
In our everyday lives, we communicate with each other using several means and channels of communication, as communication is crucial in the lives of...
-
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream... -
Detection of Voice Conversion Spoofing Attacks Using Voiced Speech
Speech consists of voiced and unvoiced segments that differ in their production process and exhibit different characteristics. In this paper, we... -
Region Normalized Capsule Network Based Generative Adversarial Network for Non-parallel Voice Conversion
Voice conversion (VC) involves altering the vocal characteristics of a source speaker to resemble those of a target speaker while maintaining the... -
Voice Privacy Using Time-Scale and Pitch Modification
There is a growing demand toward digitization of various day-to-day work and hence, there is a surge in use of Intelligent Personal Assistants. The...
-
Physiological-physical feature fusion for automatic voice spoofing detection
Biometric speech recognition systems are often subject to various spoofing attacks, the most common of which are speech synthesis and speech...
-
User-centered AI-based voice-assistants for safe mobility of older people in urban context
Voice-assistants are becoming increasingly popular and can be deployed to offers a low-cost tool that can support and potentially reduce falls,...