Search Page | SpringerLink

Emotional voice conversion using DBiLSTM-NN with MFCC and LogF0 features

Emotional voice conversion(EVC) aims to convert the speaker’s voice from one emotion state to another without changing the speaker and the voice...

Danyang Cao, Chengzhi Miao in Multimedia Tools and Applications

Article 16 May 2024

WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion

Voice conversion (VC) is a task for changing the speech of a source speaker to the target voice while preserving linguistic information of the source...

Kyungdeuk Ko, Donghyeon Kim, ... Hanseok Ko in Neural Processing Letters

Article Open access 08 May 2024

MelMAE-VC: Extending Masked Autoencoders to Voice Conversion

Voice conversion is a technique that generates speeches with text contents identical to source speeches and timbre features similar to reference...

Yuhao Wang, Yuantao Gu in Neural Information Processing

Conference paper 2024

Voice Conversion with Denoising Diffusion Probabilistic GAN Models

Voice conversion is a method that allows for the transformation of speaking style while maintaining the integrity of linguistic information. There...

Xulong Zhang, Jianzong Wang, ... **g **ao in Advanced Data Mining and Applications

Conference paper 2023

Improving Voice Style Conversion via Self-attention VAE with Feature Disentanglement

Voice conversion (VC) is a widely used technique in intelligent speech processing, that aims to modify the speaker’s information while preserving the...

Hui Yuan, ** Li, ... Jun Zhang in Computer Supported Cooperative Work and Social Computing

Conference paper 2024

Boosting StarGANs for Voice Conversion with Contrastive Discriminator

Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been widely applied in many scenarios. However, the training of these...

Shi**g Si, Jianzong Wang, ... **g **ao in Neural Information Processing

Conference paper 2023

Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices

Voice conversion aims to convert source speech into a target voice using recordings of the target speaker as a reference. Newer models are producing...

Matthew Baas, Herman Kamper in Artificial Intelligence Research

Conference paper 2023

Zero-Shot Singing Voice Conversion Based on Timbre Space Modeling and Excitation Signal Control

In recent years, singing voice conversion technology has rapidly advanced and is capable of generating high-quality singing voices. However,...

Yuan Jiang, Yan-Nian Chen, ... Zhen-Hua Ling in Man-Machine Speech Communication

Conference paper 2024

VC-AUG: Voice Conversion Based Data Augmentation for Text-Dependent Speaker Verification

In this paper, we focus on improving the performance of the text-dependent speaker verification system in the scenario of limited training data. The...

**aoyi Qin, Yaogen Yang, ... Ming Li in Man-Machine Speech Communication

Conference paper 2023

Voice Conversion Using Learnable Similarity-Guided Masked Autoencoder

Voice conversion (VC) is an important voice forgery method that poses a serious threat to personal privacy protection, especially with remarkable...

Yewei Gu, **anfeng Zhao, ... Junchao **ao in Digital Forensics and Watermarking

Conference paper 2023

Analysis of Mandarin vs English Language for Emotional Voice Conversion

Emotional Voice Conversion (EVC) is a method to convert the emotional state of an utterance to another without changing the linguistic information...

S. Uthiraa, Hemant A. Patil in Speech and Computer

Conference paper 2023

Battling voice spoofing: a review, comparative analysis, and generalizability evaluation of state-of-the-art voice spoofing counter measures

With the advent of automated speaker verification (ASV) systems comes an equal and opposite development: malicious actors may seek to use voice...

Awais Khan, Khalid Mahmood Malik, ... Mikul Saravanan in Artificial Intelligence Review

Article 28 June 2023

Voice conversion spoofing detection by exploring artifacts estimates

Automatic speaker verification or voice biometrics is an approach to verify the person’s claimed identity through his/her voice. Voice biometrics...

R. Hemavathi, R. Kumaraswamy in Multimedia Tools and Applications

Article 06 January 2021

Voice spoofing countermeasure for voice replay attacks using deep learning

In our everyday lives, we communicate with each other using several means and channels of communication, as communication is crucial in the lives of...

**cheng Zhou, Tao Hai, ... Cresantus Biamba in Journal of Cloud Computing

Article Open access 24 September 2022

Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech

In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream...

Dariusz Piotrowski, Renard Korzeniowski, ... Kayoko Yanagisawa in Neural Information Processing

Conference paper 2024

Detection of Voice Conversion Spoofing Attacks Using Voiced Speech

Speech consists of voiced and unvoiced segments that differ in their production process and exhibit different characteristics. In this paper, we...

Arun Sankar Muttathu Sivasankara Pillai, Phillip L. De Leon, Utz Roedig in Secure IT Systems

Conference paper 2022

Region Normalized Capsule Network Based Generative Adversarial Network for Non-parallel Voice Conversion

Voice conversion (VC) involves altering the vocal characteristics of a source speaker to resemble those of a target speaker while maintaining the...

Md. Tousin Akhter, Padmanabha Banerjee, ... Nanda Dulal Jana in Speech and Computer

Conference paper 2023

Voice Privacy Using Time-Scale and Pitch Modification

There is a growing demand toward digitization of various day-to-day work and hence, there is a surge in use of Intelligent Personal Assistants. The...

Dipesh K. Singh, Gauri P. Prajapati, Hemant A. Patil in SN Computer Science

Article 27 January 2024