-
Article (Open Access)
WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion
Voice conversion (VC) is a task for changing the speech of a source speaker to the target voice while preserving linguistic information of the source speech. The existing VC methods typically use mel-spectrogr...
-
Chapter and Conference Paper
SpeechBalloon: A New Approach of Providing User Interface for Real-Time Generation of Meeting Notes
This paper proposes SpeechBalloon, a solution for real-time generation of meeting notes for the purpose of facilitating effective communications among participants. This is especially important when some of th...
-
Chapter and Conference Paper
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis
Over the years, 2D GANs have achieved great successes in photorealistic portrait generation. However, they lack 3D understanding in the generation process, thus they suffer from multi-view inconsistency proble...
-
Chapter and Conference Paper
Pre-trained Language Model for Biomedical Question Answering
The recent success of question answering systems is largely attributed to pre-trained language models. However, as language models are mostly pre-trained on general domain corpora such as Wikipedia, they often...
-
Chapter and Conference Paper
Model-Based Gait Recognition Using Multiple Feature Detection
This paper presents a gait recognition algorithm for human identification from a sequence of segmented noisy silhouettes in a low-resolution video. The main contribution of the proposed work is the use of the ...