Search Results - Springer

Sort By Newest First Oldest First

Article

Open Access

WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion

Voice conversion (VC) is a task for changing the speech of a source speaker to the target voice while preserving linguistic information of the source speech. The existing VC methods typically use mel-spectrogr...

Kyungdeuk Ko, Donghyeon Kim, Kyungseok Oh, Hanseok Ko in Neural Processing Letters (2024)

Download PDF (538 KB) View Article
Chapter and Conference Paper

SpeechBalloon: A New Approach of Providing User Interface for Real-Time Generation of Meeting Notes

This paper proposes SpeechBalloon, a solution for real-time generation of meeting notes for the purpose of facilitating effective communications among participants. This is especially important when some of th...

Donghyeon Kim, Suyeon Yoon, Jiyoung Seo… in Universal Access in Human-Computer Interac… (2023)
Chapter and Conference Paper

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

Over the years, 2D GANs have achieved great successes in photorealistic portrait generation. However, they lack 3D understanding in the generation process, thus they suffer from multi-view inconsistency proble...

Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim… in Computer Vision – ECCV 2022 (2022)
Chapter and Conference Paper

Pre-trained Language Model for Biomedical Question Answering

The recent success of question answering systems is largely attributed to pre-trained language models. However, as language models are mostly pre-trained on general domain corpora such as Wikipedia, they often...

Won** Yoon, **hyuk Lee, Donghyeon Kim… in Machine Learning and Knowledge Discovery i… (2020)

Download PDF (904 KB) View Chapter
Chapter and Conference Paper

Model-Based Gait Recognition Using Multiple Feature Detection

This paper presents a gait recognition algorithm for human identification from a sequence of segmented noisy silhouettes in a low-resolution video. The main contribution of the proposed work is the use of the ...

Donghyeon Kim, Daehee Kim, Joonki Paik in Advanced Concepts for Intelligent Vision Systems (2008)

5 Result(s)

WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion

SpeechBalloon: A New Approach of Providing User Interface for Real-Time Generation of Meeting Notes

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

Pre-trained Language Model for Biomedical Question Answering

Model-Based Gait Recognition Using Multiple Feature Detection

Our Content

Other Sites

Help & Contacts