Search Results - "Bak, Taejun"
1
SYNTHE-SEES: Face Based Text-to-Speech for Virtual Speaker
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Recent virtual voice generation researches have limitations in that they results in low-quality voice and generate inconsistent voice from the same speaker's…”
Conference Proceeding
2
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
Published 04-10-2024
“…Text-to-speech (TTS) systems that scale up the amount of training data have achieved significant improvements in zero-shot speech synthesis. However, these…”
Journal Article
3
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Published 27-06-2022
“…Neural vocoders based on the generative adversarial neural network (GAN) have been widely used due to their fast inference speed and lightweight networks while…”
Journal Article
4
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
Published 29-06-2021
“…Recent advances in neural multi-speaker text-to-speech (TTS) models have enabled the generation of reasonably good speech quality with a single model and made…”
Journal Article
5
FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Published 29-06-2021
“…Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models. Prosodic speech can be generated…”
Journal Article