Abstract: Vocoder-based speech synthesis has become a promising technique to accommodate the demands of high-quality speech analysis, manipulation, and synthesis. However, most existing works focus on ...
Abstract: Recently, GAN vocoders have seen rapid progress in speech synthesis, starting to outperform autoregressive models in perceptual quality with much higher generation speed. However, ...
2025-12-21 Smark: A Watermark for Text-to-Speech Diffusion Models via Discrete Wavelet Transform Yichuan Zhang et al. 2512.18791 null
2025-12-21 Task Vector in TTS: Toward Emotionally Expressive ...