Projects

Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
Submitted to ICASSP 2025
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Accepted to Interspeech 2024.
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
Accepted to Interspeech 2024.
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
Accepted to Interspeech 2024.
PromptTTS++: Controlling Speaker Identity in Prompt-based Text-to-Speech using Natural Language Descriptions
Accepted to ICASSP 2024
Enhancing Multilingual TTS with Voice Conversion based Data Augmentation and Posterior Embedding
Accepted to ICASSP 2024
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Accepted to ICASSP 2024