Interspeech

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Accepted to Interspeech 2024.
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
Accepted to Interspeech 2024.
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
Accepted to Interspeech 2024.
TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
Accepted to Interspeech 2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Accepted to Interspeech 2022
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Accepted to Interspeech 2022