LESS IS MORE
LESS IS MORE
Home
Projects
Posts
Light
Dark
Automatic
Projects
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
Submitted to ICASSP 2025
Ryuichi Yamamoto
,
Yuma Shirahata
,
Masaya Kawamura
,
Kentaro Tachibana
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
Accepted to Interspeech 2024.
Masaya Kawamura
,
Ryuichi Yamamoto
,
Yuma Shirahata
,
Takuya Hasumi
,
Kentaro Tachibana
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Accepted to Interspeech 2024.
Yongyi Zang
,
Jiatong Shi
,
You Zhang
,
Ryuichi Yamamoto
,
Jionghao Han
,
Yuxun Tang
,
Shengyuan Xu
,
Wenxiao Zhao
,
Jing Guo
,
Tomoki Toda
,
Zhiyao Duan
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
Accepted to Interspeech 2024.
Takuto Igarashi
,
Yuki Saito
,
Kentaro Seki
,
Shinnosuke Takamichi
,
Ryuichi Yamamoto
,
Kentaro Tachibana
,
Hiroshi Saruwatari
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
Accepted to Interspeech 2024.
Yuki Saito
,
Takuto Igarashi
,
Kentaro Seki
,
Shinnosuke Takamichi
,
Ryuichi Yamamoto
,
Kentaro Tachibana
,
Hiroshi Saruwatari
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
Accepted to Interspeech 2024.
Yuma Shirahata
,
Byeongseon Park
,
Ryuichi Yamamoto
,
Kentaro Tachibana
PromptTTS++: Controlling Speaker Identity in Prompt-based Text-to-Speech using Natural Language Descriptions
Accepted to
ICASSP 2024
Reo Shimizu
,
Ryuichi Yamamoto
,
Masaya Kawamura
,
Yuma Shirahata
,
Hironori Doi
,
Tatsuya Komatsu
,
Kentaro Tachibana
Enhancing Multilingual TTS with Voice Conversion based Data Augmentation and Posterior Embedding
Accepted to
ICASSP 2024
Hyun-Wook Yoon
,
Jin-Seob Kim
,
Ryuichi Yamamoto
,
Ryo Terashima
,
Chan-Ho Song
,
Jae-Min Kim
,
Eunwoo Song
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Accepted to
ICASSP 2024
Lester Phillip Violeta1
,
Wen-Chin Huang
,
Ding Ma
,
Ryuichi Yamamoto
,
Kazuhiro Kobayashi
,
Tomoki Toda1
A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Accepted to
ASRU 2023
Ryuichi Yamamoto
,
Reo Yoneyama
,
Lester Phillip Violeta
,
Wen-Chin Huang
,
Tomoki Toda
»
Cite
×