Deep Learning

PromptTTS++: Controlling Speaker Identity in Prompt-based Text-to-Speech using Natural Language Descriptions

Accepted to ICASSP 2024

Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana

Enhancing Multilingual TTS with Voice Conversion based Data Augmentation and Posterior Embedding

Accepted to ICASSP 2024

Hyun-Wook Yoon, Jin-Seob Kim, Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song, Jae-Min Kim, Eunwoo Song

Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

Accepted to ICASSP 2024

Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda

A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023

Accepted to ASRU 2023

Ryuichi Yamamoto, Reo Yoneyama, Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit

Accepted to ICASSP 2023

Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda

Non-parallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs

Accepted to ICASSP 2023

Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana

Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech Synthesis

Accepted to ICASSP 2023

Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Accepted to ICASSP 2023

Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana

DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning

Accepted to Interspeech 2022

Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto

TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder

Accepted to Interspeech 2022

Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim