TTS | LESS IS MORE

PromptTTS++: Controlling Speaker Identity in Prompt-based Text-to-Speech using Natural Language Descriptions

Accepted to ICASSP 2024

Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana

Enhancing Multilingual TTS with Voice Conversion based Data Augmentation and Posterior Embedding

Accepted to ICASSP 2024

Hyun-Wook Yoon, Jin-Seob Kim, Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song, Jae-Min Kim, Eunwoo Song

Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech Synthesis

Accepted to ICASSP 2023

Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Accepted to ICASSP 2023

Masaya Kawamura1, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana

DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning

Accepted to Interspeech 2022

Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto

TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder

Accepted to Interspeech 2022

Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Accepted to Interspeech 2022

Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Accepted to Interspeech 2022

Hyunwook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang

企業における音声合成の研究開発 / Research and development for TTS in industry @名古屋工業大学

Jan 25, 2022 10:30 AM — 12:00 PM

Ryuichi Yamamoto

ESPnet2-TTS: Extending the Edge of TTS Research

Preprint: arXiv:2110.07840 (submitted to ICASSP 2022)

Tomoki Hayashi, Ryuichi Yamamoto, Takenori Yoshimura, Peter Wu, Jiatong Shi, Takaaki Saeki, Yooncheol Ju, Yusuke Yasuda, Shinnosuke Takamichi, Shinji Watanabe