Deep Learning | LESS IS MORE

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Accepted to Interspeech 2022

Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana

A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech

Accepted to Interspeech 2022

Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Accepted to Interspeech 2022

Hyunwook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang

ESPnet2-TTS: Extending the Edge of TTS Research

Preprint: arXiv:2110.07840 (submitted to ICASSP 2022)

Tomoki Hayashi, Ryuichi Yamamoto, Takenori Yoshimura, Peter Wu, Jiatong Shi, Takaaki Saeki, Yooncheol Ju, Yusuke Yasuda, Shinnosuke Takamichi, Shinji Watanabe

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Ryuichi Yamamoto, Shinnosuke Takamichi

Voicing-Aware Parallel WaveGAN for High-Quality Speech Synthesis

Submitted to IEEE signal processing letters (rejected)

Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang

High-fidelity Parallel WaveGAN with Multi-band Harmonic-plus-Noise Model

Published version: ISCA Archive Interspeech 2021

Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim

Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis

Preprint: arXiv:2104.12395, Published version: ISCA Archive Interspeech 2021

Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana

ここまで来た音声技術・今後の展望 / Current progress on speech technologies and its future prospects @ LINE DEV DAY 2020

Nov 25, 2020 4:40 PM — 5:20 PM

Togami Masahito, Yusuke Kida, Ryuichi Yamamoto, Keisuke Imoto

Parallel WaveGAN: GPUを利用した高速かつ高品質な音声合成 / Parallel WaveGAN: Fast and High-Quality GPU Text-to-Speech @ LINE DEV DAY 2020

Nov 25, 2020 2:20 PM — 2:50 PM

Ryuichi Yamamoto