Speech Synthesis

Research and development for TTS in industry @ Nagoya Institute of Technology

Jan 25, 2022 10:30 AM — 12:00 PM

Ryuichi Yamamoto

LJSpeech is a valuable dataset, but I don't think it is well suited to comparing the quality of neural vocoders

LJSpeech Dataset: https://keithito.com/LJ-Speech-Dataset/

Jun 11, 2019 1 min read

I tried WaveNet (WN)-based TTS / Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions [arXiv:1712.05884]

Audio samples: https://r9y9.github.io/wavenet_vocoder/

May 20, 2018 3 min read

I tried out a WaveNet vocoder, so here are my notes / WaveNet: A Generative Model for Raw Audio [arXiv:1609.03499]

Audio samples: https://r9y9.github.io/wavenet_vocoder/

Jan 28, 2018 2 min read

[108-speaker edition] Deep Voice 3: 2000-Speaker Neural Text-to-Speech / arXiv:1710.07654 [cs.SD]

Audio samples: https://r9y9.github.io/deepvoice3_pytorch/

Dec 22, 2017 4 min read

[Single-speaker edition] Deep Voice 3: 2000-Speaker Neural Text-to-Speech / arXiv:1710.07654 [cs.SD]

Audio samples: https://r9y9.github.io/deepvoice3_pytorch/

Dec 13, 2017 3 min read

Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention [arXiv:1710.08969]

GitHub: https://github.com/r9y9/deepvoice3_pytorch

Nov 23, 2017 3 min read

Preprocessing JSUT, a corpus usable for Japanese end-to-end speech synthesis [arXiv:1711.00354]

JSUT: https://sites.google.com/site/shinnosuketakamichi/publication/jsut

Nov 12, 2017 1 min read

Tacotron: Towards End-to-End Speech Synthesis / arXiv:1703.10135 [cs.CL]

GitHub: https://github.com/r9y9/tacotron_pytorch

Oct 15, 2017 7 min read

GAN-based Japanese speech synthesis [arXiv:1709.08041]

IEEE TASLP: https://ieeexplore.ieee.org/document/8063435/

Oct 10, 2017 3 min read