LESS IS MORE
Speech Synthesis
Research and development for TTS in industry @ Nagoya Institute of Technology
Jan 25, 2022 10:30 AM — 12:00 PM
Ryuichi Yamamoto
LJSpeech is a valuable dataset, but I don't think it is well suited to comparing the quality of neural vocoders
LJSpeech Dataset:
https://keithito.com/LJ-Speech-Dataset/
Jun 11, 2019
1 min read
I tried WN-based TTS / Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions [arXiv:1712.05884]
Audio samples:
https://r9y9.github.io/wavenet_vocoder/
May 20, 2018
3 min read
Notes from trying out a WaveNet vocoder / WaveNet: A Generative Model for Raw Audio [arXiv:1609.03499]
Audio samples:
https://r9y9.github.io/wavenet_vocoder/
Jan 28, 2018
2 min read
[108-speaker edition] Deep Voice 3: 2000-Speaker Neural Text-to-Speech / arXiv:1710.07654 [cs.SD]
Audio samples:
https://r9y9.github.io/deepvoice3_pytorch/
Dec 22, 2017
4 min read
[Single-speaker edition] Deep Voice 3: 2000-Speaker Neural Text-to-Speech / arXiv:1710.07654 [cs.SD]
Audio samples:
https://r9y9.github.io/deepvoice3_pytorch/
Dec 13, 2017
3 min read
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention. [arXiv:1710.08969]
GitHub:
https://github.com/r9y9/deepvoice3_pytorch
Nov 23, 2017
3 min read
Preprocessing JSUT, a corpus usable for Japanese end-to-end speech synthesis [arXiv:1711.00354]
JSUT:
https://sites.google.com/site/shinnosuketakamichi/publication/jsut
Nov 12, 2017
1 min read
Tacotron: Towards End-to-End Speech Synthesis / arXiv:1703.10135 [cs.CL]
GitHub:
https://github.com/r9y9/tacotron_pytorch
Oct 15, 2017
7 min read
Japanese speech synthesis with GANs [arXiv:1709.08041]
IEEE TASLP:
https://ieeexplore.ieee.org/document/8063435/
Oct 10, 2017
3 min read