End-to-End Speech Synthesis
At Google, I am now a member of the team that brought you Tacotron, an end-to-end speech synthesis system that uses neural networks to convert text directly to audio. Check out the audio samples from the recently released Tacotron 2 system, which combines Tacotron with a Wavenet-based vocoder.
Related
Publications
Preprint PDF Project Source Document Ref Audio Examples Blog Post
Preprint PDF Project Poster Slides Video Ref Audio Examples Blog Post