Publications

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis. arXiv, 2019.

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis. arXiv, 2019.

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. ICML, 2018.

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. ICML, 2018.

Uncovering Latent Style Factors for Expressive Speech Synthesis. NIPS ML4Audio Workshop, 2017.

Exploring Neural Transducers for End-to-End Speech Recognition. ASRU, 2017.

Reducing Bias in Production Speech Models. arXiv, 2017.

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. ICML, 2016.

Lasagne: First Release. GitHub, 2015.

Code Project 0.1 Ref

LibROSA: Audio and Music Signal Analysis in Python. SciPy, 2015.

PDF Code Project 0.5.0 Ref

Scalable Multimedia Content Analysis on Parallel Platforms Using Python. TOMCCAP, 2014.

Real-Time Musical Applications on an Experimental Operating System for Multi-Core Processors. ICMC, 2011.

Advances in the Parallelization of Music and Audio Applications. ICMC, 2010.

Optimizing Hearing Aids for Music Listening. ICA, 2007.
