Jonathan Shen
Verified email at google.com
Title
Cited by
Year
Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 1781 | Year: 2018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems 31, 2018
Cited by: 499 | Year: 2018
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
Cited by: 174 | Year: 2018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 142 | Year: 2019
SATzilla2012: Improved algorithm selection based on cost-sensitive classification models
L Xu, F Hutter, J Shen, HH Hoos, K Leyton-Brown
Proceedings of SAT Challenge, 57-58, 2012
Cited by: 98 | Year: 2012
Parallel Tacotron: Non-autoregressive and controllable TTS
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 54 | Year: 2021
Neural program synthesis with priority queue training
DA Abolafia, M Norouzi, J Shen, R Zhao, QV Le
arXiv preprint arXiv:1801.03526, 2018
Cited by: 48 | Year: 2018
Non-Attentive Tacotron: Robust and controllable neural TTS synthesis including unsupervised duration modeling
J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu
arXiv preprint arXiv:2010.04301, 2020
Cited by: 39 | Year: 2020
In teacher we trust: Learning compressed models for pedestrian detection
J Shen, N Vesdapunt, VN Boddeti, KM Kitani
arXiv preprint arXiv:1612.00478, 2016
Cited by: 31 | Year: 2016
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS
Y Jia, H Zen, J Shen, Y Zhang, Y Wu
arXiv preprint arXiv:2103.15060, 2021
Cited by: 26 | Year: 2021
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu
arXiv preprint arXiv:2103.14574, 2021
Cited by: 19 | Year: 2021
Synthesizing speech from text using neural networks
Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ...
US Patent 10,971,170, 2021
Cited by: 13 | Year: 2021
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
arXiv preprint cs.CL/1712.05884, 2018
Cited by: 2 | Year: 2018
Modelling intonation in spectrograms for neural vocoder based text-to-speech
V Wan, J Shen, H Silen, R Clark
Cited by: 1 | Year: 2020
Building a text-to-speech system from a small amount of speech data
Y Jia, B Chun, ODA Yusuke, N Casagrande, T Iyer, F Luo, ...
US Patent 11,335,321, 2022
2022
Parallel Tacotron Non-Autoregressive and Controllable TTS
I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun
US Patent App. 17/327,076, 2022
2022
Text-to-speech using duration prediction
Y Zhang, I Elias, B Chun, Y Jia, Y Wu, M Chrzanowski, J Shen
US Patent App. 17/492,543, 2022
2022
Examining Scaling and Transfer of Language Model Architectures for Machine Translation
B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat
arXiv preprint arXiv:2202.00528, 2022
2022
Synthesis of Speech from Text in a Voice of a Target Speaker Using Neural Networks
Y Jia, Z Chen, Y Wu, J Shen, R Pang, RJ Weiss, IL Moreno, F Ren, ...
US Patent App. 17/055,951, 2021
2021
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Alignments
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu
2021