[논문리뷰] Tacotron: Towards End-to-End Speech Synthesis (INTERSPEECH17)
제목: TACOTRON: Towards End-to-End Speech Synthesis 저자: Yuxuan Wang, RJ Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous 소속: Google 발표: INTERSPEECH 2017 논문: https://arxiv.org/abs/1703.10135 오디오샘플: https://google.github.io/tacotron/ - Tacotron - 정말로 처음부터 끝까지 한 번에 다하고..
[논문리뷰] Deep Voice: Real-time Neural Text-to-Speech (ICML17)
논문제목: Deep Voice: Real-time Neural Text-to-Speech 저자: Sercan O Arık, Mike Chrzanowski, Adam Coates, Gregory Diamos, Andrew Gibiansky, Yongguo Kang, Xian Li, John Miller, Andrew Ng, Jonathan Raiman, Shubho Sengupta, Mohammad Shoeybi 소속: Baidu Research 논문: https://arxiv.org/abs/1702.07825 발표: ICML 2017 - Deep Voice 첫번째 버전 - TTS에서 구성성분을 5개의 블럭으로 구성하고 각각을 모두 NN으로 구현함. 여기서 phoneme boundary segmentati..
[논문리뷰] Parallel WaveNet: Fast High-Fidelity Speech Synthesis (ICML18)
논문제목: Parallel WaveNet: Fast High-Fidelity Speech Synthesis 저자: Aaron van den Oord, Yazhe Li, Igor Babuschkin, Karen Simonyan, Oriol Vinyals, Koray Kavukcuoglu, George van den Driessche, Edward Lockhart, Luis C. Cobo, Florian Stimberg, Norman Casagrande, Dominik Grewe, Seb Noury, Sander Dieleman, Erich Elsen, Nal Kalchbrenner, Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, Demis Has..