Merlin is a toolkit for building deep neural network (DNN) models for statistical parametric speech synthesis. It follows the conventional frame-by-frame approach, which predates sequence-to-sequence models.
Zhizheng Wu, Oliver Watts, Simon King. “Merlin: An Open Source Neural Network Speech Synthesis System.” In Proc. 9th ISCA Speech Synthesis Workshop (SSW 9), 2016, pp. 202–207. DOI: 10.21437/SSW.2016-33
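
To make "frame-by-frame" concrete, here is a minimal NumPy sketch of the general idea: a feedforward network maps each frame's input feature vector to an output acoustic feature vector, independently of the other frames. It does not use Merlin's API. The function name, the layer sizes, and the random weights are all hypothetical and chosen only for illustration.

```python
import numpy as np


def frame_by_frame_predict(linguistic_frames, weights, biases):
    """Apply a small feedforward network to every frame independently.

    linguistic_frames: array of shape (n_frames, n_in), one row per frame.
    weights, biases:   per-layer parameters (hypothetical, not a trained model).
    Returns an array of shape (n_frames, n_out).
    """
    hidden = linguistic_frames
    # Hidden layers with tanh activation.
    for W, b in zip(weights[:-1], biases[:-1]):
        hidden = np.tanh(hidden @ W + b)
    # Linear output layer, as is usual for regression targets.
    return hidden @ weights[-1] + biases[-1]


# Hypothetical sizes: 5 frames, 10 input features, 8 hidden units, 4 outputs.
rng = np.random.default_rng(0)
n_frames, n_in, n_hidden, n_out = 5, 10, 8, 4

weights = [rng.standard_normal((n_in, n_hidden)),
           rng.standard_normal((n_hidden, n_out))]
biases = [np.zeros(n_hidden), np.zeros(n_out)]

linguistic = rng.standard_normal((n_frames, n_in))
acoustic = frame_by_frame_predict(linguistic, weights, biases)

# Predicting one frame on its own gives the same result as predicting it
# as part of the whole utterance: no frame depends on its neighbours.
single = frame_by_frame_predict(linguistic[2:3], weights, biases)
print(acoustic.shape, np.allclose(single[0], acoustic[2]))
```

Because each output row depends only on the matching input row, the utterance length is fixed by the number of input frames. A sequence-to-sequence model, by contrast, learns the alignment between input and output sequences itself.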