measure the run-time on GPU and CPU. (1 sec audio takes ~47 secs) If anyone knows additional tricks from the paper, let me know. So far I asked the authors but nobody returned.
train on LJSpeech spectrograms.
distill model as in Parallel WaveNet paper.
Open Source Agenda is not affiliated with "Erogol FFTNet" Project. README Source: erogol/FFTNet