FastSpeech 2 and 2s: Fast, High-Quality, and Fully End-to-End TTS
Published:
FastSpeech 2 simplifies the TTS training pipeline by eliminating the teacher-student distillation process and adding pitch, energy, and duration as explicit conditioning features. FastSpeech 2s takes this a step further by directly generating waveform in a fully end-to-end manner.
