2025-03-18 class format:
- FastPitch – case study: model training
- SoundStream – learning to encode speech
- VALL-E – a Large Speech Language Model
2025-03-25 class format:
- VALL-E – a Large Speech Language Model (continued)
- Tasks beyond TTS, including Voice Conversion
Demo pages:
- Example audio codec: SoundStream
- Example Large Speech Language Models:
- Example speech editing model: VoiceCraft
- Example Voice Conversion models: