CalebFenton/Voice-synthesis

CalebFenton

Fetched on 2026/07/13 16:22

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
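
To make the three-stage data flow concrete, here is a minimal sketch in plain Python. In the SV2TTS paper the stages are a speaker encoder, a synthesizer, and a vocoder; the functions below (`encode_speaker`, `synthesize`, `vocode`, `clone_voice`) and the constant `EMBED_DIM` are hypothetical toy stand-ins written for illustration, not this repository's API, and they use simple arithmetic where the real system uses trained neural networks.

```python
import math
from typing import List

EMBED_DIM = 8  # hypothetical size; real speaker embeddings are much larger


def encode_speaker(waveform: List[float]) -> List[float]:
    """Stage 1 stand-in: reduce reference audio to a fixed-size unit vector."""
    chunk = max(1, len(waveform) // EMBED_DIM)
    # Mean absolute amplitude of each chunk acts as a toy "voice feature".
    feats = [
        sum(abs(s) for s in waveform[i * chunk:(i + 1) * chunk]) / chunk
        for i in range(EMBED_DIM)
    ]
    norm = math.sqrt(sum(f * f for f in feats)) or 1.0  # avoid dividing by zero
    return [f / norm for f in feats]


def synthesize(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 stand-in: one spectrogram-like frame per character,
    conditioned on the speaker embedding by simple addition."""
    return [[(ord(ch) % 32) / 32.0 + e for e in embedding] for ch in text]


def vocode(frames: List[List[float]], hop: int = 4) -> List[float]:
    """Stage 3 stand-in: expand each frame into `hop` waveform samples."""
    wave: List[float] = []
    for frame in frames:
        level = sum(frame) / len(frame)
        wave.extend(level * math.sin(2 * math.pi * k / hop) for k in range(hop))
    return wave


def clone_voice(reference_audio: List[float], text: str) -> List[float]:
    """Chain the stages: audio -> embedding -> frames -> waveform."""
    embedding = encode_speaker(reference_audio)
    return vocode(synthesize(text, embedding))


# A synthetic "recording" stands in for a few seconds of reference speech.
reference = [math.sin(0.01 * i) for i in range(1600)]
embedding = encode_speaker(reference)
output = clone_voice(reference, "hello")
```

The point of the sketch is the interface between stages: the embedding is computed once from the reference audio and can then condition the synthesis of any text, while the vocoder only ever sees the intermediate frames.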

Rank: 6119737
