F5-TTS

Name: F5-TTS
Rating: 2.5 (1 reviews)
Author: Star Stack

Open-source zero-shot voice cloning using flow matching

Voice Cloning

Visit Website GitHub 14.8K

About

F5-TTS is an open-source text-to-speech model that performs zero-shot voice cloning from short audio samples. It utilizes a flow matching architecture with diffusion transformers to generate fluent and faithful speech.