StyleTTS 2

Name: StyleTTS 2
Rating: 2.5 (1 reviews)
Author: Star Stack

Text-to-speech model using style diffusion and adversarial training

Text-to-Speech (TTS)

Visit Website GitHub 6.3K

About

StyleTTS 2 is a text-to-speech model that generates human-level speech by modeling styles as latent random variables through diffusion. It uses large speech language models as discriminators to improve naturalness without requiring reference speech.