
Cartesia Sonic

Streaming text-to-speech API with emotion and ultra-low latency

About

Cartesia Sonic is a streaming text-to-speech API that generates human-like speech, including emotional expression and laughter. It is designed for real-time voice agents that require sub-100 ms latency and context-aware pronunciation.