🐸Coqui.ai News # 📣 ⓍTTSv2 is here with 16 languages and better performance across the board. 📣 ⓍTTS fine-tuning code is out. Check the example recipes.

Understanding the Context

📣 ⓍTTS can now stream with <200ms latency. 📣 ⓍTTS, our production TTS model that can speak 13 languages, is released Blog Post, Demo, Docs 📣 🐶Bark is now available for inference with unconstrained voice cloning ... This is the same model that powers Coqui Studio, and Coqui API, however we apply a few tricks to make it faster and support streaming inference. Features # Voice cloning.

Key Insights

Cross-language voice cloning. Multi-lingual speech generation. 24khz sampling rate. Streaming inference with < 200ms latency. (See Streaming inference) Fine-tuning support ...

Final Thoughts

What makes a good TTS dataset - TTS 0.22.0 documentation - Coqui