Train a xtts_v2 voice

Posted on 19 April 2024

Coqui TTS xtts_v2 voice is a natural sounding AI voice text-to-speech model. It supports Portuguese, however it's the pt-BR variant instead of pt-PT, and it still sounds like pt-BR even in voice cloning mode.

Training a new xtts_v2 voice

https://docs.coqui.ai/en/latest/tutorial_for_nervous_beginners.html#cli-way

https://medium.com/@klintcho/creating-an-open-speech-recognition-dataset-for-almost-any-language-c532fb2bc0cf

https://old.reddit.com/r/Oobabooga/comments/18pvcce/alltalk_tts_v17_now_with_xtts_model_finetuning/

https://github.com/erew123/alltalk_tts

python train_gpt_xtts.py --config_path config.json