Plachtaa / VITS-fast-fine-tuning
- понедельник, 14 августа 2023 г. в 00:00:06
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
This repo will guide you to add your own character voices, or even your own voice, into existing VITS TTS model to make it able to do the following tasks in less than 1 hour:
Welcome to play around with the base models!
Chinese & English & Japanese: Author: Me
Chinese & Japanese: Author: SayaSS
Chinese only:(No running huggingface spaces) Author: Wwwwhy230825
See LOCAL.md for local training guide.
Alternatively, you can perform fine-tuning on Google Colab
inference
, which are named G_latest.pth
and finetune_speaker.json
, respectively.inference
├───inference.exe
├───...
├───finetune_speaker.json
└───G_latest.pth
inference.exe
, the browser should pop up automatically.ffmpeg
to enable voice conversion feature.G_latest.pth
and moegoe_config.json
, respectively.If you have any questions, please feel free to open an issue or join our Discord server.