34j / so-vits-svc-fork
- Tuesday, March 21, 2023, 00:13:47
so-vits-svc fork with REALTIME support (voice changer) and greatly improved interface.
A fork of so-vits-svc with realtime support and a greatly improved interface. It is based on branch 4.0 (v1), and the models are compatible.
Install this via pip (or your favourite package manager):
pip install -U torch torchaudio --index-url https://download.pytorch.org/whl/cu117
pip install so-vits-svc-fork

The GUI launches with the following command:
svcg

Realtime inference from the microphone:
svc vc --model-path <model-path>

Inference on a file:
svc --model-path <model-path> source.wav

Place your dataset like dataset_raw/{speaker_id}/**/{wav_file}.{any_format} (subfolders are acceptable) and run:
svc pre-resample
svc pre-config
svc pre-hubert
svc train

Adjust config.json before running the train command to match your VRAM capacity. As tested, the default requires about 14 GB.

For more details, run svc -h or svc <subcommand> -h.
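
The training workflow above can be sketched in Python as a quick pre-flight check. This is an illustrative sketch only, not part of so-vits-svc-fork: the helper names `files_per_speaker` and `training_commands` are hypothetical, and the only things taken from this README are the dataset_raw/{speaker_id}/**/ layout and the order of the four svc subcommands.

```python
from pathlib import Path

# Order of the training steps as listed in this README.
TRAIN_STEPS = ["pre-resample", "pre-config", "pre-hubert", "train"]


def files_per_speaker(dataset_root="dataset_raw"):
    """Map each speaker folder under dataset_root to the files found below it.

    Hypothetical helper: it only inspects the folder layout and does not
    check whether the files are valid audio.
    """
    root = Path(dataset_root)
    if not root.is_dir():
        return {}
    result = {}
    for speaker_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # rglob("*") mirrors the "**" in the layout: subfolders are acceptable.
        result[speaker_dir.name] = sorted(
            p.relative_to(speaker_dir).as_posix()
            for p in speaker_dir.rglob("*")
            if p.is_file()
        )
    return result


def training_commands():
    """Return the svc invocations in the order the README gives them."""
    return [["svc", step] for step in TRAIN_STEPS]


if __name__ == "__main__":
    speakers = files_per_speaker()
    if not speakers:
        print("No speaker folders found under dataset_raw/")
    for name, files in speakers.items():
        print(f"{name}: {len(files)} file(s)")
    # Only print the commands here; each one could be run in turn with
    # subprocess.run(cmd, check=True) once the dataset looks right.
    for cmd in training_commands():
        print(" ".join(cmd))
```

Running the script from the project directory lists how many files each speaker folder contains and prints the four commands in order, which makes an empty or misplaced speaker folder easy to spot before starting a long training run.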
> svc -h
Usage: svc [OPTIONS] COMMAND [ARGS]...
so-vits-svc allows any folder structure for training data.
However, the following folder structure is recommended.
When training: dataset_raw/{speaker_name}/{wav_name}.wav
When inference: configs/44k/config.json, logs/44k/G_XXXX.pth
If the folder structure is followed, you DO NOT NEED TO SPECIFY model path, config path, etc.
(The latest model will be automatically loaded.)
To train a model, run pre-resample, pre-config, pre-hubert, train.
To infer a model, run infer.
Options:
-h, --help Show this message and exit.
Commands:
clean          Clean up files, only useful if you are using the default file structure
infer          Inference
onnx           Export model to onnx
pre-config     Preprocessing part 2: config
pre-hubert     Preprocessing part 3: hubert If the HuBERT model is not found, it will be...
pre-resample   Preprocessing part 1: resample
train          Train model If D_0.pth or G_0.pth not found, automatically download from hub.
train-cluster  Train k-means clustering
vc             Realtime inference from microphone

Thanks goes to these wonderful people (emoji key):
34j
GarrettConway
BlueAmulet
ThrowawayAccount01
This project follows the all-contributors specification. Contributions of any kind welcome!