azalea/VITS-fast-fine-tuning

Fork 0

T

Plachta 2834647ec5 upload files

2023-02-15 16:18:49 +08:00

.idea

upload files

2023-02-13 14:17:54 +08:00

configs

upload files

2023-02-15 00:12:18 +08:00

monotonic_align

Delete core.c

2023-02-11 08:25:01 +08:00

text

Add files via upload

2023-02-11 08:22:58 +08:00

user_voice

upload files

2023-02-13 14:17:54 +08:00

attentions.py

Add files via upload

2023-02-11 08:22:58 +08:00

commons.py

upload files

2023-02-13 17:20:19 +08:00

data_utils.py

upload files

2023-02-13 17:20:19 +08:00

demucs_denoise.py

upload files

2023-02-13 14:59:49 +08:00

download_model.py

upload files

2023-02-15 14:33:42 +08:00

finetune_speaker.py

upload files

2023-02-15 00:12:18 +08:00

LICENSE

Initial commit

2023-02-11 08:14:59 +08:00

losses.py

Add files via upload

2023-02-11 08:22:58 +08:00

mel_processing.py

Add files via upload

2023-02-11 08:22:58 +08:00

models_infer.py

upload files

2023-02-15 16:18:49 +08:00

models.py

upload files

2023-02-13 17:20:19 +08:00

modules.py

upload files

2023-02-13 17:20:19 +08:00

preprocess.py

upload files

2023-02-13 17:46:39 +08:00

README_CN.md

upload files

2023-02-15 15:29:32 +08:00

README.md

upload files

2023-02-15 15:28:30 +08:00

requirements_infer.txt

upload files

2023-02-15 16:18:49 +08:00

requirements.txt

Update requirements.txt

2023-02-11 08:43:20 +08:00

transforms.py

Add files via upload

2023-02-11 08:22:58 +08:00

user_voice_collect.py

upload files

2023-02-13 14:42:04 +08:00

utils.py

upload files

2023-02-15 01:36:30 +08:00

VC_inference.py

upload files

2023-02-15 16:18:49 +08:00

README.md

中文文档请点击这里

VITS Voice Conversion

This repo will guide you to add your voice into an existing VITS TTS model to make it a high-quality voice converter to all existing character voices in the model.

Welcome to play around with the base model, a Trilingual Anime VITS!

Currently Supported Tasks:

Convert user's voice to characters listed here
Chinese, English, Japanese TTS with user's voice
Chinese, English, Japanese TTS with custom characters...

Currently Supported Characters for TTS & VC:

Umamusume Pretty Derby
Sanoba Witch
Genshin Impact
Custom characters...

Fine-tuning

It's recommended to perform fine-tuning on Google Colab because the original VITS has some dependencies that are difficult to configure.

How long does it take?

Install dependencies (2 min)
Record at least 10 your own voice (5 min)
Fine-tune (30 min)

Inference or Usage

Install Python if you haven't done so (Python >= 3.7)
Clone this repo:
git clone https://github.com/SongtingLiu/VITS_voice_conversion.git
Install dependencies
pip install -r requirements_infer.txt
run VC_inference.py
python VC_inference.py