2834647ec55c73442fc9023d7f82e6265864c82a
VITS Voice Conversion
This repo will guide you to add your voice into an existing VITS TTS model to make it a high-quality voice converter to all existing character voices in the model.
Welcome to play around with the base model, a Trilingual Anime VITS!
Currently Supported Tasks:
- Convert user's voice to characters listed here
- Chinese, English, Japanese TTS with user's voice
- Chinese, English, Japanese TTS with custom characters...
Currently Supported Characters for TTS & VC:
- Umamusume Pretty Derby
- Sanoba Witch
- Genshin Impact
- Custom characters...
Fine-tuning
It's recommended to perform fine-tuning on Google Colab because the original VITS has some dependencies that are difficult to configure.
How long does it take?
- Install dependencies (2 min)
- Record at least 10 your own voice (5 min)
- Fine-tune (30 min)
Inference or Usage
- Install Python if you haven't done so (Python >= 3.7)
- Clone this repo:
git clone https://github.com/SongtingLiu/VITS_voice_conversion.git - Install dependencies
pip install -r requirements_infer.txt - run VC_inference.py
python VC_inference.py
Description
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Languages
Python
99.4%
Cython
0.6%