Official code for our paper "DistillW2N: A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features" (DistillW2N), accepted at ICASSP 2025.
- Create a Python environment, e.g. with conda:
```bash
conda create --name distillw2n python=3.10.12 --yes
```
- Activate the new environment:
```bash
conda activate distillw2n
```
- Install torch and torchaudio:
```bash
pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu121
```
- Install the required system packages:
```bash
sudo apt-get update && sudo apt-get install -y libsndfile1 ffmpeg
```
- Install the Python requirements:
```bash
pip install -r requirements.txt
```
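A quick way to confirm the environment is working is the minimal sanity check below; it is not repo-specific, just a verification that the cu121 wheels installed above import and see the GPU:

```python
# Sanity check: confirm torch/torchaudio import and that CUDA is visible
# (assumes the cu121 wheels installed in the steps above).
import torch
import torchaudio

print("torch:", torch.__version__)
print("torchaudio:", torchaudio.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```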
- Download the pretrained models with the links given in the txt file.
- For QuickVC and WESPER, please run:
```bash
python compare_infer.py
```
- For our models, please run:
```bash
python infer.py
```
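If you need to prepare your own recordings first, here is a minimal sketch using torchaudio. It assumes the checkpoints expect 16 kHz mono input, which is common for self-supervised speech features (check the inference scripts for the actual rate); `input.wav` and `input_16k.wav` are placeholder paths, not repo files:

```python
# Minimal input-preparation sketch: downmix to mono and resample to 16 kHz,
# the rate self-supervised speech encoders typically expect.
# "input.wav" / "input_16k.wav" are placeholder paths, not repo files.
import torchaudio

wav, sr = torchaudio.load("input.wav")  # (channels, samples)
wav = wav.mean(dim=0, keepdim=True)     # downmix to mono
if sr != 16000:
    wav = torchaudio.functional.resample(wav, orig_freq=sr, new_freq=16000)
torchaudio.save("input_16k.wav", wav, 16000)
```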
- To train from scratch, please run:
```bash
python u2ss2u.py
```
You just need to download the datasets under `YOURPATH`.
- Dataset Download
  - For the libritts, ljspeech, and timit datasets, datahelper will automatically download them if they are not found at `YOURPATH` (see the sketch after this list).
  - For the wtimit dataset, you will need to request it via email. Follow the appropriate procedures to obtain access and download the dataset to `YOURPATH`.
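For reference, torchaudio ships a ready-made LibriTTS wrapper that downloads on demand; datahelper presumably automates something comparable. A sketch, where the subset name is only an example:

```python
# Illustration only: torchaudio's built-in wrapper fetches LibriTTS into
# the given root if it is missing; datahelper presumably performs a
# comparable automatic download for libritts, ljspeech, and timit.
import torchaudio

dataset = torchaudio.datasets.LIBRITTS(
    root="YOURPATH",        # replace with your dataset root
    url="train-clean-100",  # example subset
    download=True,          # download the archive if not found locally
)
waveform, sample_rate, *_ = dataset[0]
print(waveform.shape, sample_rate)
```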
- Dataset Preparation (Optional)
  - datapreper offers options for ppw (Pseudo-whisper) and vad (Voice Activity Detection) versions; you can apply these processing steps according to your project's requirements (a VAD sketch follows below).
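As a rough illustration of what the vad option does, the sketch below trims silence from both ends of an utterance with torchaudio's SoX-style voice activity detector; the actual datapreper implementation may differ, and `utterance.wav` is a placeholder path:

```python
# Hedged sketch of VAD-style preprocessing with torchaudio's SoX-like
# detector; datapreper's actual vad option may be implemented differently.
import torch
import torchaudio
import torchaudio.functional as F

wav, sr = torchaudio.load("utterance.wav")  # placeholder path
wav = F.vad(wav, sample_rate=sr)            # trim leading silence
wav = torch.flip(wav, dims=[-1])            # reverse the signal
wav = F.vad(wav, sample_rate=sr)            # trim (former) trailing silence
wav = torch.flip(wav, dims=[-1])            # restore original order
torchaudio.save("utterance_vad.wav", wav, sr)
```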
This implementation builds on:
- SoundStream for the training pipeline.