Skip to content

foteinigk/datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

datasets

load datasets from Hugging Face and Opus

Run scripts

poetry install

cd load_datasets

Download dataset from huggingface for opus100 model, el, en languages and generate a file with 500 sentences

poetry run python huggingface_main.py opus100 el en 0 500

Download dataset from opus

poetry run python opus_main.py GlobalVoices en el

Create a sample from generated files

poetry run pythonsample.py en el 500

About

load datasets

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages