I should have unit tests for this but I dont. I do test everything manually from my terminal using import pdb; pdb.set_trace().
Clean: Lower cased, stripped of white spaces on both sides.Special-Word: Any Spanish word(s) that contain any character(s) in this list['á', 'é', 'í', 'ó', 'ú', 'ü', 'ñ'].Normalize:Clean+ white space replaced with no space +Special-Wordwords get a non-special character copy.
- Clone
- Run
python3 -m venv venv. - Run
. venv/bin/activate. - Run
npm i - Run
python install -e ..
- Run
clean spanish-animals. - Run
run-csv.- Translates existing English ens.vision csv files into Spanish.
- CSV files must be in
/collections/*.csv. - Translated file will be in
/tests/fixtures/.*txt. - Language values hardcoded but can be easily changed to be automated.
- Run
run-txt english-animals-file1 spanish-animals-file2.- Translates
file1and writes output intofile2. file1must exist at/tests/fixtures/*.txt..file2will exist at/tests/fixtures/*.txt.
- Translates
- Run
animals cleaned-animals.- Scraper for animals list.
- Run
node scripts/normalize.js- Update the location of a
normalizedtext file on line 8. - Prints to console ens-collection ready csv data.
- Copy text on console into your local copy of ens-collections, in
ens-collections/collections/*.csv.
- Update the location of a
