A package for curating doc file collections. Prominent features:
- Scrape texts from various sites, such as Wikisource. See example here. (PS: Consider contributing to the raw_etexts repo.)
- OCR a PDF with Google Drive. The PDF is automatically split into 25-page segments, and each segment is OCRed individually. See usage example here, function here.
- Autogenerated docs on Read the Docs (might be broken).
- Manually and periodically generated docs here.
- For detailed examples and help, please see individual module files in this package.
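
The 25-page splitting mentioned in the OCR feature above can be illustrated with a minimal sketch. This is not the package's own function: the name `split_page_ranges` and its parameters are hypothetical, and the code only computes the page ranges that such a split would produce, without touching any PDF or Google Drive API.

```python
from typing import List, Tuple


def split_page_ranges(num_pages: int, chunk_size: int = 25) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (first_page, last_page) ranges covering num_pages.

    Hypothetical helper for illustration only; the package's actual
    splitting logic may differ.
    """
    if num_pages < 0 or chunk_size <= 0:
        raise ValueError("num_pages must be >= 0 and chunk_size must be > 0")
    ranges = []
    for start in range(1, num_pages + 1, chunk_size):
        # The last segment may be shorter than chunk_size.
        end = min(start + chunk_size - 1, num_pages)
        ranges.append((start, end))
    return ranges


if __name__ == "__main__":
    print(split_page_ranges(60))  # [(1, 25), (26, 50), (51, 60)]
```

Each range would then correspond to one segment that is uploaded and OCRed on its own, with the results concatenated in page order.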
sudo pip install dict_curation -U
sudo pip install git+https://github.com/sanskrit-coders/dict_curation/@master -U
- Web.
Have a problem or question? Please head to GitHub.
- ~/.pypirc should contain your PyPI login credentials.
python setup.py bdist_wheel
twine upload dist/* --skip-existing
- Sphinx HTML docs can be generated with:
cd docs; make html
Run pytest in the root directory.