ilektram/pySpark-for-ebook-Classification

pySpark-for-ebook-Classification

For this report, a Python script was written to process the Project Gutenberg ebooks and build a subject classifier for them using machine learning techniques. To cope with the size of the data set, Spark was run on City University’s network through its Python API (PySpark).
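The description above does not say which features or model the script uses, so the following is only a minimal sketch of how such a classifier could be set up in PySpark. It assumes TF-IDF features and logistic regression, which may differ from the actual script. The names `tokenize`, `build_pipeline` and `main`, the column names and the two toy rows are all hypothetical.

```python
import re


def tokenize(text):
    """Lowercase a raw ebook string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_pipeline(num_features=4096):
    """Assemble a TF-IDF + logistic regression pipeline (an assumed model choice)."""
    # Imported here so the pure-Python helper above works without Spark installed.
    from pyspark.ml import Pipeline
    from pyspark.ml.classification import LogisticRegression
    from pyspark.ml.feature import HashingTF, IDF, StringIndexer

    return Pipeline(stages=[
        # Map subject strings to numeric labels.
        StringIndexer(inputCol="subject", outputCol="label"),
        # Hash token lists into fixed-size term-frequency vectors.
        HashingTF(inputCol="words", outputCol="tf", numFeatures=num_features),
        # Down-weight terms that appear in many documents.
        IDF(inputCol="tf", outputCol="features"),
        LogisticRegression(featuresCol="features", labelCol="label", maxIter=20),
    ])


def main():
    """Fit the pipeline on toy data; requires a working Spark installation."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("ebook-subjects").getOrCreate()
    # Hypothetical toy rows (subject, text); real input would be the Gutenberg files.
    rows = [
        ("science", "The experiment measured the speed of light."),
        ("fiction", "She walked alone through the silent forest."),
    ]
    df = spark.createDataFrame(
        [(subject, tokenize(text)) for subject, text in rows],
        ["subject", "words"],
    )
    model = build_pipeline().fit(df)
    model.transform(df).select("subject", "prediction").show()
    spark.stop()
```

Calling `main()` on a machine with Spark installed fits the pipeline on the two toy rows and prints their predicted labels. On a cluster, the toy rows would be replaced by a DataFrame read from the ebook files.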
