Scraping dystopian books from a Wikipedia page
- Install Git
- Install Python
- Install pip
- Clone this repo
Open a terminal and run
git clone [email protected]:rgrangeiro/dystopian_books.git
- Install the requirements
pip install -r ./dystopian_books/requirements.txt
- Start Jupyter Notebook
jupyter notebook &
- Open http://localhost:8888/tree/dystopian_books in a browser
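Before cloning, it can help to confirm that the tools from the steps above are on your PATH. The following is a minimal sketch, assuming the executables are named `git`, `python3` or `python`, and `pip3` or `pip` (names vary by platform). `missing_tools` is a hypothetical helper and is not part of this repo.

```python
import shutil

# Each requirement maps to the executable names it may go by.
# These names are assumptions and differ between operating systems.
REQUIRED = {
    "git": ("git",),
    "python": ("python3", "python"),
    "pip": ("pip3", "pip"),
}


def missing_tools(required=REQUIRED, which=shutil.which):
    """Return the requirement names for which no candidate executable is on PATH."""
    return [
        name
        for name, candidates in required.items()
        if not any(which(cmd) for cmd in candidates)
    ]


if __name__ == "__main__":
    missing = missing_tools()
    print("Missing: " + ", ".join(missing) if missing else "All prerequisites found.")
```

An empty result means every tool was found under at least one of the assumed names.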
The Wikipedia page the data comes from changes over time, and its content is neither guaranteed to be accurate nor consistently formatted. The notebook is only a case study of what can be done, and it will need adjustments as the page changes.
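To illustrate why adjustments are needed, here is a minimal sketch of tolerant table parsing using only the Python standard library. It is not the notebook's actual code. The `SAMPLE` markup, the column names, and the helpers `TableRows` and `parse_books` are all made up for illustration, and the sketch assumes the first table row is a header.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row being read, or None outside <tr>
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # drop any cell left unclosed


def parse_books(html, columns=("title", "author", "year")):
    """Turn table rows into dicts, padding short rows with empty strings."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    books = []
    for row in parser.rows[1:]:  # assumption: the first row is the header
        padded = row + [""] * (len(columns) - len(row))
        books.append(dict(zip(columns, padded)))
    return books


# Made-up sample markup; the second row deliberately lacks a year cell.
SAMPLE = """
<table class="wikitable">
<tr><th>Title</th><th>Author</th><th>Year</th></tr>
<tr><td><i>Nineteen Eighty-Four</i></td><td>George Orwell</td><td>1949</td></tr>
<tr><td><i>We</i></td><td>Yevgeny Zamyatin</td></tr>
</table>
"""

if __name__ == "__main__":
    for book in parse_books(SAMPLE):
        print(book)
```

Padding short rows instead of raising an error keeps the scrape running when a row is incomplete, at the cost of empty fields that have to be checked afterwards.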