Skip to content

Latest commit

 

History

History
46 lines (30 loc) · 850 Bytes

Readme.md

File metadata and controls

46 lines (30 loc) · 850 Bytes

IMMO ELIZA SCRAPER

installation steps

  1. clone the repository using git
git clone [email protected]:DeLeb86/immoscraper.git
  1. create a virtual environment using venv
python3 -m venv ~/.venv/eliza_scraping
  1. activate the virtual environment
source ~/.venv/eliza_scraping/bin/activate
  1. install required libraries with pip
pip install -r requirements.txt

Execution

run scrapy command :

scrapy crawl immowescraper -o data/output.json

The output is important because the post process step executed when the spider is done reads data from that file.

Results

  1. raw dataset : 176910 properties
  2. remove null prices and postal code : 116999
  3. remove postal code that are not from Belgium : 115070

Now it's your turn to test !!

spider