not creepy
This spider crawls a list of doctors from SelectHealth's Find a Doctor search page and generates a CSV for easy viewing.
This scraper is built in python. Is this your first time?
- You have python installed
- You have considered creating a virtual environment
- You have installed requirements using
pip install -r requirements.txt
- Navigate to https://selecthealth.org/find-a-doctor
- Fill in the search parameters you desire and execute a search.
- Scroll to the bottom of the page to load all search results (hold down arrow, etc). You will see "No more results"
- Right click in whitespace, "Save As" Complete webpage.
- Open the saved html file in your browser
- Replace your
file://
url intostart_urls
array in this script. - Execute script with
scrapy runspider doctor_spider.py -o file-to-output.json
- Patience...
- Convert the json into a flattened csv format with this super rad tool https://www.convertcsv.com/json-to-csv.htm
- Open CSV and enjoy browsing doctors at a glance in a spreadsheet.
N.B. I did not make it execute the search automatically, so generating the list of search results locally is required. Sorry about that, I couldn't figure out the infinite scroll functionality in the time I had.
Pull requests are welcome!