Reads PDFs from Baltimore's archive of minutes from the Board of Estimates and places the data in a searchable table.
This is where the project overview will go
Follow the steps below to recreate the development environment necessary to start contributing to this project
- Python version 3.6 or later
In order to check which version of python you have installed, run the following command in your command line (for Mac/Linux)
NOTE: in all of the code blocks below, lines preceded with
$
indicate commands you should enter in your command line (excluding the$
itself), while lines preceded with>
indicate the expected output from the previous command.
$ python --version
The output should look something like this:
> Python 3.7.7
If you don't have Python version 3.6 or later installed on your computer. Consider using pyenv to install and manage multiple versions of Python concurrently.
- Fork the repo -- for more information about forking, reference this guide
- Clone your forked repo on your local machine:
$ git clone https://github.com/YOUR_USERNAME/BOE_tabulator.git
- Move into the directory created when you cloned the repo:
$ cd BOE_tabulator/
- Configure the upstream repository for your local fork:
$ git remote add upstream https://github.com/department-of-general-services/BOE_tabulator.git
- Confirm that the upstream repo has been set correctly
$ git remote -v > origin https://github.com/YOUR_USERNAME/BOE_tabulator.git (fetch) > origin https://github.com/YOUR_USERNAME/BOE_tabulator.git (push) > upstream https://github.com/department-of-general-services/BOE_tabulator.git (fetch) > upstream https://github.com/department-of-general-services/BOE_tabulator.git (push)
- Create a new virtual environment in your local directory
$ python -m venv env
- Activate your virtual environment
$ source env/bin/activate
- Install necessary python packages
$ pip install -r requirements.txt
- Run the tests and make sure everything passes
$ pytest > =============== XX passed in XXs ===============
- Open up jupyter notebooks
$ jupyter notebook
- Open the
tabulator.ipynb
from the directory in the browser - Run each cell of the notebook
NOTE: Running the notebook may take 10-20 minutes the first time as you download all of the pdf files from the Comptroller webpage
This is where we will specify how to use the tool
James will add details around the workflow for contributing