Add nlihc_id using multiple addresses #507

Open
NealHumphrey opened this issue Aug 30, 2017 · 2 comments

@NealHumphrey
Collaborator

We now have a one-to-many address table for projects. Linked data (TOPA notices, tax assessments, etc.) now needs nlihc_id added to its records by matching on all available mar ids for a project, instead of just the one on the project table.
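
To illustrate the idea, here is a minimal sketch of the lookup, assuming the address table can be read as rows with `mar_id` and `nlihc_id` keys. Those key names, the sample ids, and both function names are placeholders for illustration, not the actual schema or code in the repository.

```python
from collections import defaultdict


def build_mar_index(address_rows):
    """Map each mar_id to the set of nlihc_ids that list it (one-to-many table)."""
    index = defaultdict(set)
    for row in address_rows:
        index[row["mar_id"]].add(row["nlihc_id"])
    return index


def add_nlihc_id(records, mar_index):
    """Return copies of the records with nlihc_id filled in.

    A record whose mar_id matches no project gets nlihc_id=None; a record
    matching several projects yields one output row per project.
    """
    linked = []
    for rec in records:
        matches = sorted(mar_index.get(rec["mar_id"], ()))
        if not matches:
            linked.append({**rec, "nlihc_id": None})
        for nlihc_id in matches:
            linked.append({**rec, "nlihc_id": nlihc_id})
    return linked


# Hypothetical sample data: one project with two addresses.
project_addresses = [
    {"nlihc_id": "NL000001", "mar_id": 101},
    {"nlihc_id": "NL000001", "mar_id": 102},
]
linked_records = add_nlihc_id(
    [{"record": "a", "mar_id": 102}, {"record": "b", "mar_id": 999}],
    build_mar_index(project_addresses),
)
```

Record "a" would have been missed if only the first address (mar_id 101) lived on the project table; with the index built from the full address table it is linked.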

@NealHumphrey
Collaborator Author

@ptatian In the Rcasd files you've shared, which contain the TOPA notices, you've already done some geocoding of the records. Can you help clarify what an individual record is in this data set and how best to link this to the projects in our database?

Digging into a few examples, it looks like there is one row for each unique combination of address + notice. If one notice has multiple addresses, the Nidc_rcasd_id will be duplicated for each address. If one address has multiple notices, there will be a separate record for each Nidc_rcasd_id. I'm assuming you've taken the original source data, which had one notice per row, and split each of those rows into multiple rows if multiple addresses were included. Finally, if the address range was written differently in two TOPA notices, the two resulting sets of rows could differ.

Is that all accurate? Is there anything I missed?

The case of "2301 Good Hope Court SE" in the address column appears to bear this out. It shows up in two TOPA notices (2017-00012 and 2017-00155). In the first TOPA notice the Orig_address is 2301-2323 Good Hope Court SE, resulting in 12 unique addresses; in the second it is 2301-2327 Good Hope Court SE, resulting in 14 unique addresses.
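
One consequence for linking: since the rows are at the address + notice level, a project matched on several addresses would see the same notice repeated. A rough sketch of collapsing back to one link per notice and project follows. It assumes the notice numbers above are the Nidc_rcasd_id values, and the `nlihc_id` values and the function name are made up for illustration.

```python
def distinct_notice_links(rows):
    """Collapse address-level rows to unique (Nidc_rcasd_id, nlihc_id) pairs.

    Rows without a matched nlihc_id are skipped; first-seen order is kept.
    """
    seen = set()
    links = []
    for row in rows:
        key = (row["Nidc_rcasd_id"], row["nlihc_id"])
        if row["nlihc_id"] is not None and key not in seen:
            seen.add(key)
            links.append(key)
    return links


# Hypothetical rows: one per address + notice, nlihc_id already attached.
rows = [
    {"Nidc_rcasd_id": "2017-00012", "address": "2301 Good Hope Court SE", "nlihc_id": "NL000001"},
    {"Nidc_rcasd_id": "2017-00012", "address": "2303 Good Hope Court SE", "nlihc_id": "NL000001"},
    {"Nidc_rcasd_id": "2017-00155", "address": "2301 Good Hope Court SE", "nlihc_id": "NL000001"},
    {"Nidc_rcasd_id": "2017-00155", "address": "2327 Good Hope Court SE", "nlihc_id": None},
]
notice_links = distinct_notice_links(rows)
```

Here the four address-level rows reduce to two notice-level links, one per notice.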

@ptatian
Collaborator

ptatian commented Aug 30, 2017

Yes, you are correct in all your assumptions. There is one unique record for each combination of address and notice. We parse out the address ranges into individual addresses. We don't make any real attempt to clean the address ranges -- we take them "as is."
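
For readers following along, here is a small sketch of what parsing a range into individual addresses could look like. The actual parsing code is not shown in this thread, so this is an assumption: it steps by 2 (same side of the street), which is consistent with the 12 and 14 addresses counted above, and it returns anything it cannot parse unchanged. The function name and regex are illustrative only.

```python
import re

# Matches "2301-2323 Good Hope Court SE": low number, high number, street.
RANGE_RE = re.compile(r"^(\d+)\s*-\s*(\d+)\s+(.+)$")


def expand_address_range(orig_address):
    """Expand 'LOW-HIGH Street' into individual addresses, stepping by 2.

    Inputs that are not a range, or whose bounds are reversed or of mixed
    parity, are returned unchanged as a single-item list.
    """
    text = orig_address.strip()
    match = RANGE_RE.match(text)
    if match is None:
        return [text]
    low, high, street = int(match.group(1)), int(match.group(2)), match.group(3)
    if high < low or (high - low) % 2 != 0:
        return [text]
    return [f"{number} {street}" for number in range(low, high + 2, 2)]


first = expand_address_range("2301-2323 Good Hope Court SE")
second = expand_address_range("2301-2327 Good Hope Court SE")
only_in_second = sorted(set(second) - set(first))
```

With these two inputs the first notice expands to 12 addresses and the second to 14; the two extra ones are 2325 and 2327 Good Hope Court SE.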
