DCHousing is missing total unit counts #495
Hi @NealHumphrey. Thanks for the question. Unless you have all of the buildings accounted for, you won't get the right unit count from MAR or CAMA. We've been recently trying to tackle this problem of multi-building projects in the Preservation Catalog. In a GitHub issue for PresCat, I describe using owner names in the real property data to try to find all of the related parcels and addresses for a project. This may be more than you want to get into here, however. Of your three options, I would probably go for 1. Note that it is not a problem that the total units > assisted units. That's likely to be the case in newer developments, many of which are mixed income and therefore have both assisted and nonassisted units. I hope that helps. Please let me know if you have more questions.
My concern with 1) is that when we have a mixed income project but don't have the actual total units, we would falsely report that it is 100% subsidized. What are your thoughts on that issue? One option is to calculate the "percent_subsidized" field only when we aren't doing that substitution. Perhaps we could use an additional 'estimated_units_tot' field for the filtering in the map view, and then report the various sources of total units side by side until we have better data and can be confident in a way to arrive at the single actual number of units. On a related note - I have been conversing with Open Data about some related data sets to use for calculating the total number of residential units by zone, so that we can report statistics like the percent of units in Ward 1 that are subsidized. They recently released a new dataset of row-by-row condo and rental units. This might provide another pathway to getting total unit counts, though it too will suffer from undercounting in cases where the catalog does not have all of the buildings properly accounted for w/ a list of mar_ids. That data set currently has corrupted address_id fields in the csv download, which I've asked them to fix; when I get a corrected data set I can do a quick comparison like the one you did in the prescat issue to see how far off the numbers are. I'll report back on that.
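A minimal sketch of that idea, assuming the substitution in option 1 means falling back to the assisted unit count when the total is missing. The function names are hypothetical and not part of the existing code base; only the 'estimated_units_tot' and "percent_subsidized" field names come from the comment above.

```python
from typing import Optional


def estimated_units_tot(units_assist: int, units_tot: Optional[int]) -> int:
    """Value for map filtering: the real total if known, else the assisted count.

    Assumption: the fallback is the assisted unit count (option 1 as read here).
    """
    return units_tot if units_tot is not None else units_assist


def percent_subsidized(units_assist: int, units_tot: Optional[int]) -> Optional[float]:
    """Share of assisted units, or None when no real total is available.

    Returning None avoids reporting a substituted total as 100% subsidized.
    """
    if units_tot is None or units_tot <= 0:
        return None
    return units_assist / units_tot
```

With this split, a project with 50 assisted units and no known total would still appear in map filters with an estimated total of 50, while its percent subsidized would stay empty instead of showing 100%.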
It's a fair point. Flagging the total unit count as "unknown" is also a legitimate option. Perhaps that's better than putting up a number that we aren't sure about.
In discussing w/ the maintainer of the data at the MAR, it sounds like the active_address_unit_count field in the MAR can be pretty much trusted to be accurate - but as Peter notes, only in cases where we have actually captured all of the relevant addresses for the project. I am going to move forward with adding an extra field, `proj_units_tot_mar`, that captures the total units as best we know it in the MAR. Then we can use an additional field, `proj_units_tot_recommended`:
When working through this I'll see if there are any other stipulations that make sense to put on this.
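As a rough sketch of how `proj_units_tot_mar` might be derived, the snippet below sums active_address_unit_count over a project's mar_ids. The function signature, the dictionary lookup, and the choice to return None when any address is missing are all assumptions for illustration, not the project's actual implementation.

```python
from typing import Dict, Iterable, Optional


def proj_units_tot_mar(
    mar_ids: Iterable[int],
    unit_counts: Dict[int, int],  # hypothetical lookup: mar_id -> active_address_unit_count
) -> Optional[int]:
    """Sum MAR unit counts across a project's addresses.

    Returns None when the project has no mar_ids or when any mar_id is
    missing from the lookup, since a partial sum would undercount.
    """
    ids = set(mar_ids)  # de-duplicate so no address is counted twice
    if not ids or any(mar_id not in unit_counts for mar_id in ids):
        return None
    return sum(unit_counts[mar_id] for mar_id in ids)
```

This only guards against addresses that are listed but unmatched. It cannot detect buildings that were never linked to the project in the first place, which is the undercounting case raised earlier in the thread.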
The DCHousing data set (from DMPED, available on opendata.dc.gov) contains a "Report_units_affordable" field, which we have mapped to the proj_units_assist fields. However, it does not have data for total units, i.e. proj_units_tot. Therefore, projects added from this data set have missing data for this field. It also so happens that the project with the most subsidized units ("BARRY FARM - ONSITE: 2580 FIRTH STERLING AVENUE SE") comes from this data set. Several user testers were concerned / confused why the maximum total units was larger than the maximum subsidized units.
I have reached out to DMPED re: the data source to see if it is possible to add more fields from their source data, including this field. However, assuming we can't get this data updated from the source, what's the best way to handle this? Options:
@ptatian your input on this would be especially useful. How do you get the parcel mapping in the PresCat? Which of the 3 options would you see as the best path as a user?