-
-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is it valid to aggregate taxcalc data to households? #1961
Comments
Upfront, I'm not sure it's valid to reaggregate to the household level. The All that said, an alternative approach would be to reverse engineer the households only insofar as the underlying tax units share a lowest common weight. Let's say that you use the CPS to isolate two tax units in the Together, these records represent 1,500 tax units. So far, so good. My interpretation of your approach is that you would convert these two records into a single household with weight 750. The alternative would be to recognize that the underlying methodology has judged that one of these units, 1, is more common than the other. So instead, create a household record made up of units 1 & 2 that has a weight of 500 (the minimum weight of all the units in the CPS-matched household). Then create an additional household record made up solely of Unit 1 that captures the residual 500 weight you have left over. Ultimately then, the number of households you create in In this example, you'd be creating 1,000 households across two records, rather than 750 under just one. |
Ah, I figured it would generate more. We’ll see what the NYT folks say, but I’ll bet that “household” in their piece was just a journalistic term of art and they really performed the analysis at the tax unit level.
|
Yes, Ben Casselman confirmed that it was actually at the tax unit level.
…On Sat, Apr 21, 2018, 11:44 AM evtedeschi3 ***@***.***> wrote:
Ah, I figured it would generate more. We’ll see what the NYT folks say,
but I’ll bet that “household” in their piece was just a journalistic term
of art and they really performed the analysis at the tax unit level.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#1961 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AFy2zyp_E1JsdPVqNO04lrA2kzoJahjqks5tq34IgaJpZM4TLpfj>
.
|
The last comment in issue #1961 was made on April 21, about three and a half months ago. @MattHJensen @MaxGhenis @evtedeschi3 |
It seems like the answer to the question is no, so closing. If anyone thinks of other ideas, please share here as I continue to be interested. |
I'm looking to aggregate taxcalc data to the household level using
h_seq
, and am having trouble thinking through what the appropriate weight would be for each household record. It seems to me that the variance ins006
across tax units within a household means that there's no single weight that can be assigned to households to produce the same aggregates as tax unit totals. Instead the ideal way might be linear programming to match tax-unit-level totals, or administrative totals directly.In this notebook I tried two other simple approaches that I'd expect would produce similar ballpark weights:
Assigning a household's weight as the sum of its tax units'
s006
, divided by the number of tax units.Assigning different weights for households for different variables, e.g. for
XTOT
it would be the sum ofXTOT * s006
across its tax units, divided by the totalXTOT
for the household. I also do this forexpanded_income
, and it could be done for any summable variable. This is really a generalization of (1), where (1) does this for a count.Each of these approaches ((1), (2) for
XTOT
, and (2) forexpanded_income
) yields similar total household counts of between 51.5M and 52.9M (2017 data). This is ~60% below Census's 2017 estimate of 126M households. Breaking down by bothh_seq
andffpos
yields ~68M.Is this discrepancy due to top-coding? Any other ideas on doing this?
Related: did the NYT TCJA calculator analyze by tax unit or somehow aggregate to households? They mention households several times.
The text was updated successfully, but these errors were encountered: