Weighting schemes #2

Open
cdowd opened this issue Dec 10, 2019 · 3 comments

Comments

@cdowd
Owner

cdowd commented Dec 10, 2019

Should be able to incorporate observation weights with ease. This has been requested by several users.

@cdowd
Owner Author

cdowd commented Jan 13, 2020

Okay, so weights on observations are a bit funky actually. For the ECDF the implication is easy: weights merely adjust the height of each step. I already build that weight vector anyhow, so it would be easy to adjust.
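
To make the "adjusts heights" point concrete, here is a minimal Python sketch of a weighted ECDF. It is an illustration only, not the package's implementation; the function name `weighted_ecdf` and its arguments are hypothetical. Each observation contributes a step whose height is its share of the total weight, and omitting the weights recovers the usual 1/n steps.

```python
from bisect import bisect_right
from itertools import accumulate


def weighted_ecdf(values, weights=None):
    """Return F, where F(x) is the share of total weight on observations <= x.

    Hypothetical helper for illustration; weights are assumed non-negative
    with a positive sum.
    """
    if weights is None:
        weights = [1.0] * len(values)  # unweighted case: every step is 1/n
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if any(w < 0 for w in weights) or sum(weights) <= 0:
        raise ValueError("weights must be non-negative with a positive sum")

    pairs = sorted(zip(values, weights))          # order by observed value
    xs = [v for v, _ in pairs]
    cum = list(accumulate(w for _, w in pairs))   # running weight total
    total = cum[-1]

    def F(x):
        k = bisect_right(xs, x)                   # number of observations <= x
        return cum[k - 1] / total if k else 0.0

    return F


F = weighted_ecdf([1, 2, 3], weights=[1, 1, 2])
print(F(1), F(2), F(3))
```

With weights `[1, 1, 2]` the last observation carries half of the total weight, so the final step is twice as tall as the first two.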

But the weights may also affect the resampling probabilities. The chance that I observed a given individual affects the sampling variation associated with drawing them, while the portion of the larger population they represent affects the ECDF height. Survey designs are capable of tying or splitting those values. Typically these weights are related (though they don't need to be), so you need to build in two different weight options, set a reasonable default, and explain which is which in a clear, consistent, and correct manner.

The good news (I think) is that only the ECDF height weights need to be passed down to the C++ code; the other weights only affect the sampling level.
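
A rough Python sketch of how the two kinds of weight could be kept apart. Everything here is an assumption made for illustration: the names `one_resample`, `height_weights`, and `sampling_weights` are hypothetical, the uniform defaults are just one possible choice, and with-replacement resampling is used only because it is simple to show, not because it is what the package does. The point is that the sampling weights only decide which observations are drawn, while each drawn observation carries its height weight along for the ECDF step.

```python
import random


def one_resample(values, height_weights=None, sampling_weights=None, rng=None):
    """Draw one with-replacement resample (hypothetical sketch).

    sampling_weights: relative chance of drawing each observation.
    height_weights:   population share each observation represents; these
                      travel with the observation and are the only weights
                      the ECDF code would need to see.
    """
    n = len(values)
    rng = rng or random.Random()
    if height_weights is None:
        height_weights = [1.0] * n      # assumed default: equal ECDF steps
    if sampling_weights is None:
        sampling_weights = [1.0] * n    # assumed default: uniform resampling
    idx = rng.choices(range(n), weights=sampling_weights, k=n)
    return [values[i] for i in idx], [height_weights[i] for i in idx]


def ecdf_at(x, values, height_weights):
    """Weighted ECDF evaluated at x: share of height weight on values <= x."""
    total = sum(height_weights)
    return sum(w for v, w in zip(values, height_weights) if v <= x) / total


vals, hw = one_resample([10, 20, 30], height_weights=[1, 2, 3],
                        sampling_weights=[1, 1, 2], rng=random.Random(0))
print(vals, hw, ecdf_at(20, vals, hw))
```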

@cdowd
Owner Author

cdowd commented Jun 13, 2022

Worse: it seems likely that sampling weights break the exchangeability assumption.

It's easy enough to build each ECDF. It's much harder to figure out what the null hypothesis about the ECDFs is.

E.g. suppose the samples come from two different surveys... what then?

@luisvalenzuelar

But if we are comparing groups within a single sample, for example male vs. female in employment data, is the exchangeability assumption then OK?

Maybe you could implement weights and let users know that it only works when the weights come from the same sampling framework?
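
For discussion, a minimal Python sketch of what the within-sample case might look like. It assumes a weighted Kolmogorov-Smirnov-style statistic, 0/1 group labels, and strictly positive weights; the names `weighted_ks` and `permutation_pvalue` are hypothetical. Only the group labels are shuffled, so each observation keeps its own weight. This does not settle whether exchangeability actually holds under a given survey design; it only shows the mechanics.

```python
import random


def weighted_ks(values, weights, labels):
    """Largest gap between the two groups' weighted ECDFs (labels are 0 or 1)."""
    totals = [0.0, 0.0]
    for w, g in zip(weights, labels):
        totals[g] += w
    if min(totals) <= 0:
        raise ValueError("each group needs positive total weight")
    rows = sorted(zip(values, weights, labels))
    cum = [0.0, 0.0]
    gap = 0.0
    for i, (v, w, g) in enumerate(rows):
        cum[g] += w
        # Compare the ECDFs only after the last observation of a tied value.
        if i + 1 == len(rows) or rows[i + 1][0] != v:
            gap = max(gap, abs(cum[0] / totals[0] - cum[1] / totals[1]))
    return gap


def permutation_pvalue(values, weights, labels, n_perm=999, seed=None):
    """Permute group labels only; (value, weight) pairs stay together."""
    rng = random.Random(seed)
    observed = weighted_ks(values, weights, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if weighted_ks(values, weights, shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one correction keeps p above zero


values = [1.2, 2.5, 3.1, 4.8, 5.0, 6.3]
weights = [1.0, 2.0, 1.5, 1.0, 0.5, 2.0]
labels = [0, 0, 0, 1, 1, 1]
print(weighted_ks(values, weights, labels),
      permutation_pvalue(values, weights, labels, n_perm=199, seed=1))
```

Keeping the weight attached to the observation rather than to the label is the design choice that corresponds to "weights come from the same sampling framework": under the null, a person's weight is a property of how they were sampled, not of which group they belong to.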
