You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In simulate-methods.R in gratia, for ocat models, predict(type = "response") currently returns the probability of each category. However, the fix.family.rd function expects a vector of linear predictor values instead. This discrepancy leads to incorrect handling in the simulation.gam output for ocat.
I've pasted the mgcv::fix.family.rd function for the ocat family here for reference:
function (mu, wt, scale)
{
theta<- get(".Theta")
R= length(theta) +2alpha<- rep(0, R+1)
alpha[1] <--Infalpha[R+1] <-Infalpha[2] <--1if (R>2) {
ind<-3:R# looks like alpha is the vector of cut points along logistic distributionalpha[ind] <-alpha[2] + cumsum(exp(theta))
}
y<-u<- runif(length(mu))
# mu should be on the same scale as logit(u)?u<-mu+ log(u/(1-u))
for (iin1:R) {
y[u>alpha[i] &u<=alpha[i+1]] <-i
}
y
}
In this example, sim is a $4 \times 1$ matrix because fix.family.rd interpreted the output of predict(type = "response"), which is Pr(y = 1, 2, 3, 4), as linear predictor values, leading to unexpected results. I'm fairly sure this is a bug. If you agree, let me know how you'd like to fix it and I can open a PR. Should predict(type = "link") be used in simulate.gam instead?
Thank you so much for teaching me about gam and {mgcv}.
The text was updated successfully, but these errors were encountered:
Thanks for the report. This is one of those unintentional things; I didn't know the ocat() had an $rd component. These non-standard families need specialist support and that's something I hadn't gotten to yet (my recollection was that none of them had $rd in the family).
This is a bug (I should probably check to see if a particular family is supported and throw an error for those that don't work). And yes, the fix will be to pass in a linear predictor value for each observation we're simulating for. I don't think the fix is that trivial however; I will need to handle all the other non-standard families and some of those will require different handling to that needed for ocat(). I have begun to tackle this in fitted_values(), where I have a list of standard families that just work normally and then other families that need special handling through a pre- or post-processing step. This is how I'm handling ocat() in fitted_values() for example. I'll need to think how best to do this in simulate() and also try to not duplicate code as the details of all these parameterisations for all the families makes my head hurt from time to time.
Oh, I’d be happy to help! To be honest, I'm still getting familiar with posterior simulations on a theoretical level, so working with your implementations in gratia is a great opportunity for me to learn. Nevertheless, I'll go through your vignette more carefully, review the fitted_values code, and see what might be the best approach. In the meantime, if you have an idea and would like help implementing it, just let me know!
In simulate-methods.R in gratia, for
ocat
models,predict(type = "response")
currently returns the probability of each category. However, thefix.family.rd
function expects a vector of linear predictor values instead. This discrepancy leads to incorrect handling in thesimulation.gam
output forocat
.I've pasted the
mgcv::fix.family.rd
function for theocat
family here for reference:Reproducible Example
In this example,$4 \times 1$ matrix because
sim
is afix.family.rd
interpreted the output ofpredict(type = "response")
, which isPr(y = 1, 2, 3, 4)
, as linear predictor values, leading to unexpected results. I'm fairly sure this is a bug. If you agree, let me know how you'd like to fix it and I can open a PR. Shouldpredict(type = "link")
be used insimulate.gam
instead?Thank you so much for teaching me about
gam
and{mgcv}
.The text was updated successfully, but these errors were encountered: