Validate `get_variance()` against remaining families #889

strengejacke · 2024-06-14T09:51:42Z

As a follow-up of #877

Following families need validation or don't yet work:

betabinomial
hurdle models
zero-inflated models
glmmTMB::compois
glmmTMB::genpois
glmmTMB::lognormal - these models have low fixed effects variance, leading to R2 close to 0. Calculation needs revision here?

Tagging @bbolker FYI (also tagging @bwiernik )

How could we possibly validate genpois and compois families? Are there any related families that would return a similar R2 so we have a reference for validating the code?
How to calculate the distribution-specific variance for zero-inflated models? (or at least: what's more accurate, see Test get_variance for zero-inflation models #893 (comment)
Is it expected that the R2 is lower for zero-inflated models, when the full model (zero-inflation and conditional part) is taken into account (as opposed to the conditional component of zero-inflated models only, see following example)?

Regarding Zero-Inflation models

Formerly the dispersion parameter for Poisson and ZI Poisson was set to 1. Now, the behaviour for ZI Poisson only has changed, returning a different dispersion / variance:

# For zero-inflated poisson models, the
# distributional variance is based on Zuur et al. 2012
# ----------------------------------------------
.variance_zip <- function(model, faminfo, family_var) {
  if (inherits(model, "glmmTMB")) {
    p <- stats::predict(model, type = "zprob")
    mu <- stats::predict(model, type = "conditional")
    pvar <- (1 - p) * (mu + p * mu^2)
  } else if (inherits(model, "MixMod")) {
    p <- stats::plogis(stats::predict(model, type_pred = "link", type = "zero_part"))
    mu <- suppressWarnings(stats::predict(model, type = "mean_subject"))
    pvar <- (1 - p) * (mu + p * mu^2)
  } else {
    pvar <- family_var
  }

  mean(pvar)
}

Taking following model, for this particular example, this comes closer to a Bayesian model than setting sigma/dispersion to 1 (for zero-inflation models!)

m <- glmmTMB::glmmTMB(count ~ mined + cover + (1 + cover | site),
  ziformula = ~mined,
  family = poisson(), data = Salamanders
)
performance::r2_nakagawa(m)

Formerly with sigma/dispersion = 1

# R2 for Mixed Models

  Conditional R2: 0.650
     Marginal R2: 0.525

Now

# R2 for Mixed Models

  Conditional R2: 0.414
     Marginal R2: 0.334

brms returns (marginal R2 only)

library(brms)
m2 <- brms::brm(bf(count ~ mined + (1 | site), zi ~mined),
  family = brms::zero_inflated_poisson(), data = Salamanders, backend = "rstan"
)
brms::bayes_R2(m2)
    Estimate  Est.Error      Q2.5     Q97.5
R2 0.1686378 0.01874732 0.1334217 0.2070608

Any ideas how to validate the results? @bbolker @bwiernik ?

strengejacke · 2024-06-14T09:56:17Z

We have following deviation to MuMIn:

Computation of fixed effects variance: warnings/approximation of observation-level variance in .get_variance_distributional #877 (comment)
Computation of distribution-specific variance for negative binomial models (negbin/nbinom1): warnings/approximation of observation-level variance in .get_variance_distributional #877 (comment)
Computation of Sigma for glmmTMB: warnings/approximation of observation-level variance in .get_variance_distributional #877 (comment)

However, the things we do differently are in line with the code in the Supplement 2 of Nakagawa. Thus, I would say we go with the current implementation of get_variance(). I added lots of tests to validate against external evidence of correct results.

bbolker · 2024-06-14T13:43:53Z

Presumably it would be

formula = count ~ 1 + (1 | persons),
 ziformula =    ~1 + (1 | persons)

?

strengejacke mentioned this issue Jun 14, 2024

warnings/approximation of observation-level variance in .get_variance_distributional #877

Closed

strengejacke added the get_variance function specific labels label Jun 14, 2024

This comment was marked as outdated.

Sign in to view

This was referenced Jun 14, 2024

null_model() for zero-inflated models. #891

Closed

Test get_variance for zero-inflation models #893

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validate `get_variance()` against remaining families #889

Validate `get_variance()` against remaining families #889

strengejacke commented Jun 14, 2024 •

edited

Loading

strengejacke commented Jun 14, 2024

This comment was marked as outdated.

bbolker commented Jun 14, 2024

Validate get_variance() against remaining families #889

Validate get_variance() against remaining families #889

Comments

strengejacke commented Jun 14, 2024 • edited Loading

Regarding Zero-Inflation models

strengejacke commented Jun 14, 2024

This comment was marked as outdated.

bbolker commented Jun 14, 2024

Validate `get_variance()` against remaining families #889

Validate `get_variance()` against remaining families #889

strengejacke commented Jun 14, 2024 •

edited

Loading