Note: All features of wildboottestjlr
are fully integrated into fwildclusterboot.
wildboottestjlr
ports the functionality of the
WildBootTests.jl package
to R via the
JuliaConnectoR
package. At the moment, it supports the following features of
WildBootTests.jl
:
- The wild bootstrap for OLS (Wu 1986).
- The Wild Restricted Efficient bootstrap (WRE) for IV/2SLS/LIML (Davidson and MacKinnon 2010).
- The subcluster bootstrap (MacKinnon and Webb 2018).
- Confidence intervals formed by inverting the test and iteratively searching for bounds.
- Multiway clustering.
- Arbitrary and multiple linear hypotheses in the parameters.
- One-way fixed effects.
The following model objects are currently supported:
- OLS:
lm
(from stats),fixest
(from fixest),felm
from (lfe) - IV:
ivreg
(from ivreg).
In the future, IV methods for fixest
and lfe
will be added.
wildboottestjlr
can be installed by running
library(devtools)
install_github("s3alfisc/wildboottestjlr")
You can install Julia by following the steps described here:
https://julialang.org/downloads/. WildBootTests.jl
can then be
installed via Julia’s package management system.
To install WildBootTests.jl
and Julia from within R, you can use
wildboottestjlr::wildboottestjlr_setup()
. Via
wildboottestjlr_setup()
, you can install Julia and WildBootTests.jl
and connect R and Julia. You simply have to follow the instructions
printed in the console!
library(wildboottestjlr)
wildboottestr_setup()
Similarly, you can set the number of Julia threads by running
julia_set_ntreads()
and following the instructions.
wildboottestjlr's
central function is boottest()
. Beyond few minor
differences, it largely mirrors the boottest()
function from the
fwildclusterboot
package.
library(wildboottestjlr)
# set a 'global' seed in the Julia session
set_julia_seed(rng = 12313452435)
#> <Julia object of type MersenneTwister>
#> MersenneTwister(12313452435)
data(voters)
library(fixest)
library(lfe)
# estimation via lm(), fixest::feols() or lfe::felm()
lm_fit <- lm(proposition_vote ~ treatment + log_income, data = voters)
feols_fit <- feols(proposition_vote ~ treatment + log_income, data = voters)
felm_fit <- felm(proposition_vote ~ treatment + log_income, data = voters)
boot_lm <- boottest(lm_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)
boot_feols <- boottest(feols_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)
boot_felm <- boottest(felm_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)
# summarize results via summary() method
#summary(boot_lm)
# also possible: use msummary() from modelsummary package
library(modelsummary)
msummary(list(boot_lm, boot_feols, boot_felm),
estimate = "{estimate} ({p.value})",
statistic = "[{conf.low}, {conf.high}]"
)
Model 1 | Model 2 | Model 3 | |
---|---|---|---|
1\*treatment = 0 | 0.089 (0.001) | 0.089 (0.001) | 0.089 (0.001) |
\[0.039, 0.142\] | \[0.039, 0.142\] | \[0.039, 0.142\] | |
Num.Obs. | 300 | 300 | 300 |
R2 | 0.045 | 0.045 | 0.045 |
R2 Adj. | 0.039 | 0.039 | 0.039 |
R2 Within | |||
R2 Pseudo | |||
AIC | -1.9 | -3.9 | |
BIC | 12.9 | 7.2 | |
Log.Lik. | 4.950 | 4.950 |
# plot(boot_lm)
If boottest()
is applied based on an object of type ivreg
, the WRE
bootstrap Davidson & MacKinnon
(2010) is
run.
library(ivreg)
data("SchoolingReturns", package = "ivreg")
data <- SchoolingReturns
ivreg_fit <- ivreg(log(wage) ~ education + age + ethnicity + smsa + south + parents14 |
nearcollege + age + ethnicity + smsa + south + parents14,
data = data)
boot_ivreg <- boottest(object = ivreg_fit, B = 999, param = "education", clustid = "fameducation", type = "webb")
summary(boot_ivreg)
#> boottest.ivreg(object = ivreg_fit, clustid = "fameducation",
#> param = "education", B = 999, type = "webb")
#>
#> Hypothesis: 1*education = 0
#> Observations: 3010
#> Bootstr. Iter: 999
#> Bootstr. Type: webb
#> Clustering: 1-way
#> Confidence Sets: 95%
#> Number of Clusters: 9
#>
#> term estimate statistic p.value conf.low conf.high
#> 1 1*education = 0 0.0904587 2.207293 0.01901902 0.01356883 0.2421277
After compilation, wildboottestjlr
is orders of magnitude faster than
fwildclusterboot
, in particular when the number of clusters N_G and
the number of bootstrap iterations B get large.
The benchmarks plot the median value of 3 runs of a linear regression with N = 10.000 and k = 21.