Skip to content
This repository has been archived by the owner on Jun 6, 2022. It is now read-only.

Latest commit

 

History

History
316 lines (276 loc) · 8.4 KB

README.md

File metadata and controls

316 lines (276 loc) · 8.4 KB

Lifecycle: experimental

R-CMD-check

Note: All features of wildboottestjlr are fully integrated into fwildclusterboot.

wildboottestjlr

wildboottestjlr ports the functionality of the WildBootTests.jl package to R via the JuliaConnectoR package. At the moment, it supports the following features of WildBootTests.jl:

  • The wild bootstrap for OLS (Wu 1986).
  • The Wild Restricted Efficient bootstrap (WRE) for IV/2SLS/LIML (Davidson and MacKinnon 2010).
  • The subcluster bootstrap (MacKinnon and Webb 2018).
  • Confidence intervals formed by inverting the test and iteratively searching for bounds.
  • Multiway clustering.
  • Arbitrary and multiple linear hypotheses in the parameters.
  • One-way fixed effects.

The following model objects are currently supported:

  • OLS: lm (from stats), fixest (from fixest), felm from (lfe)
  • IV: ivreg (from ivreg).

In the future, IV methods for fixest and lfe will be added.

Installation / Getting Started

wildboottestjlr can be installed by running

library(devtools)
install_github("s3alfisc/wildboottestjlr")

You can install Julia by following the steps described here: https://julialang.org/downloads/. WildBootTests.jl can then be installed via Julia’s package management system.

To install WildBootTests.jl and Julia from within R, you can use wildboottestjlr::wildboottestjlr_setup(). Via wildboottestjlr_setup(), you can install Julia and WildBootTests.jl and connect R and Julia. You simply have to follow the instructions printed in the console!

library(wildboottestjlr)
wildboottestr_setup()

Similarly, you can set the number of Julia threads by running

julia_set_ntreads()

and following the instructions.

Example

wildboottestjlr's central function is boottest(). Beyond few minor differences, it largely mirrors the boottest() function from the fwildclusterboot package.

The Wild Bootstrap for OLS

library(wildboottestjlr)
# set a 'global' seed in the Julia session
set_julia_seed(rng = 12313452435)
#> <Julia object of type MersenneTwister>
#> MersenneTwister(12313452435)

data(voters)
library(fixest)
library(lfe)

# estimation via lm(), fixest::feols() or lfe::felm()
lm_fit <- lm(proposition_vote ~ treatment  + log_income, data = voters)
feols_fit <- feols(proposition_vote ~ treatment  + log_income, data = voters)
felm_fit <- felm(proposition_vote ~ treatment  + log_income, data = voters)

boot_lm <- boottest(lm_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)
boot_feols <- boottest(feols_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)
boot_felm <- boottest(felm_fit, clustid = "group_id1", B = 999, param = "treatment", rng = 7651427)

# summarize results via summary() method
#summary(boot_lm)

# also possible: use msummary() from modelsummary package
library(modelsummary)
msummary(list(boot_lm, boot_feols, boot_felm), 
        estimate = "{estimate} ({p.value})", 
        statistic = "[{conf.low}, {conf.high}]"
        )  
Model 1 Model 2 Model 3
1\*treatment = 0 0.089 (0.001) 0.089 (0.001) 0.089 (0.001)
\[0.039, 0.142\] \[0.039, 0.142\] \[0.039, 0.142\]
Num.Obs. 300 300 300
R2 0.045 0.045 0.045
R2 Adj. 0.039 0.039 0.039
R2 Within
R2 Pseudo
AIC -1.9 -3.9
BIC 12.9 7.2
Log.Lik. 4.950 4.950
# plot(boot_lm)

The Wild Bootstrap for IV (WRE)

If boottest() is applied based on an object of type ivreg, the WRE bootstrap Davidson & MacKinnon (2010) is run.

library(ivreg)
data("SchoolingReturns", package = "ivreg")
data <- SchoolingReturns

ivreg_fit <- ivreg(log(wage) ~ education + age + ethnicity + smsa + south + parents14 |
                  nearcollege + age  + ethnicity + smsa + south + parents14,
                data = data)

boot_ivreg <- boottest(object = ivreg_fit, B = 999, param = "education", clustid = "fameducation", type = "webb")

summary(boot_ivreg)
#> boottest.ivreg(object = ivreg_fit, clustid = "fameducation", 
#>     param = "education", B = 999, type = "webb")
#>  
#>  Hypothesis: 1*education = 0
#>  Observations: 3010
#>  Bootstr. Iter: 999
#>  Bootstr. Type: webb
#>  Clustering: 1-way
#>  Confidence Sets: 95%
#>  Number of Clusters: 9
#> 
#>              term  estimate statistic    p.value   conf.low conf.high
#> 1 1*education = 0 0.0904587  2.207293 0.01901902 0.01356883 0.2421277

Benchmarks

After compilation, wildboottestjlr is orders of magnitude faster than fwildclusterboot, in particular when the number of clusters N_G and the number of bootstrap iterations B get large.

The benchmarks plot the median value of 3 runs of a linear regression with N = 10.000 and k = 21.