Inference based on cluster-robust standard errors in linear regression models, using either the Student's t-distribution or the wild cluster bootstrap, is known to fail when the number of treated clusters is very small. We propose a family of new procedures called the subcluster wild bootstrap, which includes the ordinary wild bootstrap as a limiting case. In the case of pure treatment models, where all observations within clusters are either treated or not, the latter procedure can work remarkably well. The key requirement is that all cluster sizes, regardless of treatment, should be similar. Unfortunately, the analogue of this requirement is not likely to hold for difference-in-differences regressions. Our theoretical results are supported by extensive simulations and an empirical example.

Cluster-robust variance estimator, Clustered data, Difference-in-differences, Grouped data, Robust inference, Subclustering, Treatment model, Wild bootstrap, Wild cluster bootstrap
dx.doi.org/10.1111/ectj.12107
Econometrics Journal
Department of Economics

MacKinnon, J.G. (James G.), & Webb, M. (2018). The wild bootstrap for few (treated) clusters. Econometrics Journal, 21(2), 114–135. doi:10.1111/ectj.12107