Boostrapping is a non-parametric approach for computing statistical estimates using a resampling approach as the basis for estimating variance in the data. The idea is to take the sample data and treat it as your “population” from which you take repeated samples. Then compute the statistic you’re interested in from each sample and then look at the distribution of that statistic across all of the repeated samples. Here is the basic workflow:
NOTE: There are multiple approaches for computing this 95% confidence interval for the bootstrapped estimate, so be sure to read the documentation for the method and software you choose.
SAS
R
We are still working with the HELP (Health Evaluation and Linkage to Primary Care) dataset. See details at https://melindahiggins2000.github.io/N736Fall2017_lesson07/lesson07_univStats.html.
Run examples shown in lesson09_Rcode.R
Run examples shown in lesson09_SAScode.sas