LESSON: Collecting Data
Site: | Mountain Heights Academy OER |
Course: | Introductory Statistics Q2 |
Book: | LESSON: Collecting Data |
Printed by: | Guest user |
Date: | Friday, 4 April 2025, 11:56 AM |
Intro to Bootstrapping
In this lesson, we're going to continue our focus on the question:
What is the average starting salary for graduates majoring in petroleum engineering?
In past assignments, we have used sample data to calculate a sample mean that would approximate the population mean through inferential statistics. However, moving forward, we will learn how to use multiple sample means to help us construct an interval of values that would capture the population mean. We can feel more confident that we have captured the true population mean when we state an interval estimate, rather than when we state a single point estimate.
Bootstrapping & Sampling with Replacement
In reality, surveying an entire population typically cannot be done. In the case of surveying graduates to determine their starting salaries, privacy laws would prohibit colleges and universities from supplying researchers with graduates’ contact information. Even if populations can be surveyed, the costs associated with doing so often would be prohibitive. We get our best guesses about characteristics of a population from using a sample randomly selected from the population.
Because we do not anticipate that the sample will match the population exactly, we estimate population characteristics using intervals of values (interval estimates) rather than individual values (point estimates). One method for constructing interval estimates is known as the bootstrapping method. The method’s name comes from the saying to “pull yourself up by your bootstraps” (Cleophas, Zwinderman, Cleophas, & Cleophas, 2009), which refers to using one’s own efforts to get out of a difficult or impossible situation—to make the seemingly impossible become possible. In the case of statistics, the bootstrap method allows us to make estimates for the population through brute force—no formulas necessary!
We are interested in estimating the actual mean starting salary for 2014 petroleum engineering graduates. We could select additional samples of engineers and calculate their mean starting salaries to form an interval estimate for the population mean. However, because sampling from the population can be expensive, we instead use our best estimate for the population—the sample—and use it as if it were the population. We select samples, called bootstrap samples, using the data from our sample, a process called resampling. Because there are a finite number of values in our sample, we use sampling with replacement, meaning that after being selected, each salary is recorded and returned to the collection before the next salary is selected.
Example
In this lesson video you will see how we can use the "bootstrapping" method to help us create more sample groups -- which will in turn help us to create a better interval estimate of the population mean.
It will be very important that you watch this lesson video before attempting your next assignment.
Assignment Help
This video was created to help you know how to get started with your assignment, Bootstrapping & Sampling with Replacement. I recommend watching this video before getting started with your assignment!