Section 8.1 Statistics and Sampling Variability Introduction Suppose I want to know the average GPA of seniors at Glacier Peak. I could track down every senior at GP and ask them their GPA, this would give me the true mean of the population OR I could take a sample The Foundation of Chapter 8 Suppose that x is a discrete random variable that is equal to an individual’s score on the 2010 AP Statistics Exam. As a first year AP stats teacher, I am interested in the mean score of all students who took the exam in 2010 so that I may set realistic expectations for myself and my students for this year. What is the population that I am interested in?_____________________ What statistic am I interested in? __________________ The Foundation of Chapter 8 Let let µ denote the true mean score of all students who took the AP Stats exam in 2010. To learn something about µ, I might obtain a sample of 50 students and determine the mean score from the sample. This sample may produce a mean of 3.01. So, x = 3.01. How close is this mean to the true population mean, µ? If I selected another sample of 50 students and computed the mean score, would this second x be near 3.01 or would it be quite different? If I repeated this process many, many times and plotted the resulting means, I would create a sampling distribution of x. This would give me an idea about the long run behavior of the sample mean and could help me determine the true population mean, µ. The Foundation of Chapter 8 Questions regarding the repeated sampling can be addressed by studying what is called the sampling distribution of Just as the distribution of a numerical variable describes its long-run behavior, the sampling distribution of provides x information about the long-run behavior of when sample after sample is selected. x x I can obtain information about a population characteristic by selecting a sample. Sample Mean: Population Mean: Often different from one another, and rarely actual values from the data set Basic Terms Any quantity computed from values in a sample is called a statistic (x, s, p, etc.) Any quantity computed from values in a population is called a population characteristic or parameter (, , ) Sampling Variability The observed value of a statistic depends on the particular sample selected from the population. Typically, the value of the statistic varies from sample to sample. This variability is called sampling variability. Constructing a Sampling Distribution Suppose I want to take a random sample of size n = 50 from the population of all students who took the 2009 AP Stats exam. There are many, many different possible samples that might result. We now define a hypothetical population, which consists of all the different possible samples of size n = 50. This is called a population of samples. The population of samples is viewed as a population because it consists of every different sample; it is a complete collection of all possible samples. Constructing a Sampling Distribution Just as a variable associates a value with every individual in the population and can be described by its distribution, a statistic associates a value with each individual sample in the population of samples. Therefore, a statistic can also be described by a distribution. The distribution of a statistic is called its sampling distribution. Ex: For 4 students, the number of siblings they each have is 0, 1, 3, 4. Select samples of size n=2 and find the average number of siblings in each sample. Obtain the sampling distribution of x. Sample of Size 2 x Probability (x ) Ex: From the previous example, determine the x , the mean value of x . Ex: Consider a population consisting of the following five values, which represent the number of DVD rentals from the Red Box at Fred Meyer for a given month of five families. The values are: 8, 14, 16, 10, 11. Compute the mean of this population. Notice that the 5 slips of paper you created for the warm up correspond to these values. Turn your paper over and randomly select a sample of size n = 2 and compute the mean of your sample. Come up and record your sample and mean on the following table. Copy the values in the table. Ex: Record your sample and mean in the table below. Sample Mean Sample Mean Sample Mean Ex: Construct a histogram for the sample mean data in the table. Ex: Consider the density histogram for the sample mean data in the previous slide. Are most of the values of x near the population mean? Do the x values differ a lot from sample to sample or do they tend to be similar? Homework Page 409 1,2,3,4a,7,10 Read pages 411-420