Statistics 101: Section L - Laboratory 8 Drawing samples from populations requires some method for random selection. In three group activities today you will use different methods to randomly select a sample from a population. In the process you will also learn something about the variability introduced by random sampling. 1. Group Activity 1: The first population that we will sample from is the U.S. Senate (1994) edition. There are 100 members. Some of the characteristics of the Senators that may be of interest are: name sex party state years = = = = = Senator’s last name M for male, F for female D for Democrat, R for Republican two letter state abbreviation years of service The basic idea of random sampling from a population is to put all of the population members into a bag, mix them up and draw from the bag. We can’t throw all the Senators into a bag and mix them up, but we can put laminated slips of paper, one for each Senator, into a bag and use them to represent the Senators. Use a bag containing 100 laminated slips to perform the sampling described below and answer the questions. (a) Choose a sample of 10 U.S. Senators. Record the Senator’s name, sex, party and years of service on the group answer sheet. Also record the proportion of males, the proportion of Democrats and the mean years of service for the sample. (b) Repeat part (a) to obtain two more samples of 10. Be sure to return the 10 sampled slips to the bag and mix thoroughly before drawing each sample of 10. Again record the proportion of males, proportion of Democrats and sample mean years of service. (c) Did the three samples produce the same summary values, i.e. proportions and means? Why? (d) In the 1994 Senate there are 93 males and 7 females, 54 Democrats and 46 Republicans, and the mean years of service of 12.4 years. Which sample comes closest to matching the true proportions in the population? (e) Since most of the time we take only one sample, what can be said about that a single sample’s ability to match the population? 2. Group Activity 2: Your group will need a sheet of 100 random rectangles for this activity. DO NOT study the sheet of random rectangles. The random rectangles represent a generic population of values. The areas of the rectangles could represent the number of people in households or the number of hours per week students spend on STAT 101. What we wish to investigate in this activity is what can be learned about the area of the rectangles using sampling techniques. (a) Each individual in the group should look at the sheet of random rectangles for a few seconds and guess the average area of the rectangles on the sheet. The unit of measure is the background square. Thus rectangle 7 has area 4x3=12. Compare your guess with those of the other members in your group. Discuss why you got different guesses. Record the individual guesses on your group data sheet. (b) Each individual should now select five rectangles that, in her/his judgment, are representative of the rectangles on the sheet. Write down the area for each of the five. Compute the average of the five areas. How does this compare with your guess? How does it compare with the averages of the judgment samples for the others in your group? Again discuss why you got different averages. Record the judgment sample averages on your group data sheet. 1 (c) We want to take a random sample from the 100 rectangles. We could cut out the rectangles, laminate them, put them in a bag, mix thoroughly and draw from the bag. Do you think this would give you a random sample? Hint: What is true about the slips for the Senators that is not true of the rectangles? (d) You now have a 10 sided die available to use. Devise a plan to produce a random sample from this population using the 10-sided die. (e) Using your sampling plan, take a simple random sample of size 5 from the rectangles. Record the rectangle and the area on your group answer sheet. What is the mean area of your sample? How does it compare with your guess and your judgment sample average? (f) If you select 5 new random numbers will you come up with the same average area? Explain briefly. (g) Which do you think will be a better guess of the true average area of the 100 rectangles, the random sample average of 5 rectangles or the random sample average of 20 rectangles? Explain briefly. 3. Group Activity 3: Pine forests are periodically infested with beetles and the infestations often seem to jump from tree to tree in a random manner. How can a forester estimate the total number of trees infested with beetles in a large forest? By taking a sample of plots in the forest and counting the number of infested trees in the sample, a forester will be able to use the sample data to estimate the total. To simulate this activity, imagine that Figure 1 represents a forest and the black dots represent infested trees in that forest. The forest has been divided into 100 plots of equal area. (a) You have a table of random numbers available for you to use. Devise a plan to produce a random sample from this population using the random numbers in the table. (b) Use your sampling plan to randomly select 10 plots from the table. For each plot count the number of infested trees (dots) that are entirely with in the plot. Do not count trees (dots) that overlap the edges of a plot. For example, the (Row 0, Column 0) plot has 6 infested trees. (c) Calculate the mean and the standard deviation of the 10 counts. (d) Using the mean in (c), estimate the total number of infested trees in the forest. (e) Again use the random number table to select 10 plots. This time for each plot count the number of infested trees (dots) that have any part within the plot. For example, the (Row 0, Column 0) plot has 7 infested trees. (f) Calculate the mean and standard deviation of the 10 counts. (g) Using the mean in (f), estimate the total number of infested trees in the forest. (h) What would be the best estimate of the total number of infested trees in the forest. Briefly explain your choice. 2 Statistics 101: Section L - Laboratory 8 Group Answer Sheet Names of Group Members: 1. Group Activity 1: Sample 1 name sex party years summaries p̂M = p̂D = X= Sample 2 summaries p̂M = p̂D = X= Sample 3 summaries p̂M = p̂D = X= 3 2. Group Activity 2: Record your group’s guessed values, judgment sample averages and random sample averages in the spaces below. (a) Guessed Values (b) Judgment sample averages Comparisons: (c) Difference between Senator’s and rectangles (d) Sampling plan (e) Random sample averages Comparisons: (f) Same mean for another sample? (g) Which is better: 5 or 20 4 3. Group Activity 3: (a) Sampling Plan (b) Counts for 10 selected plots, interior trees only. (c) Mean and standard deviation of the 10 counts. (d) Estimate of the total number of infested trees. (e) Counts for 10 selected plots, all trees. (f) Mean and standard deviation of the 10 counts. (g) Estimate of the total number of infested trees. (h) Best estimate of the total number of infested trees and explanation. 5