Lecture 6 Hypothesis Testing and One Population Tests Howell Ch 8, 12 Statistical Inference Two processes I. Estimation of the value of a population parameter. Question asked: What is the value of the population parameter? Answer: A number, called a point estimate, or an interval, called a confidence interval. The interval is one that has a prespecified probability (usually .95) of surrounding the population parameter. Example: What % of persons think we should bomb selected targets in Syria? Answer might be: 38% with a 5% margin of error. This is a combination of point estimate (the 38%) and a interval estimate (from 38-5 to 38+5 or from 33 to 43%). The 5% means that the interval has probability .95 of surrounding the true population percentage. From this we could conclude that it’s quite likely that the population does NOT want strikes. II. Hypothesis Testing. Null Hypothesis Significance Testing (NHST) A. With one population. Deciding whether a particular population parameter (usually the mean) equals a value specified by prior research or other considerations. Example: A light bulb is advertised as having an “average lifetime” of 5000 hours. Question: Is the mean of the population of lifetimes of bulbs produced by the manufacturer equal to 5000 or not? B. With two populations. Deciding whether corresponding parameters (usually means) of the two populations are equal or not. Example: Statistics taught with lab vs. Statistics taught without lab. Question: Is the mean amount learned by the population of students taught with a lab equal to the mean amount learned by the population of students taught without lab? C. Three populations. Deciding whether corresponding parameters (usually means) of the three populations are equal or not. And on and on and on. Inference and One Population Tests - 1 2/9/2016 Introduction to Hypothesis Testing Suppose we have a single population, say the population of light bulbs mentioned above. Suppose we want to determine whether or not the mean of the population equals 5000. Two possibilities 0. The population mean equals 5000, i.e., it is not different from 5000. 1. The population mean does not equal 5000, i.e., it is different from 5000. These possibilities are called hypotheses. The first, the hypothesis of no difference is called the Null Hypothesis: H0. The second, the hypothesis of a difference is called the Alternative Hypothesis: H1. Our task is to decide which of the two hypotheses is true. Two general approaches 1. The Bill Gates (Warren Buffet, Carlos Slim Helu’) approach. Purchase ALL the light bulbs in the population. Measure the lifetime of each bulb. Compute the mean. If the mean equals 5000, retain the null. If the mean does not equal 5000, reject the null. Problem: Too many bulbs. Cost would be enormous. Might not even be possible to get all of them. 2. The Plan B approach. Take a sample of light bulbs. Compute the mean of the sample. Apply the following, intuitively reasonable decision rule If the value of the sample mean is “close” to 5000, decide that the null must be true. If the value of the sample mean is “far” from 5000, decide that the null must be false. But how close is “close”? How far is “far”? What if the mean of the lifetimes of 25 bulbs were 4999.99? What if the mean of the lifetimes of 25 bulbs were 1003.23? What if the mean of the lifetimes of 25 bulbs were 4876.44? Clearly we need some rules. Inference and One Population Tests - 2 2/9/2016 The p-value. Statisticians have decided to redefine “close” and “far” in terms of probabilities, specifically, probabilities computed under the assumption that the null hypothesis is true. They first compute the probability of an outcome as extreme as the sample outcome if the null were true. That probability is called the p-value for the experiment. The p-value is also called the probability of the obtained outcome due to chance alone. The significance level. Statisticians then choose a criterion representing a comfortably small probability. That value is called the significance level of the statistical test. It is typically .05 or some value smaller than .05. They then use the following rule: Loosely . . . If the p-value is small, then reject the null hypothesis If the probability of the observed outcome due to chance alone is small, then it must not be due to chance, so reject the null (chance) hypothesis. If the p-value is large, then retain the null hypothesis. If the probability of the observed outcome due to chance along is large, then it must be due to chance, so retain the null (chance) hypothesis. Formally: If the p-value is less than or equal to the significance level, then reject the null. If the p-value is larger than the significance level, then retain the null. Inference and One Population Tests - 3 2/9/2016 The Steps in Hypothesis Testing Is the Population mean = 5000 or not? Step 1. From the problem description, state the null hypothesis required by the problem and the alternative hypothesis. Step 2. Choose a test statistic whose value will allow you to decide between the null and the alternative hypothesis. Step 3. Determine the sampling distribution of that test statistic under the assumption that the null hypothesis is true. We need the sampling distribution in order to get the p-value. Once you've determined the sampling distribution, you'll know what values of the test statistic would be expected to occur if the null were true and what values would not be expected if the null were true. Step 4. Set the significance level of the test. Step 5. Compute the value of the test statistic. Step 6. Compute the p-value of the test statistic. The p-value is the probability of a value as extreme as the obtained value if the null were true, i.e., the probability of the obtained outcome due only to chance. In most cases, it will be printed by SPSS. Unfortunately, it will be labeled "Sig." not p. Step 7. Compare the p-value with the significance level and make your decision. The key steps graphically . . . Articulate Null Hypothesis and Alternative Hypothesis Reject Null Yes Gather Data Compute p-value p<= Significance level? Inference and One Population Tests - 4 No Do not reject Null 2/9/2016 Worked Out Example Start here on 9/29/15. We'll start with the test of a hypothesis about the mean of a single population pretending that we know the value of sigma. (Text Ch. 11). I. A consumer organization decides to test the truth of a manufacturer’s claims concerning the lifetimes of its “CF” – Compact Florescent bulbs. These bulbs are advertised to have an average lifetime of 5000 hours. Note that the manufacturer is not claiming that any specific bulb will last 5000 hours but that the average of the population of lifetimes will be 5000. Suppose that it was known that the standard deviation of lifetimes to burn-out in the population is 250, based on several years of research with CF bulbs. The magazine employees purchase 25 bulbs and burn them all until they fail. The mean of the sample is not yet known. Note that the observed mean is not shown above. This is done to emphasize that in real life, we set up the hypothesis testing process before knowing the result of the experiment. Step 1: Null hypothesis and Alternative hypothesis. Hmm. We want to know whether the mean of the lifetime of the bulbs is equal to 5000. This translates to . . . Null Hypothesis: The mean of the population equals 5000. µ=5000. Alternative Hypothesis: The mean of the population does not equal 5000. µ≠5000. Step 2: Test Statistic: In this particular problem, we have a choice. We could use X-bar, the mean of the sample as our test statistic. But statisticians have found that it’s simpler to use the Z-statistic computed from X-bar. It turns out that it's easier to use the Z-statistic from the start, since we ultimately have to compute it anyway. Inference and One Population Tests - 5 2/9/2016 Step 3. Sampling distribution of test statistic if null is true. For this step you have to pretend that the null hypothesis is true and then ask yourself, "If I took 1000's of samples of 25 scores in each sample from the population, what would be the distribution of my test statistic be?" The answer is as follows We will use Z=(X-bar-5000)/(250/5) = (X-bar-5000)/50 as the test statistic. Recall that 250 is the population standard deviation and 25 is the sample size, so sigma/sqrt(N) = 250/sqrt(25)=250/5=50. Name of Sampling Distribution if the null hypothesis is true = ___Normal__ Mean of sampling distribution if null is true = ______0________ Standard Deviation of sampling distribution if null is true = ______1________ The distribution is the normal from the Central Limit Theorem. The mean of the distribution of Z's is 0 from our study of Z-scores. And the standard deviation of Z's is 1, again from our study of Z's. Step 4: Significance Level This one's easy. It's quite common to use .05 as the criterion for the probability of the observed outcome. If that probability is less than or equal to .05, we reject. If that probability is larger than .05, we retain the null. So let the significance level be .05. Step 5: Computed value of test statistic From the problem description we find that the mean of the sample is 4925. So X-bar = 4925 If X-bar = 4925, “Z” = (4925-5000) / 250/5 = (4925-5000)/50 = -75/50 = -1.5 Inference and One Population Tests - 6 2/9/2016 Step 6: p-value. In this case, compute the probability of a value as extreme or more extreme than the obtained value if the null were true. Distribution of Zs under assumption that the null is true -2 -2 Z= Tail area = p-value = -1 -1 00 -1.5 22 11 1.5 .0668 .0668 .0668 + .0668 =.1336 This is the probability of getting a Z-score as extreme as 1.5 in either direction by chance alone. Step 7: Conclusion Since the p-value is larger than .0500, do not reject the null hypothesis. A Z as extreme as 1.5 could have been obtained by chance alone. We have no evidence that the population mean of lifetimes is different from 5000. Inference and One Population Tests - 7 2/9/2016 Possible Outcomes of the HT Process State of World Null True µ=5000 Null False µ≠5000 Reject Null Incorrect Rejection Type I Error Correct Rejection Retain Null Correct Retention Incorrect Retention Type II Error Decision Type I Error: Rejecting a true null hypothesis. The Null is true, the population mean really is 5000 but was incorrectly rejected. How can that happen? Because of a random accumulation of factors, our outcome was one which seemed inconsistent with the null. So we rejected it - incorrectly. The pop mean of lifetimes was 5000, but chance factors caused the sample mean to be very far from 5000, leading to rejection of the null. Controlling P(Type I Error): The significance level determines the probability of this type of error. This type of error will occur with probability given by the significance level - usually .05. That is, if you always employ a significance level of .05, then 5% of those times when the null actually is true, you will incorrectly reject it. Type II error: Retaining a false null hypothesis. The Null is false but was incorrectly retained. How could that happen? Because of a random accumulation of factors, chance factors working backwards, our outcome was one which appeared consistent with the null. So our decision was to not reject it – an incorrect decision. The manufacturer was lying, but we incorrectly concluded that the advertising was true. A difference was "missed". We’ll discuss this type of error in the lecture on Power. The probability of this type of error should be kept as small as is possible. Controlling P(Type II error). We have less control over the size of the probability of a Type II error than we do over the probability of a type I error. Sample size is easiest to manipulate. Larger sample sizes lead to smaller P(Type II error). More on this in the section on Power. Correct Retention: Retaining a true null hypothesis. Start here on 10/6/15. The null is true. It should have been retained and it was. This is affirming the null. It is a correct decision. The manufacturer’s claim was correct, and we concluded that it was correct. Controlling: Significance level. Correct Rejection: Rejecting a false null hypothesis The null is false. It should have been rejected, and it was. This is where we want to be. The null was false. We "detected" the difference. The manufacturer’s claim was not correct, and we detected that it was not correct. The probability of correctly rejecting a false Null is called the Power of the statistical test. Power equals 1 - P(Type II Error). More on power later. Controlling: Same factors as those that affect P(Type II error). Sample size is most frequently employed. Inference and One Population Tests - 8 2/9/2016 Estimation Using information from samples to guess the value of a population parameter or difference between parameters. Point Estimate A single value which represents our single best estimate of the value of a population parameter. Point estimate of Pop mean is sample mean. Point estimate of Population variance is SN-12. Interval Estimate An interval which has a prespecified probability of surrounding the unknown parameter. Typically centered around the point estimate. <---------Confidence Interval----------> Lower limit Upper limit Point estimate statistic Statistics used as point estimates: Sample mean, Difference between two sample means, Pearson r Estimation and Hypothesis Testing A statistical law that relates estimation to hypothesis testing is the following: If you have a 95% confidence interval for a population parameter, 1) you reject at significance = .05 the null hypothesis that the population mean equals ANY value outside the confidence interval. 2) you retain at significance = .05, the null hypothesis that the population mean equals ANY value Inside the confidence interval. Suppose the confidence interval for the light bulb problem is 4827 – 5023. Since this interval includes 5000, this means that we retain the null hypothesis that the population mean = 5000. On the other hand, if the null hypothesis was that Population mean = 4500 you would reject that hypothesis. Similarly if the null were that the population mean = 5500 you would reject. Etc. Inference and One Population Tests - 9 2/9/2016 Rationales for computing confidence intervals rather than doing hypothesis tests – 1. From the above, once you have a confidence interval, you can use it to test any hypothesis you wish, so it’s more efficient to simply compute a confidence interval. 2. A confidence interval gives us a perspective on the location of the population mean. A hypothesis test gives us only a “Yes” vs “No” answer. But a confidence interval gives us that “Yes” vs “No” result along with some perspective. Example. Both of the following hypothetical confidence intervals exclude 0, implying that the null hypothesis that the population mean equals 0 would be rejected. But the one on the right gives greater assurance that the population mean is not 0 than does the one on the left. CI --------------0-(------------)------------------Limits of CI: (.002, .270) CI ---0-------------(------------)--------------(.123, .347) The CI tells us how far the hypothesized population mean value is from our outcome. If it is just outside the CI, that gives us the impression that our hypothesis was “almost” correct. If it’s WAY outside the CI, gives us the impression that our hypothesis was “way” incorrect. 3 . If multiple hypothesis tests on the same phenomenon are conducted, some will reject and some will retain the null. This leads to the appearance of inconclusiveness on the part of data analysts. The report of multiple confidence intervals which will probably overlap gives a more coherent picture of the phenomenon. Suppose there were 4 tests of the same hypothesis. Experiment 1 Experiment 2 Experiment 3 Experiment 4 Hypothesis Tests CIs Reject (.002, .270 Retain (-.003, .265 Reject (.005, .275 Retain (-.004, .285) CIs Graphically ----------0-(-----------)------------------------------(-0----------)----------------------------------0---(----------)------------------------------(-0---------------)--------------------- Looking at just the Reject – Retain – Reject – Retain outcomes of the hypothesis tests leaves you feeling as if the researcher doesn’t know what’s going on. But looking at the confidence intervals (Mike – pull aside the white square) shows that those opposite conclusions might have been all identical if the data had been just slightly different from one Experiment to the next. Reporting confidence intervals gives us the opportunity to see how close different experimental results were to each other. Inference and One Population Tests - 10 2/9/2016 Example involving One Population Mean - value of sigma is known A manufacturer of tires claims that average lifetime of its tires is 30,000 miles. A sample of 25 tires is purchased and the tires are put on test cars that are driven until the tire tread reaches the minimum legal depth. Based on the results of the sample, what can be concluded about the manufacturer's claim? Suppose that years of experience with the tire indicated that the population standard deviation is 900. Step 1. From the problem description, state the null hypothesis required by the problem and the alternative hypothesis. Null: Mean of the population of mileages of the tires is 30,000. Alternative: Mean of the population of mileages is not 30,000. Step 2. Choose a test statistic whose value will allow you to decide between the null and the alternative hypothesis. We’ll use the one sample Z statistic, Z= (X-bar – 30,000) ---------------------------Sigma / square root of N The standard error of the mean is 900/sqrt(25) = 900/5 = 180. Step 3. Determine the sampling distribution of that test statistic under the assumption that the null hypothesis is true. From our previous discussions, the distribution of the Z statistic, if the null is true, is the Normal distribution with mean = 0 and standard deviation equal 1. Step 4. Set the significance level of the test. We’ll use .05. Step 5. Compute the value of the test statistic. Suppose that the mean of the sample of 25 tires was 29,640. Since the sample mean is 29,640, Z = (29,640 – 30,000) / 180 = -360/180 = -2.00. Step 6. Compute the p-value of the test statistic. The p-value is the probability of a Z as extreme as -2.00 P(Z <= -2.00) = .0228. P(Z >= +2.00) = .0228. P(Z as extreme as -2.00 = P(Z <= -2.00) + P(Z >= +2.00) = .0228 + .0228 = .0456. Critical Z values In the precomputer days, data analysts determined the Z value whose p would be just equal to .05. That value was called the critical Z. Any obtained Z larger than the critical Z in absolute value would have a p less than or equal to .05. The critical Z corresponding to alpha = .05 is 1.96. So any obtained Z larger than or equal to 1.96 will have a p less than or equal to .05. Nowadays, those who do a lot of hypothesis testing may remember that the critical Z corresponding to significance level is 1.96. But for other tests, we’ll let our computer program compute the p-value directly. Step 7. Compare the p-value with the significance level and make your decision. Since p < .05, Reject the null hypothesis that Population mean = 30,000. Conclude that population mean is less than 30,000. Inference and One Population Tests - 11 2/9/2016 Confidence Interval on the Population mean Skip the details in 2015 1. Compute the point estimate. In this case, that's the sample mean, 29,640. 2. Decide on the level of confidence desired. How about 95%. 3. Use the following formulas to compute the upper and lower limits of the interval using the following formula's LL = X-bar + ZC1 * Standard error of X-bar UL = X-bar + ZC2 * Standard error of X-bar ZC1 and ZC2.are the values equal distance from the mean between which 95% of the scores fall. (If this were a theoretically oriented course, we would show how ZC1 and ZC2 are dervied.) In the case of normally distributed quantities, these values are -1.96 and +1.96. So, for our problem, the 95% confidence interval would be LL = 29,640 - 1.96*900/5 UL = 29,640 + 1.96 * 900/5 LL = 29,640 - 352.80 UL = 29,640 + 352.80 LL = 29,287.2 UL = 29,992.8 The probability is 95% that the interval 29,287.2 to 29,992.8 surrounds the population mean. Graphically 30,000 | ------------------------------------(--------------------------)-----------------------------------------------------| | 29, 287 29,993 Loosely, we can be 95% sure that the population mean falls in the interval 29,287 to 29,993. And we can see from the graphical representation that the confidence interval did not include 30,000. Inference and One Population Tests - 12 2/9/2016 Example involving One Population Mean - value of sigma not known A manufacturer of tires claims that average lifetime of its tires is 30,000 miles. A sample of 25 tires is purchased and the tires are put on test cars that are driven until the tire tread reaches the minimum legal depth. Based on the results of the sample, what can be concluded about the manufacturer's claim? Suppose that you don’t know the value of the population standard deviation. Suppose that the mean of the sample of 25 tires was 29,640 with sample standard deviation equal to 900. Step 1. From the problem description, state the null hypothesis required by the problem and the alternative hypothesis. Null: Alternative: Mean of the population of mileages of the tires is 30,000. Mean of the population of mileages is not 30,000. Step 2. Choose a test statistic whose value will allow you to decide between the null and the alternative hypothesis. We can’t use the Z statistic, because its formula requires that we know the value of the population standard deviation. A natural substitution for the population standard deviation is the sample standard deviation. This would yield the formula, Statistic = (X-bar – 30,000) --------------------------------Sample SD / square root of N Step 3. Determine the sampling distribution of that test statistic under the assumption that the null hypothesis is true. The statistic above is NOT a Z, since S, rather than sigma has been used in the denominator. So the sampling distribution is not the Normal Distribution, as it would be if the statistic were Z. The actual formula for the sampling distribution was discovered by William Gossett, in the early 1900’s. He called the statistic the t statistic and called the distribution the T distribution. The T distribution has mean 0. Its standard deviation is larger than 1, and depends on a quantity called degrees of freedom. In the one-sample case, df = N – 1 = 24 for our example. The shape of the T distribution is similar to the Normal. Step 4. Set the significance level of the test. We’ll use .05. Step 5. Compute the value of the test statistic. Since the sample mean is 29,640, t = (29,640 – 30,000) / (900/5) = 360/180 = -2.00. Inference and One Population Tests - 13 2/9/2016 Step 6. Compute the p-value of the test statistic. The p-value is the probability of a t as extreme as -2.00. The formula for computing the p-value for a t is quite complicated. Luckily, we have computers. Our computer program will compute the value of t and its p-value automatically. That p is .057. Or we could look up the critical t for df = 24 and alpha = .05. That critical t value, from the text, is 2.064. If the obtained t exceeds the critical t in absolute value that would mean that p would be less than or equal to .05 and we could reject the null. Step 7. Compare the p-value with the significance level and make your decision. Since p > .05, do not reject the null hypothesis that Population mean = 30,000. We will act as if the population mean is equal to 30,000. So, we rejected the null hypothesis when using the Z statistic. But we did not reject the null when using the t. This illustrates the fact that the Z statistic is slightly more powerful than the t. Inference and One Population Tests - 14 2/9/2016 Confidence Interval on the Population mean 1. Compute the point estimate. In this case, that's the sample mean, 29,640. 2. Decide on the level of confidence desired. How about 95%. 3. Compute the upper and lower limits of the interval using the following formula's LL = X-bar + tC1 * Standard error UL = X-bar + tC2 * Standard error (Again, if this course were theoretically oriented, we would derive tC1 and tC2 here.) tC1 and tC2.are the t values equal distance from the mean between which 95% of the scores fall. In this case, these values are the two-tailed critical values that would be used if alpha were .05. From Table B, Appendix C, p. 465, critical t when df=24 is 2.064. You won’t be tested over this. So, for our problem, the 95% confidence interval would be LL = 29,640 – 2.064*900/5 UL = 29,640 + 2.064 * 900/5 LL = 29,268.5 UL = 30,011.5 The probability is 95% that the interval 29,268.5 to 30,011.5 surrounds the population mean. Graphically 30,000 | ------------------------------------(------------------------------)------------------------------------------------| | 29, 287 30,011.5 Inference and One Population Tests - 15 2/9/2016 Comparing the Z-test and the t-test for the same data Z-test results t-test results H0: Pop mean = 30000 H0: Pop mean = 30000 Pop SD = 900 Sample SD = 900 N = 25 N = 25 Sample mean = 29,640 Sample mean = 29,640 Z= (29640 – 30000)/(900/5) t = (29640 – 30000)/(900/5) 2.00 p= 2.00 .0456 p= .0570 (----------------------------------------------------) Z CI = ---------------------------|---------------------------|-------------------------|-------------------------29,287 29640 29,993 (--------------------------------------------------------------) T CI = ----------------------|--------------------------------|------------------------------|------------------29,268 29640 30011 30,000 The bottom line is that the Z is a more powerful test statistic than the t. But the Z is higher maintenance – it requires that you know the value of the population standard deviation in order to achieve that power. A tractor is a more powerful tool for plowing than a horse. But you must have gasoline to take advantage of that power. The t is more forgiving, but at the price of not giving you the precision of the Z. Inference and One Population Tests - 16 2/9/2016 One Sample t – SPSS Example A manufacturer of tires claims that average lifetime of its tires is 30,000 miles. A sample of 25 tires is purchased and the tires are put on test cars that are driven until the tire tread reaches the minimum legal depth. Based on the results of the sample, what can be concluded about the manufacturer's claim? The mileage values for the tires are below. id 1 2 3 4 5 6 mileage 29050 29837 29622 29546 28197 28277 7 8 9 10 11 12 29585 31344 29702 29585 29564 28824 13 14 15 16 17 18 31158 29084 29467 28261 28496 30784 Inference and One Population Tests - 17 19 20 21 22 23 24 25 2/9/2016 30534 30397 29883 29077 31239 29630 29857 Select the variable to be analyzed and move it to the “Test Variable(s)” field. Very Important. Put the hypothesized value of the population mean in the “Test Value” field. Choosing the confidence interval level. Click on the [Options] button to open the Confidence Interval dialog box. Hypothesis Testing and One Population Tests - 18 2/9/2016 The Output T-Test Test value entered above. One-Sam ple Stati stics N mil eage 25 Me an Std . Deviatio n Std . Erro r Me an 296 40.0 0 899 .908 179 .982 One-Sam ple Test Te st Val ue = 3000 0 95% Co nfide nce In terva l of the Diffe rence mil eage t -2.0 00 Value of t df 24 Sig . (2-t ailed ) Me an Differe nce .05 7 -36 0.000 The twotailed p-value. Lower -73 1.46 Up per 11. 46 Standardized Confidence interval. Add test value to each limit to get it in terms of the problem’s values. Correct limit of confidence are LL = -731.46 + 30,0000 = 29,268.54 UL = 11.46 + 30,000 = 30,011.46 Hypothesis Testing and One Population Tests - 19 2/9/2016 Hypotheses about the Population Correlation Coefficient, r Biderman and Nguyen (2004) conducted research in which participants were given the Big 5 personality test twice, once under instructions to respond honestly and then again under instructions to “fake good”. We computed a score on each of the Big 5 dimensions under the fake condition and a score on each of the dimensions under the honest conditions. We subtracted the honest condition scores from the faking condition scores and then computed the mean of those 5 differences. That average was the respondents “Faking ability” score. A positive value means that the person was able to fake good, responding with higher scores in the faking condition than in the honest condition. A score of 0 meant that the respondent score no higher on the personality dimensions in the faking condition than in the honest condition. And a negative score meant that the respondent actually scored lower on the personality dimensions in the faking condition than in the honest condition. We gave each respondent the Wonderlic, a common measure of cognitive ability. The distribution of faking ability scores and scores on the Wonderlic are as follows . . 30 25 Frequency 20 15 10 5 0 -2.0 -1.0 0.0 1.0 2.0 Mean = 0.512 Std. Dev. = 0.7247 3.0 N = 203 Ability to fake the Big 5 defined as the mean of the 5 F-H differences 3.0 2.0 1.0 Faking Ability wonderlic test score 40 30 Frequency Ability to fake the Big 5 defined as the mean of the 5 F-H differences Ability to fake the Big 5 defined as the mean of the 5 F-H differences 20 10 0.0 -1.0 R Sq Linear = 0.094 -2.0 Mean = 24.61 Std. Dev. = 6.753 N = 203 0 0 10 20 30 wonderlic test score 40 50 10 20 30 40 wonderlic test score We then correlated the FAKABIL and WONDLIC scores. The scatterplot is above. The question is: In the population, is there a correlation between cognitive ability and faking ability? Hypothesis Testing and One Population Tests - 20 2/9/2016 50 Testing Hypotheses on Population r Null Hypothesis: The population correlation coefficient is 0 Alternative Hypothesis: The population correlation coefficient is not 0. Test statistics: (There are two possibilities) 1) A t statistic Sample r – 0 t = ---------------------1 – Sample r2 ------------N - 2 Critical t values would be obtained from tables of critical t. 2) r itself. Critical r values would be obtained from tables of critical r. SPSS Analysis Analyze -> Correlate -> Bivariate r Correla tions gscore fakabil Abilit y to fake the Big 5 de fined as th e gscore wond erlic me an o f the 5 test score F-H diffe rences wond erlic test score Pe arson Correlatio n 1 .30 7** Sig . (2-t ailed ) N fakabil Abilit y to f ake th e Big 5 de fined as th e me an of the 5 F-H d iffere nces . .00 0 20 3 20 3 Pe arson Correlatio n .30 7** Sig . (2-t ailed ) .00 0 . 20 3 20 3 N p-value N 1 **. Co rrelat ion is sign ifican t at th e 0. 01 le vel (2 -taile d). Conclusion: Our data suggest that in the population, the ability to fake a Big 5 personality test is positively related to cognitive ability. Confidence intervals for the Population r Minium, et. al.’s Figure 16.7 on p. 302 is an excellent table. (I’ll copy and print it for you.) Formulas required to determine the LLs and ULs are complicated. Hypothesis Testing and One Population Tests - 21 2/9/2016