Part 4

STA 2023 E.Philias Part 4 Chapter 10: Hypothesis Testing (using a single sample) Hypothesis Testing is the statistical procedure designed to test a claim about a population parameter based on the statistical evidence contained in a representative sample of the population. Research hypothesis is a claim about a population parameter that can be tested using sample data. Statistical hypotheses are the null and alternative hypotheses. Null hypothesis ( 𝐻0 ) is a claim (Or statement) about a population parameter that is assumed to be true until it declared false. It describes the opposite of the alternative hypothesis. The null hypothesis must contain the equality sign. Alternative hypothesis ( 𝐻𝑎 ) describes the research hypothesis of the problem. It contains strict inequalities. Test Statistic is a formula that summarizes the statistical evidence collected against the null hypothesis (or in favor of the alternative/research hypothesis). Rejection region is the set of values of the test statistic indicating convincing evidence against Ho. Four Outcomes from Hypothesis Testing 𝐻0 𝑖𝑠 𝑡𝑟𝑢𝑒 𝐻0 is false Do not Reject 𝐻0 Correct decision Type II error Reject 𝐻0 Type I error Correct decision Type I error consists of rejecting the null hypothesis 𝐻0 when 𝐻0 is actually true. Type II error consists of failing to reject 𝐻0 when 𝐻0 is actually false. Alpha (𝛼) designates the probability of making a Type I error. STA 2023 E.Philias Beta (𝛽) designates the probability of type II error. p-value is a probability that measures the strength of our case against 𝐻0 (that is, in favor of 𝐻𝑎 ). The p-value of any statistical test describes the observed probability of type I error. Tails of a Test A two-tailed test has rejection regions in both tails, a left tailed test has the rejection region in the left tail, and a right-tailed test has the rejection region in the right tail of the distribution curve. Sign in the null Hypothesis 𝐻0 Sign in the alternative Hypothesis𝐻1 Rejection region Two-Tailed test = Left-Tailed Test = or ≥ Right-Tailed Test = or ≤ ≠ < > In both tails In the left tail In the right tail Steps to perform a test of Hypothesis A statistical test of hypothesis procedure has the following five steps. 1. State the null and alternative hypotheses. 2. Create the Decision rule by determining the rejection and nonrejection regions. 3. Calculate the value of the test statistic. 4. Make a decision. 5. State your conclusion STA 2023 E.Philias Testing Hypotheses Regarding the Population Mean (large sample) Step1: Determine the null and alternative hypotheses. The hypotheses can be structured in one of three ways: Two-Tailed Left-Tailed Right-Tailed 𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 = 𝜇0 𝐻1 : 𝜇 ≠ 𝜇0 𝐻1 : 𝜇 < 𝜇0 𝐻1 : 𝜇 > 𝜇0 Note: 𝜇0 is the assumed or status quo value of the population mean. Step 2 : Create the Decision rule by determining the rejection region. The level of significance is used to determine the critical value. Step 3: Calculate the value of the test statistic z= z= Where 𝜎𝑥̅ = 𝜎 √𝑛 and 𝑠𝑥̅ = x̅−𝜇 𝜎𝑥̅ x̅−𝜇 𝑠𝑥̅ if 𝜎 is known if 𝜎 is not known 𝑠 √𝑛 The value of z is calculated for x̅ using the formula is also called the observed value of z. Step 4: Decision: Compare the critical value with the test statistic: Two-Tailed If z < -𝑧𝛼 or z > 𝑧𝛼⁄2 Left-Tailed If z < -𝑧𝛼 reject Right-Tailed If z > 𝑧𝛼 , reject Reject the null hypothesis the null hypothesis the null hypothesis 2 Step 5: Conclusion 1) Reject 𝐻0 in favor of 𝐻𝑎 : The sample data provides sufficient evidence to support the research hypothesis. 2) Fail to reject 𝐻0 : The sample data provides insufficient evidence to support the research hypothesis. STA 2023 E.Philias Rejection Regions for Common Values of 𝜶 Alternatives Hypothesis 𝛼 = .10 𝛼 = .05 𝛼 = .01 Lower-Tailed z< -1.28 z< -1.645 z< -2.33 Upper-Tailed z > 1.28 z > 1.645 z > 2.33 Two-Tailed z< -1.645 or z > 1.645 Z < -1.96 or z > 1.96 z< -2.575 or z > 2.575 Hypothesis Tests using the p-Value approach The p-value is the smallest significance level at which the null hypothesis is rejected. P-value is a probability that measures the strength of our case against Ho (that is, in favor of Ha). The p-value of any statistical test describes the observed probability of type I error. Step 1: Determine 𝐻0 and 𝐻𝑎 Step 2: Decide on a level of significance, depending on the seriousness of making a type 1 error . ̅ x−𝜇 Step 3 : Compute the test statistics 𝑧0 = 𝜎 ̅ 𝑥 Step 4: Determine the p-value. Two tails test : p-value = 2P( z > 𝑧0 ) Right tail test : p-value = P( z > 𝑧0 ) Left tail test : p-value = P( z < 𝑧0 ) STA 2023 E.Philias Tests Concerning Means (Small Samples) Conditions under which the t distribution is used to make tests of hypothesis about 𝝁 The t distributions is used to conduct a test of hypothesis about 𝝁 if 1. The sample size is small ( n <30). 2. The population from which the sample is drawn is (approximately) normally distributed. 3. The population standard deviation 𝜎 is unknown. Test Statistic The value of the test statistic t for the sample mean x̅ is computed as x̅−𝜇 t= 𝑠 ̅ 𝑥 where 𝑠𝑥̅ = 𝑠 √𝑛 The value of t calculated for x̅ by using the above formula is also called the observed value of t. Hypothesis Test about a Population Proportion : Large samples The value of the test statistic z for the sample proportion, 𝑝̂ , is computed as Z= 𝑝̂−𝑝 𝜎𝑝 ̂ where 𝜎𝑝̂ =√ 𝑝̂(1−𝑝̂) 𝑛 The value of p used in this formula is the one used in the null hypothesis. The value of z calculated for 𝑝̂ is also called the observed value of z. STA 2023 E.Philias Technology Step by Step Hypothesis Tests Regarding 𝝁 (Large sample) Step 1: If necessary, enter raw data in 𝐿1 . Step 2: Press STAT, highlight TESTS, and select 1 : Z-Test. Step 3: If the data are raw, highlight DATA; make sure that List1 is set toL1 and Freq is set to 1. If summary statistics are known, highlight STATS and enter the summary statistics. Following 𝜎, enter the population standard deviation. For the value of 𝜇0 , enter the value of the mean stated in the null hypothesis. Step 4: Select the direction of the alternative hypothesis. Step 5: Highlight Calculate and press ENTER. The TI-83/84 plus gives the P-value. Hypothesis Tests Regarding 𝝁 (Small sample) Step 1: If necessary, enter raw data in 𝐿1 . Step 2: Press STAT, highlight TESTS, and select 2: T-Test. Step 3: If the data are raw, highlight DATA; make sure that List1 is set toL1 and Freq is set to 1. If summary statistics are known, highlight STATS and enter the summary statistics. Following 𝜎, enter the population standard deviation. For the value of 𝜇0 , enter the value of the mean stated in the null hypothesis. Step 4: Select the direction of the alternative hypothesis. Step 5: Highlight Calculate and press ENTER. The TI-83/84 plus gives the P-value. STA 2023 E.Philias Hypothesis Tests regarding a Population Proportion Step 1: Press STAT, highlight TESTS, and select 5: 1-PropZTest. Step 2: For the value of 𝑝0 , enter the “status quo” value of the population proportion. Step 3: Enter the number of successes, x, and the sample size n. Step 4: Select the direction of the alternative hypothesis. Step 5: Highlight Calculate or Draw, and press ENTER. The TI-83/84 gives the P-value. STA 2023 E.Philias Comparing Two Population Means: Independent Sampling To perform inference on the difference of two population means, we must first determine whether the data come from independent or dependent sample. A sampling method is independent when the individual selected for one sample do not dictate which individuals are to be in a second sample. A sampling method is dependent when the individuals selected to be in one sample are used to determine the individuals to be in the second sample. DIFFERENCES BETWEEN MEANS (LARGE SAMPLE) Large-Sample Confidence Interval for (𝝁𝟏 - 𝝁𝟐 ) (𝑥 ̅̅̅1 - ̅̅̅) 𝑥2 ± 𝑧𝛼⁄2 √ 𝜎12 𝑛1 + 𝜎22 𝑛2 Where 𝜎12 𝑎𝑛𝑑 𝜎22 are the variances of the two populations being sampled, and 𝑛1 𝑎𝑛𝑑 𝑛2 𝜎2 𝜎2 1 2 are the respective sample sizes. We also refer to √𝑛1 + 𝑛2 as the standard error of the statistic (𝑥 ̅̅̅1 - ̅̅̅). 𝑥2 STA 2023 E.Philias Large-Sample Test of Hypothesis for (𝝁𝟏 - 𝝁𝟐 ): Independent Sampling Step 1: Determine the null and alternative hypotheses. The hypotheses are structured in one of three ways: Two-tailed Left-Tailed Right-Tailed 𝐻0 : 𝜇1 = 𝜇2 𝐻0 : 𝜇1 = 𝜇2 𝐻0 : 𝜇1 = 𝜇2 𝐻𝑎 : 𝜇1 ≠ 𝜇2 𝐻𝑎 : 𝜇1 < 𝜇2 𝐻𝑎 : 𝜇1 > 𝜇2 Step 2: Select a level of significance 𝛼, depending on the seriousness of making a type I error. Step 3: Compute the test statistic Z= (𝑥 ̅̅̅1̅ − ̅𝑥̅̅2̅)− (𝜇1 − 𝜇2 ) 𝜎2 𝜎2 √ 1+ 2 𝑛1 𝑛2 ≈ (𝑥 ̅̅̅1̅ − ̅𝑥̅̅2̅)− (𝜇1 − 𝜇2 ) 𝑠2 𝑠2 √ 1+ 2 𝑛1 𝑛2 , with(𝜇1 − 𝜇2 ) =0 Step 4: Compare the critical value with the test statistic Step 5: Conclusion STA 2023 E.Philias Small-Sample Confidence Interval for (𝝁𝟏 − 𝝁𝟐 ) (Independent Samples) (𝑥 ̅̅̅1 - ̅̅̅) 𝑥2 ± 𝑡𝛼⁄2 √𝑠𝑝2 ( Where 𝑠𝑝2 = (𝑛1 −1)𝑠12 +(𝑛2 −1)𝑠22 𝑛1 +𝑛2 −2 1 𝑛1 + 1 𝑛2 ), and 𝑡𝛼⁄2 is based on (𝑛1 + 𝑛2 − 2) degrees of freedom. Here 𝑠𝑝 is the pooled sample standard deviation. Small-Sample Test of Hypothesis for (𝝁𝟏 − 𝝁𝟐 ) (Independent Samples) Test statistic: t = (𝑥 ̅̅̅1̅ − ̅𝑥̅̅2̅)−(𝜇1 − 𝜇2 ) 1 1 √𝑠𝑝2 (𝑛 +𝑛 ) 1 2 , (𝜇1 − 𝜇2 ) = 0 STA 2023 E.Philias Comparing Two Population Means: Dependent Samples Differences Between Means (Paired Data) Confidence Interval for Matched-Pairs Data (dependent Samples) A (1-𝛼)∙100% confidence interval for 𝜇𝑑 is given by Large Sample 𝜎 𝑑̅ ∓ 𝑧𝛼⁄2 ∙ 𝑑 ≈ 𝑑̅ ∓ 𝑧𝛼⁄2 ∙ √𝑛 𝑠𝑑 √𝑛 Small Sample 𝑑̅ ∓ 𝑡𝛼⁄2 ∙ 𝑠𝑑 √𝑛 The critical value 𝑡𝛼⁄2 is determined using n-1 degree of freedom. The value of 𝑑̅ and 𝑠𝑑 are the mean and standard deviation of the differenced data. Note: The interval is exact when the population is normally distributed and approximately correct for nonnormal populations, provided that n is large. Paired Difference Test of Hypothesis for 𝝁𝒅 = (𝝁𝟏 − 𝝁𝟐 ) =0 (Dependent Samples) Large sample 𝑑̅ 𝑑̅ √𝑛 𝑠𝑑 √𝑛 Test statistic : z = 𝜎𝑑 ≈ Small Sample: 𝑑̅ Test statistic: t = 𝑠𝑑 √𝑛 STA 2023 E.Philias Sampling Distribution of the Difference between two proportions (Large sample) Suppose a simple random sample of size 𝑛1 is taken from a population where 𝑥1 of the individual s have a specified characteristic, and a simple random sample of size 𝑛2 is independently taken from a different population where 𝑥2 of the individuals have a 𝑥 𝑥 specified characteristic. The sampling distribution of 𝑝 ̂1 - 𝑝 ̂, ̂1 =𝑛1 and 𝑝 ̂2 =𝑛2 ,is 2 where 𝑝 1 2 approximately normal. With mean 𝜇𝑝̂1 ̂2 − 𝑝 =𝑝1 − 𝑝2, and standard deviation 𝜎𝑝̂1 ̂2 − 𝑝 =√ ̂ ̂(1−𝑝 𝑝 1 1) 𝑛1 + ̂ ̂(1−𝑝 𝑝 2 2) 𝑛2 Large-Sample A (1- 𝜶)∙100% confidence interval for (𝒑𝟏 − 𝒑𝟐 ) ̂1 - 𝑝 (𝑝 ̂) ∓ 𝑧𝛼⁄2 √ 2 ̂ ̂(1−𝑝 𝑝 1 1) 𝑛1 + ̂ ̂(1−𝑝 𝑝 2 2) 𝑛2 Large-sample Test of Hypothesis about (𝒑𝟏 − 𝒑𝟐 ) The z-statistic Z = ̂1 − 𝑝 ̂)−(𝑝 (𝑝 2 1 −𝑝2 ) 1 1 is used to test the null hypothesis that 𝐻0 : (𝑝1 − √𝑝̂ (1 − 𝑝̂)(𝑛 +𝑛 ) 1 2 𝑝2 ) = 0 (or, equivalently 𝐻0 : 𝑝1 = 𝑝2) the best estimate of 𝑝1 = 𝑝2 =p is 𝑝̂ = The best point estimate for p is called the pooled estimate of p, denoted by 𝑝̂ . 𝑥1 +𝑥2 𝑛1 +𝑛2

Part 4

Related documents

Products

Support

Part 4

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib