ES10003: Introduction to Statistics
Formulae & Symbols Guide
1. Chebyshev's Inequality
For any population with mean μ and standard deviation σ, and any k > 1, the percentage of observations that fall within the interval [μ ± kσ] is at least $100\left[1 - (1/k^{2})\right]\%$.
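As a quick numerical illustration (not part of the formula sheet itself), the bound can be checked on simulated data. The exponential distribution, sample size, and seed below are arbitrary choices:

```python
import random

# Empirical check of Chebyshev's inequality: for ANY distribution, at least
# 100*(1 - 1/k^2)% of observations lie within mu +/- k*sigma.
random.seed(0)
data = [random.expovariate(1.0) for _ in range(100_000)]  # mean 1, sd 1

n = len(data)
mu = sum(data) / n
sigma = (sum((x - mu) ** 2 for x in data) / n) ** 0.5  # population-style sd

k = 2
within = sum(1 for x in data if abs(x - mu) <= k * sigma) / n
bound = 1 - 1 / k**2  # Chebyshev guarantees at least this fraction

print(f"observed: {within:.3f}, Chebyshev lower bound: {bound:.3f}")
```

The observed fraction is typically well above the bound; Chebyshev only guarantees a (often loose) minimum.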
2. Combinations
The number of combinations of n items taken m at a time:
$$ {}_{n}C_{m} = \frac{n!}{m!\,(n-m)!} $$
3. Permutations
The number of permutations of n items taken m at a time:
$$ {}_{n}P_{m} = \frac{n!}{(n-m)!} $$
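A minimal worked example of both counting formulas, using factorials directly and cross-checking against Python's built-in `math.comb` and `math.perm`:

```python
import math

n, m = 10, 3

# nCm = n! / (m! (n-m)!): order does not matter
combinations = math.factorial(n) // (math.factorial(m) * math.factorial(n - m))

# nPm = n! / (n-m)!: order matters
permutations = math.factorial(n) // math.factorial(n - m)

print(combinations, permutations)
assert combinations == math.comb(n, m)   # built-in agrees
assert permutations == math.perm(n, m)
```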
4. Multiplication rule in probability
$$ P(A \mid B)\,P(B) = P(A \cap B) $$
5. Bayes' Theorem
Let A and B be two events. Bayes' theorem states:
$$ P(B \mid A) = \frac{P(A \mid B)\,P(B)}{P(A)} $$
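A short numerical sketch of Bayes' theorem combined with the multiplication rule. The diagnostic-test numbers are hypothetical, chosen purely for illustration:

```python
# Hypothetical numbers (not from the course notes):
# B = "has condition", A = "test is positive".
p_B = 0.01             # P(B): prevalence
p_A_given_B = 0.95     # P(A|B): sensitivity
p_A_given_notB = 0.05  # P(A|not B): false-positive rate

# Total probability: P(A) = P(A|B)P(B) + P(A|not B)P(not B)
p_A = p_A_given_B * p_B + p_A_given_notB * (1 - p_B)

# Bayes' theorem: P(B|A) = P(A|B)P(B) / P(A)
p_B_given_A = p_A_given_B * p_B / p_A

print(f"P(B|A) = {p_B_given_A:.4f}")
```

Note how a positive result from an accurate test can still leave P(B|A) low when the prior P(B) is small.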
6. Binomial Distribution
$$ P(x) = \frac{N!}{x!\,(N-x)!}\,P^{x}(1-P)^{N-x} $$
where P(x) is the probability of x successes in N trials, P is the probability of success in a single trial, and N is the sample size (number of trials).
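The binomial formula translated directly into code, with an arbitrary example (3 successes in 10 fair-coin trials) and a sanity check that the probabilities sum to 1:

```python
import math

def binomial_pmf(x, N, P):
    """P(x) = N! / (x! (N-x)!) * P^x * (1-P)^(N-x)."""
    return math.comb(N, x) * P**x * (1 - P) ** (N - x)

# Example: exactly 3 successes in 10 trials with P = 0.5
p3 = binomial_pmf(3, 10, 0.5)
print(f"{p3:.4f}")

# Sanity check: P(x) over all x = 0..N sums to 1
total = sum(binomial_pmf(x, 10, 0.5) for x in range(11))
```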
7. The Poisson Distribution
The probability of x successes is:
$$ P(x) = \frac{e^{-\lambda}\,\lambda^{x}}{x!} $$
where P(x) is the probability of x successes over a given interval of time or space, and λ is the expected number of successes per time or space unit, λ > 0.
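The Poisson pmf written out directly; λ = 2 below is an arbitrary illustrative rate:

```python
import math

def poisson_pmf(x, lam):
    """P(x) = e^(-lambda) * lambda^x / x!"""
    return math.exp(-lam) * lam**x / math.factorial(x)

# Example: lambda = 2 expected successes per unit of time or space
p0 = poisson_pmf(0, 2.0)  # = e^-2
p2 = poisson_pmf(2, 2.0)  # = e^-2 * 2^2 / 2!
print(f"P(0)={p0:.4f}, P(2)={p2:.4f}")
```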
8. Rectangular or continuous uniform distribution
$$ f(x) = \begin{cases} \dfrac{1}{b-a} & \text{if } a \le x \le b \\[4pt] 0 & \text{otherwise} \end{cases} $$
9. The Normal Distribution
$$ f(x) = \frac{1}{\sqrt{2\pi\sigma^{2}}}\;e^{-(x-\mu)^{2}/2\sigma^{2}} $$
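The normal density written out as a function. The value at x = μ is the peak, $1/\sqrt{2\pi\sigma^{2}}$, which the example below evaluates for the standard normal:

```python
import math

def normal_pdf(x, mu, sigma):
    """f(x) = 1/sqrt(2*pi*sigma^2) * exp(-(x-mu)^2 / (2*sigma^2))."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma**2)) / math.sqrt(2 * math.pi * sigma**2)

# Peak of the standard normal density, at x = mu = 0 with sigma = 1
peak = normal_pdf(0.0, 0.0, 1.0)
print(f"{peak:.4f}")
```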
10. Finite Population Correction factor
If the sample size n is not a small fraction of the population size N, then a finite population correction is applied:
$$ \sqrt{\frac{N-n}{N-1}} $$
11. Standard error of the mean:
$$ \sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} $$
12. Standard Normal Distribution
$$ Z = \frac{\bar{x} - \mu_{x}}{\sigma_{\bar{x}}} $$
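Sections 10–12 fit together in one short calculation. All the numbers below (population mean and sd, sample and population sizes, observed sample mean) are hypothetical:

```python
import math

# Hypothetical numbers for illustration
sigma = 10.0       # population standard deviation
mu = 50.0          # population mean
n, N = 100, 1000   # sample size and population size
x_bar = 52.0       # observed sample mean

se = sigma / math.sqrt(n)           # standard error of the mean
fpc = math.sqrt((N - n) / (N - 1))  # finite population correction
se_corrected = se * fpc             # used when n is not a small fraction of N

z = (x_bar - mu) / se               # standard normal test statistic
print(f"se={se:.3f}, fpc={fpc:.4f}, z={z:.2f}")
```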
13. Results used for the Confidence Interval for the sample variance
$$ E(s^{2}) = \sigma^{2}, \qquad \mathrm{Var}(s^{2}) = \frac{2\sigma^{4}}{n-1} $$
$(n-1)s^{2}/\sigma^{2}$ follows a chi-squared distribution with $n-1$ degrees of freedom:
$$ \frac{(n-1)s^{2}}{\sigma^{2}} \sim \chi^{2}_{n-1} $$
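A simulation sketch (not part of the sheet) checking $E(s^{2}) = \sigma^{2}$ and $\mathrm{Var}(s^{2}) = 2\sigma^{4}/(n-1)$ for normal samples; μ = 0, σ = 1, and the replication count are arbitrary choices:

```python
import random

# Repeatedly draw samples of size n from N(0, 1) and compute s^2 each time.
random.seed(1)
n, reps = 10, 20_000

s2_values = []
for _ in range(reps):
    sample = [random.gauss(0.0, 1.0) for _ in range(n)]
    x_bar = sum(sample) / n
    s2 = sum((x - x_bar) ** 2 for x in sample) / (n - 1)  # sample variance
    s2_values.append(s2)

mean_s2 = sum(s2_values) / reps
var_s2 = sum((v - mean_s2) ** 2 for v in s2_values) / reps

print(f"mean of s^2 = {mean_s2:.3f} (theory 1.000)")
print(f"var  of s^2 = {var_s2:.3f} (theory {2 / (n - 1):.3f})")
```

With σ = 1 the theoretical values are $E(s^{2}) = 1$ and $\mathrm{Var}(s^{2}) = 2/9 \approx 0.222$.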
14. Efficiency
Efficiency is associated with smaller variance: if two estimators are both unbiased, the one with the smaller variance is the more efficient.
$$ \text{Relative Efficiency} = \frac{\mathrm{Var}(\hat{\theta}_{2})}{\mathrm{Var}(\hat{\theta}_{1})} $$
15. Covariance
Population covariance: $\mathrm{Cov}(x, y) = E\left[(X - \mu_{X})(Y - \mu_{Y})\right]$. Sample covariance:
$$ s_{xy} = \frac{\sum (x_{i} - \bar{x})(y_{i} - \bar{y})}{n-1} $$
16. Correlation
Population correlation: $\rho = \dfrac{\sigma_{xy}}{\sigma_{x}\sigma_{y}}$.
Sample correlation: $r = \dfrac{s_{xy}}{s_{x}s_{y}}$.
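The sample covariance and correlation formulas applied to a small made-up data set:

```python
# Small illustrative data set (invented for this example)
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# s_xy = sum((x_i - x_bar)(y_i - y_bar)) / (n - 1)
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)

# Sample standard deviations (divisor n - 1)
s_x = (sum((xi - x_bar) ** 2 for xi in x) / (n - 1)) ** 0.5
s_y = (sum((yi - y_bar) ** 2 for yi in y) / (n - 1)) ** 0.5

# r = s_xy / (s_x * s_y): sample correlation
r = s_xy / (s_x * s_y)
print(f"s_xy={s_xy:.3f}, r={r:.3f}")
```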
17. Tests of the difference between two population means, μ_d, when the two samples are dependent
$$ t = \frac{\bar{d}}{s_{d}/\sqrt{n}} $$
Confidence interval:
$$ \bar{d} \pm t_{n-1,\,\alpha/2}\,\frac{s_{d}}{\sqrt{n}} $$
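The paired-samples t statistic on hypothetical before/after measurements (all values invented for illustration):

```python
import math

# Paired (dependent) samples: hypothetical before/after measurements
before = [12.0, 15.0, 11.0, 14.0, 13.0]
after  = [10.0, 14.0, 11.0, 12.0, 11.0]

d = [b - a for b, a in zip(before, after)]  # paired differences
n = len(d)
d_bar = sum(d) / n
s_d = math.sqrt(sum((di - d_bar) ** 2 for di in d) / (n - 1))

# t = d_bar / (s_d / sqrt(n)), with n - 1 degrees of freedom
t = d_bar / (s_d / math.sqrt(n))
print(f"d_bar={d_bar:.2f}, s_d={s_d:.3f}, t={t:.3f}")
```

The statistic would be compared against the $t_{n-1}$ critical value at the chosen significance level.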
18. Tests of the Difference between two Normal Population means: Independent Samples
When $\sigma_{x}^{2}$ and $\sigma_{y}^{2}$ (the population variances) are known, the variance of $\bar{x} - \bar{y}$ is $\dfrac{\sigma_{x}^{2}}{n_{x}} + \dfrac{\sigma_{y}^{2}}{n_{y}}$, and the corresponding Z variable is defined as:
$$ Z = \frac{(\bar{x} - \bar{y}) - (\mu_{x} - \mu_{y})}{\sqrt{\dfrac{\sigma_{x}^{2}}{n_{x}} + \dfrac{\sigma_{y}^{2}}{n_{y}}}} $$
Confidence intervals:
$$ (\bar{x} - \bar{y}) \pm z_{\alpha/2}\sqrt{\frac{\sigma_{x}^{2}}{n_{x}} + \frac{\sigma_{y}^{2}}{n_{y}}} $$
19. Tests of the Difference Between Two Normal Population Means: Independent Samples: $\sigma_{x}^{2}$ and $\sigma_{y}^{2}$ are unknown but assumed equal
$$ t = \frac{(\bar{x} - \bar{y}) - (\mu_{x} - \mu_{y})}{\sqrt{\dfrac{s_{p}^{2}}{n_{x}} + \dfrac{s_{p}^{2}}{n_{y}}}} $$
where the pooled variance is
$$ s_{p}^{2} = \frac{(n_{x} - 1)s_{x}^{2} + (n_{y} - 1)s_{y}^{2}}{n_{x} + n_{y} - 2} $$
Confidence intervals:
$$ (\bar{x} - \bar{y}) \pm t_{n_{x}+n_{y}-2,\,\alpha/2}\sqrt{\frac{s_{p}^{2}}{n_{x}} + \frac{s_{p}^{2}}{n_{y}}} $$
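A worked example of the pooled-variance t statistic on two small invented samples:

```python
import math

# Hypothetical independent samples, variances unknown but assumed equal
x = [5.0, 7.0, 6.0, 9.0, 8.0]
y = [4.0, 6.0, 5.0, 5.0, 5.0]

n_x, n_y = len(x), len(y)
x_bar, y_bar = sum(x) / n_x, sum(y) / n_y
s2_x = sum((v - x_bar) ** 2 for v in x) / (n_x - 1)
s2_y = sum((v - y_bar) ** 2 for v in y) / (n_y - 1)

# Pooled variance: s_p^2 = ((n_x-1)s_x^2 + (n_y-1)s_y^2) / (n_x + n_y - 2)
s2_p = ((n_x - 1) * s2_x + (n_y - 1) * s2_y) / (n_x + n_y - 2)

# t statistic under H0: mu_x - mu_y = 0, with n_x + n_y - 2 df
t = (x_bar - y_bar) / math.sqrt(s2_p / n_x + s2_p / n_y)
print(f"s_p^2={s2_p:.3f}, t={t:.3f}")
```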
20. Test Hypotheses for the Difference Between Two Population Proportions
$$ Z = \frac{\hat{p}_{x} - \hat{p}_{y}}{\sqrt{\dfrac{\hat{p}_{0}(1 - \hat{p}_{0})}{n_{x}} + \dfrac{\hat{p}_{0}(1 - \hat{p}_{0})}{n_{y}}}} $$
where the pooled proportion is
$$ \hat{p}_{0} = \frac{n_{x}\hat{p}_{x} + n_{y}\hat{p}_{y}}{n_{x} + n_{y}} $$
Confidence intervals:
$$ (\hat{p}_{x} - \hat{p}_{y}) \pm z_{\alpha/2}\sqrt{\frac{\hat{p}_{x}(1 - \hat{p}_{x})}{n_{x}} + \frac{\hat{p}_{y}(1 - \hat{p}_{y})}{n_{y}}} $$
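The two-proportion Z test evaluated on hypothetical counts (successes out of trials in each group, invented for illustration):

```python
import math

# Hypothetical counts: successes out of trials in each group
x_succ, n_x = 45, 100
y_succ, n_y = 30, 100

p_x = x_succ / n_x
p_y = y_succ / n_y

# Pooled proportion: p0 = (n_x p_x + n_y p_y) / (n_x + n_y)
p0 = (n_x * p_x + n_y * p_y) / (n_x + n_y)

# z = (p_x - p_y) / sqrt(p0(1-p0)/n_x + p0(1-p0)/n_y)
z = (p_x - p_y) / math.sqrt(p0 * (1 - p0) / n_x + p0 * (1 - p0) / n_y)
print(f"p0={p0:.3f}, z={z:.3f}")
```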
21. Tests of Equality of Two Variances
$$ F = \frac{s_{x}^{2}/\sigma_{x}^{2}}{s_{y}^{2}/\sigma_{y}^{2}} $$
Under the null hypothesis that the population variances are equal, this reduces to:
$$ F = \frac{s_{x}^{2}}{s_{y}^{2}} $$
22. Regression
$$ Y_{i} = \beta_{0} + \beta_{1}X_{i} + u_{i} $$
Formula for the slope:
$$ \hat{\beta}_{1} = \frac{\sum xy - n\bar{x}\bar{y}}{\sum x^{2} - n\bar{x}^{2}} $$
Formula for the intercept:
$$ \hat{\beta}_{0} = \bar{y} - \hat{\beta}_{1}\bar{x} $$
t test:
$$ t = \frac{\hat{\beta}_{1} - \beta}{SE(\hat{\beta}_{1})} $$
Confidence intervals:
$$ \hat{\beta}_{1} \pm t_{(\alpha,\,v)}\,SE(\hat{\beta}_{1}) $$
(e.g. with α = 0.05 this gives a 95% confidence interval).
23. Coefficient of Determination $R^{2}$
$$ R^{2} = \frac{\left(\sum xy - n\bar{x}\bar{y}\right)^{2}}{\left(\sum x^{2} - n\bar{x}^{2}\right)\left(\sum y^{2} - n\bar{y}^{2}\right)} $$
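The regression slope, intercept, and $R^{2}$ formulas computed directly on a small invented data set:

```python
# OLS slope, intercept, and R^2 using the formula-sheet expressions
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

sum_xy = sum(xi * yi for xi, yi in zip(x, y))
sum_x2 = sum(xi**2 for xi in x)
sum_y2 = sum(yi**2 for yi in y)

# Slope: beta1 = (sum(xy) - n*x_bar*y_bar) / (sum(x^2) - n*x_bar^2)
beta1 = (sum_xy - n * x_bar * y_bar) / (sum_x2 - n * x_bar**2)

# Intercept: beta0 = y_bar - beta1 * x_bar
beta0 = y_bar - beta1 * x_bar

# R^2 = (sum(xy) - n*x_bar*y_bar)^2
#       / ((sum(x^2) - n*x_bar^2) * (sum(y^2) - n*y_bar^2))
r2 = (sum_xy - n * x_bar * y_bar) ** 2 / (
    (sum_x2 - n * x_bar**2) * (sum_y2 - n * y_bar**2)
)

print(f"beta1={beta1:.2f}, beta0={beta0:.2f}, R^2={r2:.3f}")
```

For simple regression, $R^{2}$ equals the square of the sample correlation $r$ from section 16.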