Chapter 2

advertisement
The best way to memorize formulas is by practice.
Work as hard as the bees.
SUMMARY OF FORMULAS/TESTS (CHAPTER 2 – CHAPTER 7)
οƒ˜ Chapter 2: Summarizing and Graphing Data
1. Construct a frequency distribution: π‘π‘™π‘Žπ‘ π‘  π‘€π‘–π‘‘π‘‘β„Ž ≈
(π‘šπ‘Žπ‘₯π‘–π‘šπ‘’π‘š π‘£π‘Žπ‘™π‘’π‘’)−(π‘šπ‘–π‘›π‘–π‘šπ‘’π‘š π‘£π‘Žπ‘™π‘’π‘’)
π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘™π‘Žπ‘ π‘ π‘’π‘ 
π‘π‘™π‘Žπ‘ π‘  π‘“π‘Ÿπ‘’π‘žπ‘’π‘’π‘›π‘π‘¦
2. Relative frequency = π‘ π‘’π‘š π‘œπ‘“ π‘Žπ‘™π‘™ π‘“π‘Ÿπ‘’π‘žπ‘’π‘’π‘›π‘π‘–π‘’π‘ 
οƒ˜ Chapter 3: Statistics for Describing, Exploring, and Comparing Data
1. Sample mean: π‘₯ =
∑π‘₯
𝑛
∑: the sum of a set of values
X: the individual data values
n: the number of values in a sample
2. Population mean: µ =
∑π‘₯
𝑁
N: the number of values in a population
3. Median: First arrange the values from the smallest to the greatest. And then, if the
number of the values is odd, the middle value of the list is the median; if the
number of the values is even, the average value of the middle numbers is the
median.
4. π‘šπ‘–π‘‘π‘Ÿπ‘Žπ‘›π‘”π‘’ =
π‘šπ‘Žπ‘₯π‘–π‘šπ‘’π‘š π‘£π‘Žπ‘™π‘’π‘’+π‘šπ‘–π‘›π‘–π‘šπ‘’π‘š π‘£π‘Žπ‘™π‘’π‘’
2
5. range = (maximum value) − (minimum value)
6. Mean from frequency distribution: π‘₯ =
7. Weighted mean: π‘₯ =
∑(π‘“βˆ™π‘₯)
∑𝑓
∑(𝑀·π‘₯)
∑𝑀
8. Sample standard deviation: 𝑠 = √
∑(π‘₯−π‘₯)²
𝑛−1
𝑛 ∑(π‘₯ 2 )−(∑ π‘₯)²
or 𝑠 = √
𝑛(𝑛−1)
∑(π‘₯−µ)²
9. Population standard deviation: 𝜎 = √
10. Sample variance: 𝑠² =
∑(π‘₯−π‘₯)²
𝑛−1
11. Population variance: 𝜎² =
or 𝑠² =
𝑁
𝑛 ∑(π‘₯ 2 )−(∑ π‘₯)²
𝑛(𝑛−1)
∑(π‘₯−µ)²
𝑁
12. Range rule of thumb:
For estimating a value of the standard deviation s: 𝑠 ≈
π‘Ÿπ‘Žπ‘›π‘”π‘’
4
For interpreting a known value of the standard deviation:
Minimum “usual” value = (mean) - 2× (standard deviation)
Maximum “usual” value = (mean) + 2× (standard deviation)
13. Coefficient of variation (or CV):
𝑠
Sample: 𝐢𝑉 = π‘₯ · 100%
Population: 𝐢𝑉 =
𝜎
µ
· 100%
14. Measures of relative standing:
Sample: 𝑧 =
π‘₯−π‘₯
Population: z =
𝑠
x−µ
σ
Ordinary values: -2 ≤ z score ≤ 2
Unusual values: z score < -2
or
z score > 2
15. Interquartile range (or IQR) = Q3 – Q1
Semi-interquartile range =
Midquartile=
𝑄3−𝑄1
2
𝑄3+𝑄1
2
10 – 90 percentile range = P90 – P10
οƒ˜ Chapter 4: probability
1. Relative frequency approximation of probability:
𝑃(𝐴) =
π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘–π‘šπ‘’π‘  𝐴 π‘œπ‘π‘π‘’π‘Ÿπ‘Ÿπ‘’π‘‘
π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘–π‘šπ‘’π‘  π‘‘β„Žπ‘’ π‘‘π‘Ÿπ‘–π‘Žπ‘™ π‘€π‘Žπ‘  π‘Ÿπ‘’π‘π‘’π‘Žπ‘‘π‘’π‘‘
P: probability
A, B and C: specific events
P(A): the probability of event A occurring
2. Classical approach to probability (requires equally likely outcomes):
π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘€π‘Žπ‘¦π‘  𝐴 π‘π‘Žπ‘› π‘œπ‘π‘π‘’π‘Ÿ
𝑃(𝐴) =
π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘–π‘“π‘“π‘’π‘Ÿπ‘’π‘›π‘‘ π‘ π‘–π‘šπ‘π‘™π‘’ 𝑒𝑣𝑒𝑛𝑑𝑠
3. Formal Addition Rule: P(A or B) = P(A) + P(B) – P(A and B)
P(A and B) is the that A and B both occur at the same time as an outcome in a trial of
procedure.
4. Rule of complementary events: P(A) + P(𝐴) = 1
5. Formal Multiplication Rule: P(A and B) = P(A) · P(B|A)
P(B|A) is the probability of event B occurring after it is assumed that event A has
already occurred. If A and B are independent events, P(B|A) is really the same as P(B).
6. Conditional probability: P(B|A)=
𝑃(𝐴 π‘Žπ‘›π‘‘ 𝐡)
𝑃(𝐴)
𝑛!
7. Permutations Rule (when Items are all different): π‘›π‘ƒπ‘Ÿ = (𝑛−π‘Ÿ)!
 There are n different items available. We select r of the n items (without
replacement). We consider rearrangements of the same items to be different
sequences.
𝑛!
8. Permutations Rule(when some items are identical): 𝑛1!𝑛2!β‹―π‘›π‘˜!
 There are n items available and some items are identical to others. We select
all of the n items (without replacement). We consider rearrangements of
distinct items to be different sequences.
𝑛!
9. Combinations Rule: π‘›πΆπ‘Ÿ = (𝑛−π‘Ÿ)!π‘Ÿ!
 There are n different items available. We select r of the n items (without
replacement). We consider rearrangement of the same items to be the
same.
οƒ˜ Chapter 5: Discrete Probability Distributions
1. Mean for a probability distribution: µ = ∑[π‘₯ · 𝑃(π‘₯)]
2. Variance for a probability distribution: 𝜎² = ∑[(π‘₯ − µ)² · 𝑃(π‘₯)]
or 𝜎² = ∑[π‘₯² · 𝑃(π‘₯)] − µ²
3. Standard deviation for a probability distribution: σ = √∑[π‘₯² · 𝑃(π‘₯)] − µ²
4. Expected value: 𝐸 = ∑[π‘₯ · 𝑃(π‘₯)]
5. Binomial distribution:
Mean: µ = np
Variance: σ² = npq
Standard deviation: σ = √π‘›π‘π‘ž
𝑛!
Binomial probability: 𝑃(π‘₯) = (𝑛−π‘₯)!π‘₯! · 𝑝 π‘₯ · π‘ž 𝑛−π‘₯
for x = 0, 1, 2, …, n
n: number of trials
x: number of successes among n trials
p: probability of success in any one trial
q: probability of failure in any one trial (q = 1 – p)
P(x): the probability of getting exactly x successes among the n trials
 Each trial must have all outcomes classified into two categories (commonly
referred to as success and failure).
6. Poisson distribution:
Poisson probability: P(x) =
µπ‘₯ ·π‘’ −µ
π‘₯!
Where e ≈ 2.71828
Mean: µ
Standard deviation: 𝜎 = √µ
 The random variable x is the number of occurrences of an event over some
interval.
 0 ≤ P(x) ≤1
οƒ˜ Chapter 6: Normal Probability Distributions
1. Z score formula: 𝑧 =
π‘₯−µ
𝜎
(round z scores to 2 decimal places)
 When working with an individual value from a normally distributed
population, use the formula 𝑧 =
𝒙−µ
𝜎
 Other forms of the z score formula:
π‘₯ = µ + (𝑧 · 𝜎)
µ=x–z·σ
.
𝜎=
π‘₯−µ
𝑧
𝑧=
 X, µ and σ are known⇔
π‘₯−µ
𝜎
𝑧 π‘‡π‘Žπ‘π‘™π‘’
z ⇔
probability
2. The Central Limit Theorem:
π‘₯−µ
z score formula: 𝑧 =
𝜎
√𝑛
Standard deviation of the sample mean: σπ‘₯ =
𝜎
√𝑛
 When working with a mean for some sample (or group), use the formula
=
𝒙−µ
𝜎
√𝑛
.
π‘₯−µ
𝑧= 𝜎
 π‘₯, µ, σ and n are known ⇔
√𝑛
𝑧 π‘‡π‘Žπ‘π‘™π‘’
z ⇔
probability
οƒ˜ Chapter 7: Estimates and Sample Sizes
Proportion
1. Estimate a population proportion:
Margin of error for proportion: 𝐸 = 𝑧α/2√
𝑝̂ π‘žΜ‚
𝑛
Confidence interval for population proportion p: 𝑝̂ – E < p < 𝑝̂ + E
P: population proportion
π‘₯
𝑝̂ : sample proportion, 𝑝̂ =𝑛
π‘žΜ‚: sample proportion of failures in a sample of size n, π‘žΜ‚ = 1 - 𝑝̂
Zα/2: critical z score based on the desired confidence level
 The sample estimate is a single value (or point) used to approximate a
population parameter.
2. Sample size for estimating Proportion p:
When 𝑝̂ is known, 𝑛 =
[𝑍α/2]² π‘Μ‚π‘žΜ‚
𝐸²
[𝑍𝛼/2]²·0.25
When 𝑝̂ is unknown, 𝑛 =
𝐸²
 If the computed sample size is not a whole number, round it up to the next
higher whole number.
3. Finding the point estimate and E from a confidence interval:
π‘ƒπ‘œπ‘–π‘›π‘‘ π‘’π‘ π‘‘π‘–π‘šπ‘Žπ‘‘π‘’ π‘œπ‘“ 𝑝: 𝑝̂ =
Margin of error: 𝐸 =
(π‘’π‘π‘π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)+(π‘™π‘œπ‘€π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)
2
(π‘’π‘π‘π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)−(π‘™π‘œπ‘€π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)
2
Population Mean
1. Estimate a population mean: σ known
Margin of error for mean: E = zα/2 ·
𝜎
√𝑛
Confidence interval estimate of the population mean µ: π‘₯ − 𝐸 < πœ‡ < π‘₯ + 𝐸
Sample size: 𝑛 = [
π‘π‘Ž/2𝜎
𝐸
]²
2. Estimate a population mean: σ unknown but s known
Margin of error for mean: E = tα/2 ·
𝑠
√𝑛
Confidence interval estimate of the population mean µ: π‘₯ − 𝐸 < πœ‡ < π‘₯ + 𝐸
Sample size: 𝑛 = [
π‘‘π‘Ž/2𝑠
𝐸
]²
3. Finding the point estimate and E from a confidence interval:
Point estimate of µ: π‘₯ =
Margin of error: 𝐸 =
(π‘’π‘π‘π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)+(π‘™π‘œπ‘€π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)
2
(π‘’π‘π‘π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)−(π‘™π‘œπ‘€π‘’π‘Ÿ π‘π‘œπ‘›π‘“π‘–π‘‘π‘’π‘›π‘π‘’ π‘™π‘–π‘šπ‘–π‘‘)
2
4. Estimate a population variance:
Chi-square distribution: π‘₯² =
(𝑛−1)𝑠²
𝜎²
, refer to Table A-4, df = n – 1
Confidence interval for the population variance σ²:
(𝑛−1)𝑠²
𝑋²π‘…
< σ² <
(𝑛−1)𝑠²
𝑋²πΏ
Download