The Normal Distribution

advertisement
The Normal Distribution
Using Models to Describe Data Distributions
•The Density Curve
•All density curves are based on mathematical models
(“equations”) that can be used to describe the
frequency for a given datum
•The area under all densities curves equals 1
the model
the data
A Very Useful Model
•The Normal Distribution is
a model that can be used to
describe data that is:
•Uni-modal
•Symmetric
•Approximates the “bell
curve”
•Can be described by the
equation:
1
N (  , ) 
e
 2
1 x 2
 (
)
2 
The “Rule”
•Normally distributed data has the following
critical property:
The 68-95-99.7 Rule
Example…
• You have just received the score on your
law-school admissions test (LSAT). You
got 163. The exam results are normally
distributed with N(155,2.6) for that year of
testing. In order to apply to a very
prestigious law school you must finish in
the 97th percentile or better. Can you apply
with this score?
Z-Scores and the Standard Normal Distribution
•All normal distributions share the same shape
•A simple linear transformation can convert any normal
distribution to the Standard Form
•This gives us the concept of the Z-Score
•The 68-95-99.7 rule applies here and can give us a deeper
insight into what a z-score means
z
X 

•Converting to Z-scores allows you to use Table A (inside
cover of book)
The Standard Normal Distribution
•Any normal distribution can be converted to the
SND via z-score to N(m,s)  N(0,1)
Using z-scores…
If the z-score for a data
Point is 1 then this means
That 84.13% of the samples
In the population are less than
The value of this data point
How do you
interpret
a z-score of
-1.71?
Z
Who’s the Greatest?
Ty Cobb batted 0.420
In 1911
N(0.266,0.0371)
Ted Williams batted
0.406 in 1941
N(0.267,0.0326)
z = 4.15
z = 4.26
George Brett batted
0.390 in 1980
N(0.261,0.0317)
z = 4.07
Normal Quantile Plots (digging deeper)
• If you want to use what we have just learned to
assess data then you must be sure that the data
fits a normal distribution.
• Visual inspection (stemplot or histogram is a good
start).
• The Normal Quantile Plot is even better…Plot data
against the probability value that you get for the zscore of the data. If the graph is a straightline the
data is normally distributed.
Warning! Your text “messed-up!” All of the normal quantile plots have a
deceptively labeled x-axis – instead of z-score it should be the probability
associated with the z-score.
Example:
•Here is the data from an amazing star WZ Sagittae
•Am I justified in thinking that the noise in this data is
normally distributed?
My comparison
sources – the “fuzz”
in the data is the
“noise” or error
which I attribute to
basic natural
processes of
measuring light
Look at the Data…
“Z-Score” Plot
In conclusion…
• Review the summary on pages 83-84.
Make sure you understand z-scores, what
they mean and how to use them.
• Sample problems to gauge your
understanding: 1.81, 1.93, 1.97
Download