CHAPTER 6.stat.doc

advertisement
CHAPTER SIX:
Probability
In the social sciences, researchers speak of
likelihood; not proof. The study of abstract constructs
does not lend itself to direct manipulation or observation.
Likelihood of abstract events can be determined by
assessing the probability of the event. PROBABILITY can be
expressed as a proportion. If an event can only turn out
two ways, and only one of those ways is acceptable, the
chances of an acceptable outcome are 1 in 2. Expressed as a
fraction, this becomes 1/2: the numerator is the number of
acceptable outcomes and the denominator is the number of
outcomes, acceptable and unacceptable.
Consider a die (one of a pair of dice) with 6 equal
sides. Each side has a different number, 1 through 6. If
any even number will do, the chance of a success is 3 in 6,
or 3/6. This reduces to 1/2, which can be expressed as 5050. The chances of it going either way (success or failure
to get what was wanted) are 50%.
LIKELIHOOD AS AREA:
If likelihood can be expressed as a fraction, or
proportion, than likelihood can be depicted as the
proportion of area under the curve of a polygon. If any
score less than the mean of a distribution is acceptable,
than the chances of getting an acceptable score is 50% and
can be depicted as the lower half of a polygon:
The likelihood of an event can be determined when all
possible outcomes are known and then used to form a
frequency distribution of all of those potential outcomes,
presented as a polygon. If a distribution is normally
distributed, the area under the curve can be determined
with a calculus derivative. This has already been done for
Z-scores between -4 to +4 in the UNIT NORMAL TABLE. Divided
into three columns, column A identifies the Z Score, column
B identifies all the area below the Z score and column C
identifies all the area above the Z score. The area in B +
C always sums to 1, the total area of one distribution.
When X = u, Z = 0, column B contains .5000, column C
contains .5000, and the likelihood of being larger that the
u is 50% (as is the likelihood of being smaller).
NOTATION:
To indicate the question, "What is the likelihood that
a score is less than some value?" (ie 10) the notation
would appear:
p(x<10)=_______.
Above, the p stands for probability ( 0r proportion).
The likelihood that a score is greater would appear:
p(x>10)=_______.
Either could be solved by computing the Z-score for X.
This will require a knowledge of u and σ. Once computed,
the z-score could be used to determine the area up to the
desired point. If the Z is negative, the order of columns B
and C are reversed. The Unit Normal Table (UNT) translates
raw scores (X) to likelihoods (p-values) via the Z-scores.
If u = 5 and σ = 5, then 10 is 1 standard of deviation
above the mean , or Z = +1. It is placed at approximately
the 84th%. That is, .8413 or 84.13% of the distribution is
lower than a score of 10. The likelihood of doing better
that 10 is only 15.87%. There is a much greater likelihood
of getting a smaller score than 10 (84.13%, to be exact).
BINOMIALS:
When a question about an outcome can be stated as a
dichotomy (the either/or scenario), the area under the
curve can be approximate from the UNT via the BINOMIAL
formula. The binomial approximation asks a discrete
question of a continuous data set.
If n = the number of trials;
p = the probability of a success;
q = the probability of a failure;
u = n x p
and the sigma of the binomial is the √ of (n)*(p)*(q)
then Z, for the binomial becomes:
Z(Bi) = X - np / σ
Because the binomial approximation asks a discrete
question of a continuous set (the UNT), caution must be
taken to use the upper limit of X when estimating
likelihood above the number, and the lower limit of X is
used when estimating below the number. Further, you can
determine the area under the curve for a discrete number.
That means, in the binomial case, you can determine
p(X<10), p(X>10), p(X=10), p(X<10) and the p(X>10).
Download