ENGR 610 Applied Statistics Fall 2007 - Week 3 Marshall University CITE Jack Smith Overview for Today Review of Chapter 4 Homework problems (4.57,4.60,4.61,4.64) Chapter 5 Continuous probability distributions Uniform Normal LogNormal Exponential Sampling of the mean, proportion Standard Normal Distribution (Z scores) Approximation to Binomial, Poisson distributions Normal probability plot Central Limit Theorem Homework assignment Chapter 4 Review Discrete probability distributions Binomial Poisson Others Hypergeometric Negative Binomial Geometric Cumulative probabilities Probability Distributions A probability distribution for a discrete random variable is a complete set of all possible distinct outcomes and their probabilities of occurring, where P(X ) 1 i i The expected value of a discrete random variable is its weighted average over all possible values where the weights are given by the probability distribution. E(X) X i P(X i ) i Probability Distributions The variance of a discrete random variable is the weighted average of the squared difference between each possible outcome and the mean over all possible values where the weights are given by the probability distribution. X2 (X i X ) 2 P(X i ) i The standard deviation (X) is then the square root of the variance. Binomial Distribution Each elementary event is one of two mutually exclusive and collectively exhaustive possible outcomes (a Bernoulli event). The probability of “success” (p) is constant from trial to trial, and the probability of “failure” is 1-p. The outcome for each trial is independent of any other trial n! P(X x | n, p) p x (1 p) nx x!(n x)! Binomial Distribution Binomial coefficients follow Pascal’s Triangle 1 11 121 1331 Distribution nearly bell-shaped for large n and p=1/2. Skewed right (positive) for p<1/2, and left (negative) for p>1/2 Mean () = np Variance (2) = np(1-p) Poisson Distribution Probability for a particular number of discrete events over a continuous interval (area of opportunity) Assumes a Poisson process (“isolable” event) Dimensions of interval not relevant Independent of “population” size Based only on expectation value () e x P(X x | ) x! Poisson Distribution, cont’d Mean () = variance (2) = Right-skewed, but approaches symmetric bell-shape as gets large Other Discrete Probability Distributions Hypergeometric Negative Binomial Bernoulli events, but selected from finite population without replacement p now defined by N and A (successes in population N) Approaches binomial for n < 5% of N Number of trials (n) until xth success Last selection is constrained to be a success Geometric Special case of negative binomial for x = 1 (1st success) Cumulative probabilities P(X<x) = P(X=1) + P(X=2) +…+ P(X=x-1) P(X>x) = P(X=x+1) + P(X=x+2) +…+ P(X=n) Continuous Probability Distributions Differ from discrete distributions, in that Any value within range can occur Probability of specific value is zero Probability obtained by cumulating bounded area under curve of Probability Density Function, f(x) Discrete sums become integrals Continuous Probability Distributions P(a X b) P(X b) E(X) b b f (x)dx a f (x)dx xf (x)dx (Mean, expected value) 2 2 (x ) f (x)dx (Variance) Uniform Distribution 1 b a f (x) 0 ab 2 2 (b a) 2 12 a x b elsewhere Normal Distribution Why is it important? Numerous phenomena measured on continuous scales follow or approximate a normal distribution Can approximate various discrete probability distributions (e.g., binomial, Poisson) Provides basis for SPC charts (Ch 6,7) Provides basis for classical statistical inference (Ch 8-11) Normal Distribution Properties Bell-shaped and symmetrical The mean, median, mode, midrange, and midhinge are all identical Determined solely by its mean () and standard deviation () Its associated variable has (in theory) infinite range (- < X < ) Normal Distribution 1 (1/ 2)[( X x )/ x ]2 f (x) e 2 x Standard Normal Distribution 1 (1/ 2)Z 2 f (x) e 2 where Z X x x Is the standard normal score (“Z-score”) With and effective mean of zero and a standard deviation of 1 Normal Approximation to Binomial Distribution For binomial distribution x np x np(1 p) and so X x Z x X np np(1 p) Variance, 2, should be at least 10 Normal Approximation to Poisson Distribution For Poisson distribution x x and so Z X x x X Variance, , should be at least 5 Normal Probability Plot Use normal probability graph paper to plot ordered cumulative percentages, Pi = (i - 0.5)/n * 100%, as Z-scores - or Use Quantile-Quantile plot (see directions in text) - or Use software (PHStat)! Lognormal Distribution 1 f (x) (X ) e 2 ln(x ) e (1/ 2)[(ln(X ) ln(x ) )/ ln(x ) ]2 2 ln(X ) ln(X ) /2 X e 2 2 ln(X ) ln(X ) 2 ln(X ) (e 1) Exponential Distribution f (x) e x 1/ Poisson, with continuous rate of change, Only memoryless random distribution X P(x X) 1 e Sampling Distribution of the Mean Central Limit Theorem x x x x / n p p Continuous data (proportion) (1 ) n Attribute data Homework Ch 5 Skip Ch 6 and Ch 7 Appendix 5.1 Problems: 5.66-69 Statistical Process Control (SPC) Charts Read Ch 8 Estimation Procedures