Chapter 4 Continuous Random Variables and Probability Distributions 4.1 - Probability Density Functions 4.2 - Cumulative Distribution Functions and Expected Values 4.3 - The Normal Distribution 4.4 - The Exponential and Gamma Distributions 4.5 - Other Continuous Distributions 4.6 - Probability Plots POPULATION random variable X Example: X = Cholesterol level (mg/dL) “Density” Pop vals pmf x p(x) x1 p(x1) x2 p(x2) x3 p(x3) ⋮ ⋮ Total 1 Total Area = 1 f ( x) p ( x) (height) (area) p(x) = Probability that the random variable X is equal to a specific value x, i.e., | x p(x) = P(X = x) “probability mass function” (pmf) x (width) p( x) f ( x) x | x X POPULATION random variable X Example: X = Cholesterol level (mg/dL) Pop vals pmf x p(x) x1 p(x1) F(x1) = p(x1) x2 p(x2) F(x2) = p(x1) + p(x2) x3 p(x3) F(x3) = p(x1) + p(x2) + p(x3) ⋮ ⋮ ⋮ Total 1 increases from 0 to 1 cdf F(x) = P(X x) “staircase graph” Total Area = 1 F(x) = Probability that the random variable X is less than or equal to a specific value x, i.e., F(x) = P(X x) “cumulative distribution function” (cdf) | x X POPULATION random variable X Example: X = Cholesterol level (mg/dL) Pop vals pmf cdf x p(x) x1 p(x1) F(x1) = p(x1) x2 p(x2) F(x2) = p(x1) + p(x2) x3 p(x3) F(x3) = p(x1) + p(x2) + p(x3) ⋮ ⋮ ⋮ Total 1 increases from 0 to 1 F(x) = P(X x) Calculating “interval probabilities”… F(b) = P(X b) F(a–) = P(X a–) F(b) – F(a–) = P(X b) – P(X a–) = P(a X b) b p(x) a | | a–a | b X POPULATION random variable X Example: X = Cholesterol level (mg/dL) Pop vals pmf x p(x) x1 p(x1) F(x1) = p(x1) x2 p(x2) F(x2) = p(x1) + p(x2) x3 p(x3) F(x3) = p(x1) + p(x2) + p(x3) ⋮ ⋮ ⋮ Total 1 increases from 0 to 1 Calculating “interval probabilities”… F(b) = P(X b) F(a–) = P(X a–) b a cdf F(x) = P(X x) f ( x) dx F (b) F (a ) b f ( x ) x F ( b ) F ( a ) F(b) – F(a–) = a p( x) P(X b) – P(X a–) = P(a X b) b p(x) a | | a–a | b X FUNDAMENTAL THEOREM OF CALCULUS (discrete form) Reconsider the following discrete random variable… Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” X is uniformly distributed over 1, 2, 3, 4, 5, 6. Probability Table Cumul Prob Probability Histogram P(X = x) P(X x) x p(x) F(x) 1 1/6 1/6 2 1/6 2/6 3 1/6 3/6 4 1/6 4/6 5 1/6 5/6 6 1/6 1 Density Total Area = 1 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability of rolling a 4?” P( X 4) 1 6 1 6 Reconsider the following discrete random variable… Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” X is uniformly distributed over 1, 2, 3, 4, 5, 6. Probability Table Cumul Prob Probability Histogram P(X = x) P(X x) x p(x) F(x) 1 1/6 1/6 2 1/6 2/6 3 1/6 3/6 4 1/6 4/6 5 1/6 5/6 6 1/6 1 Density Total Area = 1 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability of rolling a 4?” “staircase graph” from 0 to 1 P( X 4) 1 6 1 7 Reconsider Consider thethe following following continuous discrete random randomvariable… variable… Example: X = “Ages “value of shown on afrom single random toss of a fair children 1 year old to 6 years old”die (1, 2, 3, 4, 5, 6)” 3, 4, 5,[1, 6. 6]. Further suppose that X is uniformly distributed over 1, the2,interval Probability Table Cumul Prob Probability Histogram P(X = x) P(X x) x p(x) F(x) 1 1/6 1/6 2 1/6 2/6 3 1/6 3/6 4 1/6 4/6 5 1/6 5/6 6 1/6 1 Density Total Area = 1 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability of rolling a 4?” P( X 4) 1 6 1 8 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” 3, 4, 5,[1, 6. 6]. Further suppose that X is uniformly distributed over 1, the2,interval Probability Table Cumul Prob Probability Histogram P(X = x) P(X x) x f(x) F(x) 1 1/6 1/6 2 1/6 2/6 3 1/6 3/6 4 1/6 4/6 5 1/6 5/6 6 1/6 1 Density Total Area = 1 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability a ofchild rolling is a4 4?” years old?” P( X 4) 16 1 9 POPULATION random variable X Example: X = Cholesterol level (mg/dL) Example: X = “reaction time” “Pain Threshold” Experiment: Volunteers place one hand on metal plate carrying low electrical current; measure duration till hand withdrawn. Time Time intervals intervals = 1.0 = 5.0 0.5 2.0 1.0 secs secs “In the limit…” f ( x) we obtain a density curve Total Area = 1 SAMPLE In principle, as # individuals in samples increase without bound, the class interval widths can be made arbitrarily small, i.e, the scale at which X is measured can be made arbitrarily fine, since it is continuous. 10 “In the limit…” we obtain a density curve Cumulative probability F(x) = P(X x) = Area under density curve up to x f(x) = probability density function (pdf) • f(x) 0 • Area = 1 f ( x) 00 F(x) increases continuously from 0 to 1. x x x As with discrete variables, the density f(x) is the height, NOT the probability p(x) = P(X = x). In fact, the zero area “limit” argument would seem to imply P(X = x) = 0 ??? (Later…) However, we can define “interval probabilities” of the form P(a X b), using cdf F(x). 11 “In the limit…” we obtain a density curve Cumulative probability F(x) = P(X x) = Area under density curve up to x F(b) f(x) = probability density function (pdf) F(b) F(a) F(a) • f(x) 0 • Area = 1 f ( x) a b F(x) increases continuously from 0 to 1. a b As with discrete variables, the density f(x) is the height, NOT the probability p(x) = P(X = x). In fact, the zero area “limit” argument would seem to imply P(X = x) = 0 ??? (Later…) However, we can define “interval probabilities” of the form P(a X b), using cdf F(x). 12 “In the limit…” we obtain a density curve Cumulative probability F(x) = P(X x) = Area under density curve up to x F(b) f(x) = probability density function (pdf) F(b) F(a) F(a) • f(x) 0 • Area = 1 f ( x) a b F(x) increases continuously from 0 to 1. a b An “interval probability” P(a X b) can be calculated as the amount of area under the curve f(x) between a and b, or the difference P(X b) P(X a), i.e., F(b) F(a). (Ordinarily, finding the area under a general curve requires calculus techniques… unless the “curve” is a straight line, for instance. Examples to follow…) 13 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. f ( x) 0.20 > 0 Density f ( x) Total Area = 1 1 Check? 1 1 1 6 Base 6= 6 – 16= 5 6 Height = 0.2 1 6 1 6 5 0.2 = 1 X “What is the probability of that rolling a random a 4?” child is 4 years old?” doesn’t mean….. P( X 4) 4.000000000......) 16 A single value is one point out of an infinite continuum of points on the real number line. The probability that a continuous random variable is exactly equal to any single value is ZERO! Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. f ( x) 0.20 Density f ( x) 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability of rolling a 4?” child is 4between 4 and 5 years old?” that a random years old?” actually means.... P(4 ( XX4) 5) = (5 – 4)(0.2) = 0.2 NOTE: Since P(X = 5) = 0, no change for P(4 X 5), P(4 < X 5), or P(4 < X < 5). Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 Density f ( x) For any x, the area under the curve is 1 6 1 6 1F(x) =10.2 (x1– 1). 1 6 6 6 6 X x x or F ( x) 0.2 dt 1 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 F(x) = 0.2 (x – 1) Density f ( x) For any x, the area under the curve is 1 6 1 6 F(x) increases continuously from 0 to 1. 1F(x) =10.2 (x1– 1). 1 6 6 6 6 (compare with “staircase graph” for discrete case) X x x or F ( x) 0.2 dt 1 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 F(x) = 0.2 (x – 1) Density f ( x) F(5) = 0.8 1 6 1 6 1 6 1 6 1 6 1 6 X “What is the probability of rolling a 4?” child is under 5 years old? that a random F (5) P( X 5) 0.2 (5 1) 0.8 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 F(x) = 0.2 (x – 1) Density f ( x) 1 6 1 6 1 6 1 6 1 6 1 6 F(4) = 0.6 X “What is the probability of rolling a 4?” child is under 4 years old? that a random F (4) P( X 4) 0.2 (4 1) 0.6 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 F(x) = 0.2 (x – 1) Density f ( x) F(5) = 0.8 1 6 1 6 1 6 1 6 1 6 1 6 F(4) = 0.6 X “What is the probability of rolling a 4?” child is between 4 and 5 years old?” that a random P(4 X 5) P( X 5) P( X 4) Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) 0.20 F(x) = 0.2 (x – 1) Density f ( x) F(5) = 0.8 1 6 1 6 1 6 1 6 1 6 0.2 1 6 F(4) = 0.6 X “What is the probability of rolling a 4?” child is between 4 and 5 years old?” that a random P(4 X 5) P( X 5) P( X 4) = F(5) F(4) = 0.8 – 0.6 = 0.2 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Further suppose that X is uniformly distributed over the interval [1, 6]. Density f ( x) f ( x) .08 ( x 1) 0 1 Base Height 1) (0.4) Area = (6 2 =1 0.4 Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Cumulative Distribution Function F(x) Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) .08 ( x 1) Density f ( x) F ( x) x base height 1 ( x 1) .08( x 1) 2 .04 ( x 1) 2 F ( x) i.e., x 1 .08(t 1) dt F ( x) x Consider the following continuous random variable… Example: X = “Ages of children from 1 year old to 6 years old” Cumulative Distribution Function F(x) Cumulative probability F(x) = P(X x) = Area under density curve up to x f ( x) .08 ( x 1) Density f ( x) F ( x) base height 1 ( x 1) .08( x 1) 2 .04 ( x 1) 2 F ( x) i.e., x 1 .08(t 1) dt F (5) F (4) x “What is the probability that a child is under 4 years old?” “What is the probability that a child is under 5 years old?” “What is the probability that a child is between 4 and 5?” P( X 4) F (4) P( X 5) F (5) P(4 X 5) A continuous random variable X Cumulative probability function (cdf) In summary… x corresponds to a probability density F ( x) P( X x) f (t ) dt function (pdf) f(x), whose graph is a density curve. f(x) is NOT a pmf! F ( x) f ( x) f ( x) 0 f ( x) f ( x) dx 1 Fundamental Theorem of Calculus P( X any constant a) 0, not f (a) F(x) increases continuously from 0 to 1. b P(a X b) f ( x) dx F (b) F (a ) Moreover… a 25 A continuous random variable X Cumulative probability function (cdf) In summary… x corresponds to a probability density F ( x) P( X x) f (t ) dt function (pdf) f(x), whose graph is a density curve. f(x) is NOT a pmf! F ( x) f ( x) f ( x) 0 E[ X ] f ( x) dx x f1( x) dx E ( X ) 2 2 Fundamental Theorem of Calculus ( x ) F(x)f increases ( x) dx 2 continuously from 0 to 1. E X E[ X ] x f ( x) dx 2 2 P( X any constant a) 0, not f (a) 2 b 2 P(a X b) f ( x) dx F (b) F (a ) Moreover… a 26 SECTION 4.3 IN POSTED LECTURE NOTES 28 Four Examples: 1 For any b > 0, consider the following probability density function (pdf)... Determine the cumulative distribution function (cdf) 2 x, 0 x b f ( x) b 2 F ( x) P( X x) 0, else For any x < 0, it follows that F ( x) P( X x) 0. 2 For any 0 x b, it follows that… b F ( x) P( X x) x 2 b2 2 without calculus... x b2 1 2 0 ( x 0) 2 2 b 2 x x 2 b with calculus... | x 0 b f (t ) dt 0 x 0 2 x2 t dt 2 2 b b 29 Four Examples: 1 For any b > 0, consider the following probability density function (pdf)... Determine the cumulative distribution function (cdf) 2 x, 0 x b f ( x) b 2 F ( x) P( X x) 0, else For any x < 0, it follows that F ( x) P( X x) 0. 2 For any 0 x b, it follows that… b F ( x) P( X x) x 2 b2 2 x 2 b Note: F (0) 0 2 b 2 0 F (b) b 2 b 2 1 For any x b, it follows that… F ( x) P( X x) 1 0 1. 0 | x b 30 Four Examples: 1 For any b > 0, consider the following probability density function (pdf)... Determine the cumulative distribution function (cdf) 2 x, 0 x b x0 f ( x) b 2 F ( x) P( X x) 0, x 2 0, else 2 , 0 xb 2 b xb b 1, 2 x 2 b 1 2 x 2 b 0 b 0 b 31 Four Examples: 2 For any b > a > 0, consider the probability density function (pdf)... 2x 0 xa ab , 2( x b) f ( x) , a xb b( a b) 0, else Determine the cumulative distrib function (cdf) F ( x) P( X x) For any x 0, it follows that F ( x) 0. For any 0 x a, it follows that x F ( x) 0 2t dt 0 ab 2( x b) b(a b) 2x ab For any a x b, it follows that F ( x) F (a ) x a 2(t b) dt b(ab) 1 (b) 0 For any x b, it follows that F ( x) F a 0 x b x 32 Four Examples: 2 For any b > a > 0, consider the probability density function (pdf)... 2x Edistrib [ X ] function x f ((cdf) x) dx Determine the mean cumulative , 0 x a ab a b 2( x b) x f ( x) dx x f ( x ) dx f ( x) , a xb 0 a b ( a b ) a 2x b 2( x b) x dx x dx 0 a 0, ab b(a b) else Determine the variance 2( x b) b(a b) 2x ab 2 0 a E X E X x 2 f ( x) dx 2 2 a 0 2 b 2x 2 2( x b) x dx x dx 2 a ab b(a b) 2 b x 33 Four Examples: 3 Consider the following probability density function (pdf)... 2 , x 1 f ( x) x3 0, x 1 Confirm pdf f ( x) dx 1 E[ X ] x f ( x) dx 2 2 x 3 dx 2 dx 1 1 x x 1 c c x lim 2 x 2 dx lim 2 1 c c 1 1 c c c x c 2 3 dx lim 2 x dx lim 2 3 c 1 c 2 x 1 c 1 1 lim 2 1 lim 2 1 c c c x 1 2 2 2 lim 2 lim 2 c c c x 1 35 Four Examples: 4 3 Consider the following probability density function (pdf)... 12 , x 1 f ( x) x 23 0, x 1 Confirm pdf f ( x) dx 1 1 2 c c x x c c 12 23 dx lim 2 x x dxdx lim 2 32 c c 1 1 x 12 1 1 cc 1 1 11 lim 2 11lim lim 2 1 1 c c c cc x 1 1 E[ X ] x f ( x) dx 1 21 2 x 32 dx 2 dx dx 1 1 x x 1 c c c x 2 lim 2 x x1dx dx lim 2 c 1 1 c 1 1 c c 2 2 lim 2 lim 2 c c c x 1 36 Four Examples: 4 3 Consider the following probability density function (pdf)... 12 , x 1 f ( x) x 23 0, x 1 Confirm pdf f ( x) dx 1 1 2 c c x x c c 12 23 dx lim 2 x x dxdx lim 2 32 c c 1 1 x 12 1 1 cc 1 1 11 lim 2 11lim lim 2 1 1 c c c cc x 1 1 E[ X ] 1 x f ( x) dx 1 1 x 2 dx dx 1 x x lim x 1 dx lim ln | x |1 c c 1 lim (ln c) c c c c 37