Risk Management Seminar Part 2

advertisement
Risk
Management
Part 2 – Variability and
Risk: Twin Sons of
Different Mothers
by
Donald E. Shannon, PMP, CFCM, CPCM, DML
The Contract Coach, Albuquerque, NM
Disclaimer
Information in this presentation makes reference to various
software products. This should not be interpreted as a
recommendation or endorsement by any of the sponsors of any
one product. Individuals should conduct appropriate research to
identify a product that best meets their specific needs. Where
appropriate, credit has been given to the software OEM
especially where screen shots of their products have been used.
A listing of the products commonly used to accomplish the
simulation and scheduling functions described herein is included
at the end of this presentation
Where Are We Going?
 Quantitative (numerical or statistical) analysis
 What is variability
 The nature of random variables
 How they are depicted
 Confidence intervals
 Variability (uncertainty) in program / project management
 Why estimates never seem to be right
 Adding risk to uncertainty = real world variability
 Adding risk and uncertainty to our cost/schedule model
Variability
Why things are never
exactly alike….
Trivia questions:
1.
2.
3.
Who is the driver in the above?
What year was the photo taken and how can you tell?
On average how long does a F-1 pit stop take in 2014
Trivia questions:
1.
2.
3.
Who is the driver in the above? – Kimi Raikkonen
What year was the photo taken and how can you tell? – 2008. Because Kimi’s car is #1 and he won
the F1 championship in 2007 giving him that number for the following year.
On average how long does a F1 pit stop take in 2014? – 3 Seconds
A 3-Second Pit Stop?
Let’s Fact Check That Claim
Pit Stop Times
2014 F-1 Grand Prix of Spain
 Is the average pit stop time




actually 3.0 seconds?
Data from 13 pit stops during
the most recent F-1 race
Most consistent team was
Williams with 3 stops each of
which was 3.0 seconds
Fastest team was Red Bull with
an average stop of 2.675 seconds
Further analysis would likely
show that the average time of
2.8 in this sample was within the
expected range and there is not
enough evidence to reject the
supposed average of 3.0 seconds
Driver
Team
VET
RB
3.1
2.3
2.7
RIC
RB
2.9
2.4
2.65
GRO
LOT
2.9
-
2.9
MAS
WIL
3.0
-
3.0
BOT
WIL
3.0
3.0
3.0
HAM
MER
3.8
4.3
4.05
ROS
MER
2.8
3.0
2.9
2.68
2.98
3.04
Average
Stop
1
Stop 2
Average
Variability (Uncertainty)
 The output of any task when repeated will vary to some
degree from its predecessors or successors
 Size, weight, volume etc
 Performance time
 Your morning commute is a good example
 Depending on road conditions and traffic it may vary
significantly from day-to-day
 Even when conditions are perfect the time still varies by a few
seconds or minutes
Random Variables

We call these probabilistic events
because the answer can not be
described by a single value.
 The value of these events (time,
cost, etc.) is variable.
 So how do we describe variability




We establish ranges of possible
values
We define measures of central
tendency (average, mean,
median, mode)
We identify how the values are
spread out (variance)
We quantify how far possible
values are displaced from the
center (standard deviation or zscore)
On Closer Examination
Reviewing some terms you probably know
 The pit stop range (low to





high) was 2 Seconds (2.3 – 4.3)
3.0 was the most common
(mode) time recorded
3.0 was the middle value
(median)
3.04 was the arithmetic
average (also called mean or μ)
of all the times recorded
The standard deviation (σ) was
.54
All data lies within +/- 3 σ
Pit Stop Time
8
7
6
5
4
3
2
1
0
μ
-1 σ
2.3 - 2.5 - 2.8 2.5
2.7
3.0
1σ
3.1 3.3
2σ
3.3 3.5
Time
3.5 3.7
3σ
3.8 - 4.0 4.0
4.3
Random Variables
 Random variables are
either discrete (whole
numbers 1,2,3) or
continuous
 There are separate rules for
displaying the data for
each
 Discrete RVs are
typically shown in
histograms (bar charts)
 Continuous RV are
typically shown in
probability distribution
functions (PDF) or
cumulative distribution
functions (CDF)
Histograms
 Data may be either continuous




or discrete
Discrete data is grouped into
classes or “bins”
Bin values displayed on x-axis
Count or number in class
displayed on y-axis
A smoothed line enclosing the
tips of each column may
provide insight as to the
underlying distribution
(Normal, Lognormal, Beta,
etc.
Probability Distribution Function






The total area under the curve is always
equal to 1 (100%)
The tallest point of the curve is the mode
(most often occurring value)
In symmetric distributions the mean and
median are co-located with the mode
If the distribution is not symmetric we
say it is ‘skewed’ with the direction of
the skew (left or right) being where the
preponderance of the values reside.
The shape of the distribution may give a
clue to the underlying distribution but
beware – sometimes the shape is
misleading
Values on the x-axis may be actual or z
(standard deviations). If z scores those
to the left of the mean are negative and
those to the right are positive.
Cumulative Distribution Function
 Values start at zero
 Values end at 1 (or 100
percent)
 Value on the y axis (vertical) is
at cumulative probability for
the value on the x axis i.e., the
sum of the probabilities from
zero to the selected x value
 The shape of the curve is an
indication of the shape of the
probability distribution
function (PDF).
Confidence Intervals
 What is the likelihood of an
event happening?
 Single point estimate of
continuous variable is
undefined
 Therefore we phrase the
question as:
 Probability that x will exceed
some value
 Probability that x is less than
some value
 Probability that x is between a
and b.
Using a PDF to Answer Questions
 Consider the normal distribution
 Average IQ is 100
 Standard deviation is 10 points
 68% lie between 90 – 110
 95.4% lie between 80 – 120
 99.74 lie between 70 – 130
 PDFs help us answer questions
such as:
 What is the probability of
someone having an IQ between
110 and 120? (13.6%)
 What is the probability of an IQ of
125? (Trick question. If IQ is a
continuous random variable the
probability of one precise value is
undefined)
Uncertainty in Program
Management
Why things always take longer and cost more than you planned ….
Murphy’s Law
 Often stated as: “Anything that can go wrong, will go wrong”
 References to the principle date to at least 1877
 Famous corollaries include:
 If anything just cannot go wrong, it will anyway
 … Usually in the worst possible (or most inconvenient) time in the
worst possible way.
 Things will be lost or damaged in inverse relationship to their value
or need.
 But the real culprit in all of this is nature’s uncertainty.
Uncertainty
 Uncertainty is an admission that the
outcome of any event is a random variable
 Therefore the data describing that event
(cost, performance time etc.) will be variable
 Forecasting the outcome of an uncertain
event can only be stated probabilistically
 Probabilistic outcomes typically involve
 A percent likelihood the event will occur
 Sometimes an indication of underlying
distribution (uniform, binomial, normal,
etc,)
 When we plan for uncertain events we
typically plan for the “most likely” outcome
 “On average” it costs “x” or takes “y” days
 Sometimes we do better
 Sometimes we do worse
Please Fasten Your Seatbelts
Statistics Airline Flight 101 Now Departing
 We can’t talk about uncertainty and quantitative risk
management without delving into statistics.
 Keep in mind that the goal is not to teach you the math
but the underlying concepts and terminology
 We’ll let the computer do the math – all we care about is
the output
 But to understand the output you have to know the
concepts.
Uncertainty in Program
Management
 The time to perform a task (e.g., a pit
stop) is a random variable
 Because of this nature, when we
express a task duration as a specific
value (single point estimate) we
ignore uncertainty
 Single point estimates are typically
“most likely” values
 Better estimates are possible if we
describe 3 points
 Minimum (Best Case)
 Maximum (Worst Case)
 Most Likely
 The three point estimate is especially
useful if dealing with ‘expert opinion”
Task Performance Time
Most Likely (m)
9
6
Minimum (a)
15
Maximum (b)
Triangular Distribution
 Triangle distribution is a continuous
frequency distribution often used to
model random variables (cost or
performance time) in program
management
Most Likely (m)
 Fast
 Easy to use
 Provides reasonable accuracy
9
 Tends to be slightly optimistic i.e.,
values returned tend to be a little less
than what ends up being the case
 Formula
 Mean = a+m+b ÷ 3 = 30 ÷3 = 10
 Mode = m = 9
 STDEV = σ=√(a2+m2+c2-am-ab-mc)
18
15
6
Minimum (a)
μ =10
Maximum (b)
σ=1.58
Note: Don’t worry about doing the number crunching – that’s why we have
software to do this for us!
Triangular Distribution
 Probability of a value of x is given





by the area of interest.
Example. What is the probability
that x is less than 7.25?
f(x) = 2(x-a)
(b-a)(m-a)
f(x) = 2 x (7.25-6)
(15-6)(9-3)
f(x) = 2.5 / 54 = .0462963
p(x) = ½ * .0462963 * 1.25 = .029
or 2.9%
Most Likely (m)
9
6
15
7.25
Minimum (a)
μ =10
Maximum (b)
σ=1.58
Note: Don’t worry about doing the number crunching – that’s why we have
software to do this for us!
Special Use Of the Triangular
Distribution
 Some scheduling applications use a




“weighted” version of the triangle
The PERT Triangular Distribution
distribution called the “PERT”
Most Likely (m)
distribution
PERT (Program Evaluation and
Review Technique) adds (statistical)
9
emphasis to the “Most Likely”
value and weigh it 4 times as likely
as the Minimum or Maximum
Average (mean) is then = a+4m+b /
15
6
6
Criticism is that PERT tends to be
Minimum (a)
μ
Maximum (b)
optimistic
=9.5
σ=1.50
Second criticism is that PERT
ignores likelihood of events outside
3 points (closed interval)
Other Distributions Used
In Program Management
 Beta Distribution is a typically
mound shaped continuous
distribution
 Very flexible and can take on a
number of shapes depending
on the parameters used
 May be either closed form
(upper example) or open form
(lower example)
 Is less optimistic than
triangular
Other Distributions Used
In Program Management
 Lognormal Distribution is a
typically mound shaped
continuous distribution
 Very flexible and can take on a
number of shapes depending
on the parameters used
 Open ended toward the upper
side allowing for extreme
values
 Is the resultant distribution
when two triangular
distributions are combined.
Why Estimates of Cost or
Schedule End Up Being Wrong
Murphy was an optimist ……
Leading Factors
 Optimism bias
 Everything takes longer than planned
 Everything costs more than you thought
 Improper estimates
 Corrupt data
 Failure to properly consider risk
Optimism Bias
 Values predicted are “better” or
“rosier” than real world
experience would show
 Sources of Optimism Bias
 Expert Opinion on average tends
to be as much as 25% optimistic
for commonly performed tasks1
 Use of the triangle distribution
when another (e.g., Beta) is
more appropriate
 The sequence of collecting
estimates for Minimum,
Maximum and Most likely can
introduce bias2
1 Bias in Memory Predicts Bias in Estimation of Future Task Durations, Roy 2007
2 Herding Cats: Why 3 Point Estimates Create False Optimism
Improper Estimates
 Estimates are commonly built
on mathematical models
 Models must be appropriate for
the application selected
 Models must be used within the
range they were designed to
predict
 Another estimating technique is
to compare something new to
something already done
 Effort is often scaled by some
percentage
 Only as good a technique as the
data on which it is based and the
similarity of the two
tasks/projects
Risk
 Risk adds an additional element
to uncertainty
 Broadens the range of possible
values – usually to:
 Increase costs
 Extend performance time
 Risk is an “event” such that it
either takes place or does not
take place
 Risk events have 3 parameters
 Polarity ( +/- )
 Likelihood – their chance of
occurring
 Impact – the cost, delay, or
opportunity associated with the
risk
Risk as a Variable
 Risks are a two-step process
 Step 1 … Does risk occur
 Step 2 … What is the impact
Delay associated with risk
Most Likely (m)
 Risk impacts are often
modeled using the triangular
distribution
3
 Minimum impact (cost, delay,
or both)
 Most likely impact
 Worst case impact
 Data comes from risk register
as was completed in Part 1
1
Minimum (a)
6
Maximum (b)
When Worlds Collide
What happens if we combine a variable (cost or schedule) and risk ….
How Risk Changes an Event
 If the risk event does not
happen, no change to event
 If the risk event happens then
the task performance time will
be the sum of two probabilities
 Task Performance time (no
risk)
 Risk Impact
 Summing probability
distributions is a bit tricky
Task Performance Time
Most Likely (m)
9
15
6
Minimum (a)
Maximum (b)
+
Delay associated with risk
Most Likely (m)
3
1
Minimum (a)
6
Maximum (b)
Combining Probability
Distributions
Using Method of Moments
to Combine PDF’s
 Method of Moments Technique
 Analytical technique
 Used to calculate the “moments” of
the combined distribution
 The resultant distribution from
adding two triangular distributions is
a lognormal distribution.1
 The Moments of that are:
 Mean = μ = μ1 + μ2 … μn
 Variance = σ2 = it depends2
 Skewedness1 = ϑ =
 Kurtosis = κ = 12/5 = 2.387
 The math needed to calculate these
is outside the scope of this
presentation
1. Analytic Method for Cost and Schedule Risk Analysis, Raymond P.
Covert, NASA, 5 April, 2013, pp 34 - 37
2. Calculating variance for the sum of two distributions is complicated
when the two distributions are correlated. Formula shown is for
correlated data
Using Simulation to
Combine PDF’s
 Monte Carlo Technique
 Generate random number
between 0 and 1
 Convert random number to
duration based on triangular
distribution 6,9,15
 Generate second random number
between 0 and 1
 Convert second random number
to duration based on triangular
distribution 1,3,6
 Add duration 1 to duration 2
 Record value
 Repeat numerous times
 Compute statistics such as mean
etc. from collected data
Why Do I Favor Simulation?
 Mostly because it provides
equivalent results without having
to be a math major.
 Praised for
 Ability to provide statistics of a
simulated CDF or PDF formed by
complex modeling of random
variables
 Ease of use
 Criticized for:
 Non-uniform sampling
 Unable to correlate two
distributions using Pearson
product – moment correlation
coefficients
 Does not provide reasonable
results with small number of trials
Simulation output of 3,460 iterations
© Intavar Institute Risky Project 6.0
Going Forward
Adding risk and variability to our cost and schedule estimates ….
Time to Build Our Model
 Our model of the effort
(contract, project, etc) must
include:
 Tasks to be performed
 Resources to be used in
accomplishing those tasks
 Labor
 Materials
 Other (Travel, subcontracts,
etc.)
 Cost data for the each of the
resources and tasks
 Schedule data
 Risk Data
Tools to be Used
 Excel or Numbers spreadsheet
 Project, Primavera, FastTrack,
or other scheduling application
 Simulation program such as:
 @Risk
 Risky Project
 Primavera
 Full Monty
What We Will Do
 Create Work Breakdown






Structure
Expand (decompose) WBS into
various tasks
Identify resources for each
task
Create risk register
Map risks to each task
Determine cost baseline with
and without risk
Create probabilistic estimates
for cost at completion and
completion date.
Summary
 The language of quantitative




analysis is statistics – learn the
language
Schedule (task) duration and
cost are random variables
Nothing is certain until it
occurs
Risk is another random
variable that modifies cost,
schedule, or both
You don’t need a PhD to do
the math … let the software do
that while you interpret the
results
Produced by: The
Contract Coach
The Contract Coach
5338 La Colonia Dr NW
Albuquerque, NM 87120
(505) 259-8485
http://www.contract-coach.com
Monte Carlo Simulation Software
 MS Excel Add-in
 @Risk
 Oracle Crystal Ball
 Risk Solver
 General Purpose Products
 Analytica
 GoldSim
 Reno
 Oracle Crystal Ball
 SPSS
 MS Project Add-in
 Full Monty
 Risky Project (also
supports Primavera P6)
 Enterprise Systems
 Risk+ (Deltek)
 Primavera EPPM &
Primavera Risk (Oracle)
Download