Intelligent Decision Support Methods

advertisement
Intelligent Decision Support Methods
From the book-Intelligent
by Vasant Dhar and Roger Stein
Decision Support Methods
Information Systems

“I know of no commodity more valuable than information.”

Management Information System (MIS)
• Transaction Processing Systems
– Accurate Record Keeping
• Decision Support Systems (DSS)
– Model-Driven DSS
– Data-Driven DSS
Intelligent DSS by HCH
2
Intelligence Density

DEF: A Metric for Knowledge Work Productivity.

Knowledge Intensive organizations transform raw data
into something useful-knowledge-and deliver the
knowledge to the part of the organization where it can be
used most effectively.

Intelligence Density: How quickly can you get the essence
of the underlying data from the output?
Intelligent DSS by HCH
3
The Vocabulary of Intelligence Density

Quality of Model
• Accuracy, Explainability, Speed, Reliability..

Engineering Dimension
• Flexibility, Scalability, Ease of Use,...

Quality of Available Resource
• Learning Curve, Tolerances for Noise, Complexity,...

Logistical Constraints
• Independence from Experts, Computational Ease, Development Time,..
Intelligent DSS by HCH
4
Intelligent DSS by HCH
5
Intelligent DSS by HCH
6
Intelligent DSS by HCH
7
Intelligent DSS by HCH
8
Dimensions of Problems and Solutions

Intelligence Density Dimensions: Quality of Systems

How Well is the System Engineered?

Quality of Available Resources

Logistical Constraints
Intelligent DSS by HCH
9
Intelligence Density Dimensions:
Quality of Systems (1/2)

Accuracy
• measures how dose the outputs of a system are to the correct or best
decision. Can you be confident that the errors(results that are not
accurate)are not so severe as to make the sys-tem too costly or dangerous
to use?

Explainabilitv
• is the description of the process by which a conclusion was reached.
Statistical models explain the output to some degree in the sense that each
independent variable influences or ‘explains’ the dependent variable in
that it accounts for some portion of the variance of the dependent variable.
Intelligent DSS by HCH
10
Intelligence Density Dimensions:
Quality of Systems (2/2)
• Other systems, where rule-based reasoning is involved, show exp1icitly
how conclusions are derived, yet others, such as neural networks, generate
opaque mathematical formulas. These are sometimes referred to as 'black
boxes’, because for the user they are the mathematical equivalent of the
magician's black box: Data go in at one end and results come out the other,
but you cannot (easily) see the rationale behind the conclusion.

Response speed
• is the time it takes for a system to complete analysis at the desired level of
accuracy. The flip side to this dimension is confidence in the sense that
you can ask how confident you are that a certain period of time, within
which the system must provide an answer, will be sufficient to perform the
analysis. In applications that require that results be produced within a
specified timeframe, missing that time frame means that no matter how
accurate and otherwise desirable the results are, they will be useless in
practice.
Intelligent DSS by HCH
11
How Well is the System Engineered? (1/3)

Scalability
• involves adding more variables to the problem or increasing the range of
values that variables can take. For example, scalability is a major issue
when you're interested in going from a prototype system involving 10
variables to one with 30 variables. Scalability can be a real problem
when the interactions among variables increase rapidly in unpredictable
ways with the introduction of additional variables(making the system
brittle)or where the computational complexity increases rapidly.

Compactness
• refers to how small (literally, the number of bytes) the system can be
made.Once a system has been developed and tested, it needs to be put into
the hands of the decision makers within an organization. It must be taken
out into the field, be that the shop floor, the trading floor, or the ocean
floor.
Intelligent DSS by HCH
12
How Well is the System Engineered? (2/3)

Flexibility
•

is the ease with which the relationships among the variables or their domains can
be changed, or the goals of the system modified. Most systems are not designed to
be used once and then thrown away. Instead they must be robust enough to
perform well as additional functionality is added over time. In addition, many of
the business processes that you might model are not static (i.e., they change over
time). As a result, the ability to update a system or to have the system adapt itself to
new phenomena important.
Embeddability
•
refers to the ease with which a system can be coupled with or incorporated into the
infrastructure of an organization. In some situations, systems will be components
of larger systems or other databases. If this is the case, systems must be able to
communicate well and mesh smoothly with the other components of the
organization infrastructure. A system that requires proprietary software engineer,or
specific hardware will not necessarily be able to integrate itself into this
infrastructure.
Intelligent DSS by HCH
13
How Well is the System Engineered? (3/3)

Ease of use
• describes how complicated the system is to use for the businesspeople
who will be using it on a daily basis. Is it an application that requires a lot
of expertise or training, or is it something a user can apply right out of the
box?
Intelligent DSS by HCH
14
Quality of Available Resources

Tolerance for noise in data
• the degree to which the quality of a system, most notably its accuracy, is
affected by noise in the electronic data.

Tolerance for data sparseness
•

is the degree to which the quality of a system is affected by
incompleteness or lack of data.
Tolerance for complexity
• is the degree to which the quality of a system is affected by interactions
among the various components of the process being modeled or in the
knowledge used to model a process.

Learning curve requirements
• indicate the degree to which the organization needs to experiment in order
to become sufficiently competent at solving a problem or using a
technique.
Intelligent DSS by HCH
15
Logistical Constraints

Independence from experts
• is the degree to which the system can be designed, built, and tested
without experts. While expertise is valuable, access to experts within an
organization can be a logistical nightmare and can be very expensive.

Computational ease
• is the degree to which a system can be implemented without requiring
special-purpose hardware or software.

Development speed
• is the time that the organization can afford to develop a system or,
conversely, the time a modeling technology would require to develop a
system.
Intelligent DSS by HCH
16
Topics

Data-Driven Decision Support

Evolving Solutions: Genetic Algorithms

Neural Networks

Rule-Based Expert Systems

Fuzzy Logic

Case-Based Reasoning

Machine Learning
Intelligent DSS by HCH
17
Data-Driven Decision Support



OLTP: On-Line Transaction Processing
ISAM: Indexed Sequential Access Method, early DBMS
RDBMS: Relational Database Management Systems
• Data Normalization
• SQL: Sequential/Structured Query Language

EIS: Executive Information Systems
• Friendly & Intelligent User-interface

Data Warehousing and OLAP: On-Line Analytical Process
• LAN: Local Area Network
• Data Loader->Converter->Scrubber->Transformer->Warehouse->OLAP
Intelligent DSS by HCH
18
Intelligent DSS by HCH
19
Intelligent DSS by HCH
20
Intelligent DSS by HCH
21
Intelligent DSS by HCH
22
Evolving Solutions - Genetic Algorithms (I)

Optimization Problems:
• A set of problem variables
• A set of constraints
• A set of objectives

Example:
• ACME Transport, Inc., a shipping firm, needs to plan a delivery route that
will minimize the time and cost of the shipping, but at the same time ,
make delivers to all 10 of its overseas clients.
• Exhaustive Search: evaluate all possible 10! = 3,628,800 routes.
– Problem: If the number of clients increase to 25, then there are 25! = 1.55*10 25 possible
route. Therefore it will take a very fast computer (evaluate a million route per second) to
evaluate only 0.23% possible route in 4 billion years.
• Often not a LP problem
Intelligent DSS by HCH
23
The Example - Genetic Algorithms (II)

Possible constraints to the ACME problem
• Shipping costs must be less than 70% of fee charged.
• Customer waiting time must be less than 90 days.
• If a customer does more than $x of business with ACME then waiting
time must be less than 60 days.

Possible objectives to the ACME problem
•
•
•
•
Overall delivery time is minimized.
Overall profit is maximized.
Ship fleet wear is minimized.
Number of repeated country visits is minimized.
Intelligent DSS by HCH
24
Intelligent DSS by HCH
25
The Origin - Genetic Algorithms (III)

GAs were originally developed by computer scientist John
Holland in the 1970s as experiments to see if computer
programs could evolve in a Darwinian sense.

GAs are very useful for solving classes of problems that
were previously computationally prohibitive, especially in
the area of optimization.

GAs is a heuristic techniques that cannot guarantee
optimal solutions. Only near optimal solutions can be
expected
Intelligent DSS by HCH
26
The Theory of Evolution - Genetic Algorithms (IV)

Basic Concept:
• Natural Selection, i.e., Survival

Different kromes will survive based on the compatibility of
their attributes with their environment. They are hunted by
their predators at night.

Each type of krome represents one solution to the survival
problem. Kromes with better attributes have higher
probability to survive and therefore reproduce,
Intelligent DSS by HCH
27
Intelligent DSS by HCH
28
Introduction - Genetic Algorithms (V)

The smallest unit of a GA is called a gene, representing a
unit of information in your problem domain.

A series of these genes, or a chromosome, represents one
possible complete solution to the problem.

A decoder converts the chromosome into a solution to the
problem. (or interprets the meaning of a chromosome)

A fitness function then is used to determines which
chromosome solutions are good and which are not very
good.
Intelligent DSS by HCH
29
Introduction - Genetic Algorithms (VI)

A GA randomly creates an initial population of
chromosomes and evaluates their fitness.

A new generation (new population of chromosomes) is
created by combining and refining the information in the
chromosome using
• Selection
• Crossover
• Mutation

The process is repeated until a satisfactory solution is
found.
Intelligent DSS by HCH
30
Intelligent DSS by HCH
31
Intelligent DSS by HCH
32
Intelligent DSS by HCH
33
Intelligent DSS by HCH
34
Intelligent DSS by HCH
35
Intelligent DSS by HCH
36
Intelligent DSS by HCH
37
Intelligent DSS by HCH
38
Notes - Genetic Algorithms (VII)

Do not guarantee an optimal solution.

You can use a GA to solve problems that you don’t even
know hoe to solve. All you need to be able to do is
describe a good solution and provides a fitness function
that can rate a given chromosome.

How good a solution provided by a GA is determined by
how good the problem is formulated.
Intelligent DSS by HCH
39
Simulating the Brain to Solve Problems
- Neural Networks (I)

Learning preserves the errors of the past, as well as its
wisdom.

The Learning Process: Induction
• Data
• Generalization
• Model

The Example:
• Over the years, you must have a very good idea how much time you need
to spend on and how to prepare a quiz to get certain grade.
• That is, you build mental models based on the past experiences (data) by
generalization.
Intelligent DSS by HCH
40
The Origin - Neural Networks (II)

Neural networks were first theorized as early as the 1940’s
by two scientists at the University of Chicago (McColloch
and Pitts). Works was done in the mid-1950s as well
(McCarthy 1956; Rosenblatt 1957) when researchers
developed simple neural nets in attempts to simulate the
brain’s cognitive learning processes.

ANNs are simple computer programs that build models
from data by trial and error.

Very useful in modeling complex poorly understood
problems for which sufficient data can be collected.
Intelligent DSS by HCH
41
Nervous Systems - Neural Networks (III)

Our nervous systems consist of a network of individual but
interconnected nerve cells called neurons.

Neurons can receive information (stimuli) from the outside
world at various points in the network.

The information travels through the network by generating
new internal signals that are passed from neuron to neuron.
These new signals ultimately produce a response.

A neuron passes information on to neighbor neurons by
firing or releasing chemicals called neurotransmitters.
Intelligent DSS by HCH
42
Nervous Systems - Neural Networks (IV)

The connections between neurons at which information
transfers are called synapses.

Information can either excite or inhibit neurons.

Synaptic connection can be strengthened (learning) or
weakened (forgetting) over time with experience.

With repeated learning, one can generalize his/her
experience, modifying the response to stimuli, and thus
ultimately reach the level of reflexes.
Intelligent DSS by HCH
43
Introduction - Neural Networks (V)

ANN involves a system of neurons (or nodes) and
weighted connections (the equivalent of synapses) inside
the memory of a computer.

Nodes are arranged in layers:
• Input layer
• Hidden layer
• Output layer

Through learning (trial and error, propagating, other
algorithms), ANN adjusts the weights on each connections
to match the desired response (minimize the amount of
error).
Intelligent DSS by HCH
44
Intelligent DSS by HCH
45
Intelligent DSS by HCH
46
Intelligent DSS by HCH
47
Intelligent DSS by HCH
48
Intelligent DSS by HCH
49
Intelligent DSS by HCH
50
Intelligent DSS by HCH
51
Intelligent DSS by HCH
52
Training Steps of a Neural Network (1/2)



Step l: The network makes a guess based on its current weights and the input
data.
Step 2: The net calculates the error associated with the output (at the out,put
node). For example, if the desired output were 1, but the network output were
0, the error would be +1, based on the difference between l and 0.
Step 3: The net determines by how much and in what direction each of the
weights leading in to this node needs to be adjusted. How?
This is accomplished by calculating how much each of the individual weighted
inputs to the node contributed to the error,given the particular input value. So,
for example, if a node's output were too small, the net might need to
concentrate on (that is,increase) small or negative weights that lead up to that
node. In essence, the network feeds back the information about how well it's
doing to the neurodes in the net, and where possible problems might be.
Intelligent DSS by HCH
53
Training Steps of a Neural Network (2/2)


Step4: The net adjusts the weights of each node in the layer according to the
analysis in the previous step. For example, in the case where thc output was
too small, the neural network will try to increase the values of the positive
weights since that would make the weighted sum larger. This would bring the
output closer to 1, which is what you want in this case. Similarly, the neural
net should also try to decrease the size of the negative weights (or even make
them positive).
Step5:The net repeats the process by performing a similar set of calculations
(Step l-Step3)for-each node in the hidden layer below it. But since you
cannot tell the net what the desired output of each of the hidden nodes should
be (they are internal and hidden), the neural network does a kind of sensitivity
analysis to determine how large the error of each of these nodes is..
Intelligent DSS by HCH
54
Intelligent DSS by HCH
55
Intelligent DSS by HCH
56
Note - Neural Networks (VI)

No domain experts are needed, unlike Rule-based Systems
or Fuzzy Systems.

Excel at mapping relationships on to data that are noisy
and incomplete.

Need adequate learning rate step size.

Avoid over-training. (may accidentally learn from noise)
Intelligent DSS by HCH
57
Intelligent DSS by HCH
58
Intelligent DSS by HCH
59
Putting Expert Reasoning in a Box
- Rule-Based Systems (I)

Learn to reason forward and backward on both sides of a
question.

You can view much of problem solving as consisting of
rules.
•
•
•
•

Automobile/Car Repair
Medical Care
Accounting and Tax Practice
Quality Control
The most famous of RBS, XCON, developed in 1979 by
Digital Equipment Corp.
Intelligent DSS by HCH
60
Intelligent DSS by HCH
61
The Basic Concept
- Rule-Based Systems (II)

CreditBank loan application example

IF employment stability is very low
AND credit history is very low
THEN credit risk is very high

The region that each rule applied as in Fig. 7.1 is called
problem space. Each cube is essentially a rule. In other
words, a rule “samples” a region of the problem space.
Intelligent DSS by HCH
62
Intelligent DSS by HCH
63
The Basic Concept
- Rule-Based Systems (III)

The part before the “then” is referred to as the condition
part of the rule or the left-hand side (LHS), and the part
after the “then” as the action part, or the right hand side
(RHS).

Forward Chaining

Hypothesize

Backward Chaining
Intelligent DSS by HCH
64
The Basic Concept
- Rule-Based Systems (IV)

How the rules are used is flexible and is referred as the
control strategy.

Three basic components
• a rule base
• working memory
• a rule interpreter

Steps (Recognize-Act Cycle)
• Rules are matched against the data
• The interpreter selects one instantiated rule
• The selected rule is fired
Intelligent DSS by HCH
65
Intelligent DSS by HCH
66
Intelligent DSS by HCH
67
Intelligent DSS by HCH
68
Intelligent DSS by HCH
69
Intelligent DSS by HCH
70
The Basic Concept
- Rule-Based Systems (V)

Differences between RBS and Decision Tree
•
•
•
•

How well you understand the problem at that time
Modification
RBS tells nothing about how to do things
One-direction Vs. Multiple directions
The order in which rules are processed affects the results
• Meta Rules
Intelligent DSS by HCH
71
Intelligent DSS by HCH
72
Intelligent DSS by HCH
73
Intelligent DSS by HCH
74
Intelligent DSS by HCH
75
Intelligent DSS by HCH
76
Intelligent DSS by HCH
77
Intelligent DSS by HCH
78
Notes
- Rule-Based Systems

60% to 70% of the time taken to develop rule-based
systems is spent on knowledge acquisition.

Only worth considering when you have experts available.

What-if analysis using dependency network.

The difficulty: making the right rules to fire at the right
time.
Intelligent DSS by HCH
79
Download