Crime Risk Factors Analysis

advertisement
Crime Risk Factors Analysis
Application of Bayesian Network
Introduction
• Urbanization causes the crime problem to increase in
magnitude and complexity
• To provide effective control, with limited resources, law
enforcement agencies need to be proactive in their
approach
• Since cities and crime pattern are dynamic in nature, a
system is required which can develop a model that
provide accurate predictions and must also constantly
updates changes various parameters over time
• BN has been adopted for inference and prediction in the
crime problem domain
Introduction Contd.
• The use of BN enable us to use the incomplete existing
data to setup initial model and to continue the
enhancement of the model’s predictive capabilities as
new data is added
Analysis of the Factors Affecting Crime
Risk
1. Identifying Crime Pattern Characteristic
2. Establish the relationship between various
factors
3. Determine the crime risk level
4. Recognize the crime data pattern (structure
learning)
5. Predict crime risk factors
Expert Probabilities Elicitations
• Eliciting probabilities from experts involve some
difficulties, therefore Keeney and von Winterfeldt’s
process for eliciting expert judgment was employed
• The formal process to elicit probabilities from expert
consists of seven steps
1.
2.
3.
4.
5.
6.
7.
Identification and selection of the issues
Identification and selection of the experts
Discussion and the refinement of the issues
Training for elicitation
Elicitation
Analysis, Aggregation and resolution of disagreement
Documentation and Communication
Data Processing
• The purpose is to prepare raw data for subsequent
process of analysis, it involves three steps
1. Data consolidation
2. Data Selection
3. Data Transformation
• A set of data containing 1000 records were obtained
• The data contain 20 variables, which were organized in
five groups of factors
• Population, Crime Locations, Types of Crimes, Traffic and
Environment
Bayesian Network Model
• The BN developed for this research project was based on
crime pattern analysis carried out by Brantingham and
Brantingham and the theory of crime control through
environmental design
• Pattern theory focuses on the environment and crime,
and maintains that crime location, characteristics of such
locations, movement path that brings offenders and
victims together at such locations, and people’s
perception of crime locations are significant objects for
study
Bayesian Network Model Contd.
• Pattern theory synthesizes Its attempts to explain how
changing spatial and temporal ecological structures
influence crime trends and patterns
• The model was constructed and tested using Hugin
software
• Conditional Probabilities Tables (CPT)
• Sample of CPT
– The probability of each input node was calculated using data
contained in the training example for each state, the variables
have five states, 0=very low, 1=low, 2=medium, 3=high and
4=very high
Learning Bayesian Network
1. Input
–
Consists of prior crime data & background knowledge
2. Output
–
Revised Bayesian Network
3. Learning Algorithm
–
–
To determine which links to be included in the DAG
To determine the parameters
A
Population
statistics, Crime,
Geographic Data
B
Learning
Algorithm
C
Revised Bayesian Network
The Expectation Maximization
Algorithm
• The Expectation Maximization (EM) algorithm
was used within the Hugin software for the data
• In the study since we have the complete set of
data, the EM algorithm was used to count the
frequencies of probabilities, it consist of two
iteration steps
– Expectation step (E-step) calculates the expectation of
the missing statistic
– Maximization step (M-step) maximizes a certain
function
• The two steps are performed until convergence
The Expectation Maximization
Algorithm
Initial network
and training
data
Computation
E-step
Expected
Count
Reparametrize
M-step
Updated
Network
The iteration steps of the Expectation Maximization Algorithm
Results
• The results shows that the factors that
considerably affected crime risk in Bangkok
Metropolitan area were environment, types of
crime, crime location, traffic and population.
• Crime risk probability given murder is “yes”
makes environment (drug sale and low
standard housing) as the primary reason on
the expected murder rate
Results
Factor
Name of variable
State very low
State very high
Environment
Drug Sale
0.2710
0.6935
Low Standard
Housing
0.0664
0.5795
Rape
0.1033
0.2762
Robbery
0.0983
0.4268
Nightclubs
0.0732
0.4548
Shopping center
0.2135
0.2071
Movie theater
0.2874
0.2259
Bank
0.2528
0.2538
Traffic
Traffic volume
0.1875
0.2797
Population
Pop Density
0.1734
0.2334
Crime
Location
Download