Crime Risk Factors Analysis Application of Bayesian Network.

Slides:



Advertisements
Similar presentations
Decision Support and Artificial Intelligence Jack G. Zheng May 21 st 2008 MIS Chapter 4.
Advertisements

Decision Support and Artificial Intelligence Jack G. Zheng July 11 th 2005 MIS Chapter 4.
1 WHY MAKING BAYESIAN NETWORKS BAYESIAN MAKES SENSE. Dawn E. Holmes Department of Statistics and Applied Probability University of California, Santa Barbara.
Bayesian network classification using spline-approximated KDE Y. Gurwicz, B. Lerner Journal of Pattern Recognition.
Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.
Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Bayesian Network and Influence Diagram A Guide to Construction And Analysis.
Preparing Data for Quantitative Analysis
1 Knowledge Engineering for Bayesian Networks. 2 Probability theory for representing uncertainty l Assigns a numerical degree of belief between 0 and.
Introduction of Probabilistic Reasoning and Bayesian Networks
Modeling Human Reasoning About Meta-Information Presented By: Scott Langevin Jingsong Wang.
. Learning – EM in ABO locus Tutorial #08 © Ydo Wexler & Dan Geiger.
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
Overview Full Bayesian Learning MAP learning
Bayesian network inference
Parameterising Bayesian Networks: A Case Study in Ecological Risk Assessment Carmel A. Pollino Water Studies Centre Monash University Owen Woodberry, Ann.
Data Mining Techniques Outline
Accounting for Variation in Youth Homicide Trends Across Cities in the United States Kevin J. Strom, RTI International and Kirk R. Williams, University.
Knowledge Engineering a Bayesian Network for an Ecological Risk Assessment (KEBN-ERA) Owen Woodberry Supervisors: Ann Nicholson Kevin Korb Carmel Pollino.
. Learning Bayesian networks Slides by Nir Friedman.
Statistical Methods Chichang Jou Tamkang University.
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
Improvements in the Spatial and Temporal representation of the Model Owen Woodberry Bachelor of Computer Science, Honours.
Creating Architectural Descriptions. Outline Standardizing architectural descriptions: The IEEE has published, “Recommended Practice for Architectural.
Data Mining CS 341, Spring 2007 Lecture 4: Data Mining Techniques (I)
Knowledge Engineering a Bayesian Network for an Ecological Risk Assessment (KEBN-ERA) Owen Woodberry Supervisors: Ann Nicholson Kevin Korb Carmel Pollino.
Total Quality Management BUS 3 – 142 Statistics for Variables Week of Mar 14, 2011.
Geographic Information System Geog 258: Maps and GIS February 17, 2006.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Science and Engineering Practices
1 New York State Mathematics Core Curriculum 2005.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
The Development of Decision Analysis Jason R. W. Merrick Based on Smith and von Winterfeldt (2004). Decision Analysis in Management Science. Management.
1 Chapter Seven Introduction to Sampling Distributions Section 1 Sampling Distribution.
Léon van Berlo / Jos van Leeuwen The Neighbourhood Wizard Cause and effect of changes in urban neighbourhoods.
1 The Aggregate Rail Ridership Forecasting Model: Overview Dave Schmitt, AICP Southeast Florida Users Group November 14 th 2008.
Least-Mean-Square Training of Cluster-Weighted-Modeling National Taiwan University Department of Computer Science and Information Engineering.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Cognitive Computer Vision Kingsley Sage and Hilary Buxton Prepared under ECVision Specific Action 8-3
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Management & Development of Complex Projects Course Code MS Project Management Perform Qualitative Risk Analysis Lecture # 25.
Bayesian Networks for Data Mining David Heckerman Microsoft Research (Data Mining and Knowledge Discovery 1, (1997))
Introduction to Bayesian Networks
Effective Wayfinding in Airports Providing Information to Airport Operators A.C. Farr and K.L Mengersen Introduction Wayfinding is the consistent use and.
Decision Trees. MS Algorithms Decision Trees The basic idea –creating a series of splits, also called nodes, in the tree. The algorithm adds a node to.
Estimating Component Availability by Dempster-Shafer Belief Networks Estimating Component Availability by Dempster-Shafer Belief Networks Lan Guo Lane.
CSE 5331/7331 F'07© Prentice Hall1 CSE 5331/7331 Fall 2007 Machine Learning Margaret H. Dunham Department of Computer Science and Engineering Southern.
Active learning Haidong Shi, Nanyi Zeng Nov,12,2008.
SOFTWARE METRICS Software Metrics :Roadmap Norman E Fenton and Martin Neil Presented by Santhosh Kumar Grandai.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Expectation-Maximization (EM) Algorithm & Monte Carlo Sampling for Inference and Approximation.
International Conference on Fuzzy Systems and Knowledge Discovery, p.p ,July 2011.
Learning and Acting with Bayes Nets Chapter 20.. Page 2 === A Network and a Training Data.
MODULE 2 READNESS. OBJECTIVES Prioritize personal values and describe their relationship to incident management and firefighter safety Company Officer’s.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Using Bayesian Networks to Predict Plankton Production from Satellite Data By: Rob Curtis, Richard Fenn, Damon Oberholster Supervisors: Anet Potgieter,
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Hidden Markov Model Parameter Estimation BMI/CS 576 Colin Dewey Fall 2015.
A fault tree – Based Bayesian network construction for the failure rate assessment of a complex system 46th ESReDA Seminar May 29-30, 2014, Politecnico.
An Algorithm to Learn the Structure of a Bayesian Network Çiğdem Gündüz Olcay Taner Yıldız Ethem Alpaydın Computer Engineering Taner Bilgiç Industrial.
Chapter 3: risk measurement
Irina Rish IBM T.J.Watson Research Center
Week 11 Knowledge Discovery Systems & Data Mining :
CS498-EA Reasoning in AI Lecture #20
Artificial Intelligence Chapter 20 Learning and Acting with Bayes Nets
Pegna, J.M., Lozano, J.A., and Larragnaga, P.
An Introduction to Correlational Research
Chapter 20. Learning and Acting with Bayes Nets
Presentation transcript:

Crime Risk Factors Analysis Application of Bayesian Network

Introduction Urbanization causes the crime problem to increase in magnitude and complexity To provide effective control, with limited resources, law enforcement agencies need to be proactive in their approach Since cities and crime pattern are dynamic in nature, a system is required which can develop a model that provide accurate predictions and must also constantly updates changes various parameters over time BN has been adopted for inference and prediction in the crime problem domain

Introduction Contd. The use of BN enable us to use the incomplete existing data to setup initial model and to continue the enhancement of the model’s predictive capabilities as new data is added

Analysis of the Factors Affecting Crime Risk 1.Identifying Crime Pattern Characteristic 2.Establish the relationship between various factors 3.Determine the crime risk level 4.Recognize the crime data pattern (structure learning) 5.Predict crime risk factors

Expert Probabilities Elicitations Eliciting probabilities from experts involve some difficulties, therefore Keeney and von Winterfeldt’s process for eliciting expert judgment was employed The formal process to elicit probabilities from expert consists of seven steps 1.Identification and selection of the issues 2.Identification and selection of the experts 3.Discussion and the refinement of the issues 4.Training for elicitation 5.Elicitation 6.Analysis, Aggregation and resolution of disagreement 7.Documentation and Communication

Data Processing The purpose is to prepare raw data for subsequent process of analysis, it involves three steps 1.Data consolidation 2.Data Selection 3.Data Transformation A set of data containing 1000 records were obtained The data contain 20 variables, which were organized in five groups of factors Population, Crime Locations, Types of Crimes, Traffic and Environment

Bayesian Network Model The BN developed for this research project was based on crime pattern analysis carried out by Brantingham and Brantingham and the theory of crime control through environmental design Pattern theory focuses on the environment and crime, and maintains that crime location, characteristics of such locations, movement path that brings offenders and victims together at such locations, and people’s perception of crime locations are significant objects for study

Bayesian Network Model Contd. Pattern theory synthesizes Its attempts to explain how changing spatial and temporal ecological structures influence crime trends and patterns The model was constructed and tested using Hugin software Conditional Probabilities Tables (CPT) Sample of CPT – The probability of each input node was calculated using data contained in the training example for each state, the variables have five states, 0=very low, 1=low, 2=medium, 3=high and 4=very high

C Learning Bayesian Network 1.Input – Consists of prior crime data & background knowledge 2.Output – Revised Bayesian Network 3.Learning Algorithm – To determine which links to be included in the DAG – To determine the parameters Population statistics, Crime, Geographic Data Learning Algorithm BA Revised Bayesian Network

The Expectation Maximization Algorithm The Expectation Maximization (EM) algorithm was used within the Hugin software for the data In the study since we have the complete set of data, the EM algorithm was used to count the frequencies of probabilities, it consist of two iteration steps – Expectation step (E-step) calculates the expectation of the missing statistic – Maximization step (M-step) maximizes a certain function The two steps are performed until convergence

The Expectation Maximization Algorithm Initial network and training data Expected Count Updated Network Computation E-step Reparametrize M-step The iteration steps of the Expectation Maximization Algorithm

Results The results shows that the factors that considerably affected crime risk in Bangkok Metropolitan area were environment, types of crime, crime location, traffic and population. Crime risk probability given murder is “yes” makes environment (drug sale and low standard housing) as the primary reason on the expected murder rate

Results FactorName of variableState very lowState very high EnvironmentDrug Sale Low Standard Housing CrimeRape Robbery LocationNightclubs Shopping center Movie theater Bank TrafficTraffic volume PopulationPop Density