Applied Kernel Density Estimation: Dynamic Spatiotemporal Analysis of Density Maps on Crime Data David G. Smalling.

Slides:



Advertisements
Similar presentations
Plans to improve estimators to better utilize panel data John Coulston Southern Research Station Forest Inventory and Analysis.
Advertisements

Experiments and Variables
Query Optimization of Frequent Itemset Mining on Multiple Databases Mining on Multiple Databases David Fuhry Department of Computer Science Kent State.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
1 Approximated tracking of multiple non-rigid objects using adaptive quantization and resampling techniques. J. M. Sotoca 1, F.J. Ferri 1, J. Gutierrez.
Patch to the Future: Unsupervised Visual Prediction
GP Applications Two main areas of research Testing genetic programming in areas other techniques have been applied to. Applying genetic programming to.
Uniform Crime Report (UCR) FBI Compiles data from the nation’s law enforcement agencies on crime for: Numbers of arrests Reports of crimes This is the.
Daozheng Chen 1, Mustafa Bilgic 2, Lise Getoor 1, David Jacobs 1, Lilyana Mihalkova 1, Tom Yeh 1 1 Department of Computer Science, University of Maryland,
Simulation.
1 Validation and Verification of Simulation Models.
Digital Scheme Inspection. Android Based Application for Regular inspection accomplishment and reporting. It can be utilized by different Departments.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
A PREDICTION APPROACH TO AGRICULTURAL LAND USE ESTIMATION Ambrosio L., Marín C., Iglesias L., Montañés J., Rubio L.A. Universidad Politécnica Madrid. Spain.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Constructing Individual Level Population Data for Social Simulation Models Andy Turner Presentation as part.
1 Patch Complexity, Finite Pixel Correlations and Optimal Denoising Anat Levin, Boaz Nadler, Fredo Durand and Bill Freeman Weizmann Institute, MIT CSAIL.
RESEARCH A systematic quest for undiscovered truth A way of thinking
Mapping and Measuring Our World Ernie Laden and Kate Barton Curriculum Project Education 200.
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Using Geographic Information Systems (GIS) as.
IS415 Geospatial Analytics for Business Intelligence Lesson 10: Geospatial Data Analysis- Point Patterns Analysis.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Role of Statistics in Geography
The Tools of Geography FrancisciWG.1. Remember: Geography is the science that studies the lands, the features, the inhabitants and the phenomena of the.
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Geographic Techniques for Teachers GCU 674. Today’s Challenges Local, National, Global Environmental, Social, Political, Economic … What is done to help.
Data Collection and Sampling
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
Time Series Analysis and Forecasting
Chapter 5 Models and UML Notation for The Object-Oriented Approach.
Researcher requires geographical subset due to disk space restriction. Goes directly to NSIDC site hoping for “one stop shopping” at
Designing Calendar Visualization for Visual Analytics Law Enforcement Toolkit Shantanu Joshi, Kushal Patel, Abish Malik, Ross Maciejewski, Timothy F. Collins,
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
Intro to Human Geography.  Two-dimensional representation or flat model of the Earth’s surface or a portion of it.  What are the different uses of a.
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
Spatial Analysis of Crime Data: A Case Study Mike Tischler Presented by Arnold Boedihardjo.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
CS532 TERM PAPER MEASUREMENT IN SOFTWARE ENGINEERING NAVEEN KUMAR SOMA.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Time Series Analysis and Forecasting. Introduction to Time Series Analysis A time-series is a set of observations on a quantitative variable collected.
Applications of Supervised Learning in Bioinformatics Yen-Jen Oyang Dept. of Computer Science and Information Engineering.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
CISC Machine Learning for Solving Systems Problems Presented by: Eunjung Park Dept of Computer & Information Sciences University of Delaware Solutions.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Image registration aligns the common features of two images. The open-source Insight Toolkit (ITK, funded by the National Library of Medicine) provides.
CJ 102 Criminology. Chapter Two: The Nature and Extent of Crime.
Non-parametric Methods for Clustering Continuous and Categorical Data Steven X. Wang Dept. of Math. and Stat. York University May 13, 2010.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Modelling & Simulation of Semiconductor Devices Lecture 1 & 2 Introduction to Modelling & Simulation.
Michael Xie, Neal Jean, Stefano Ermon
Uncertainty Analysis in Emission Inventories
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Software Engineering (CSI 321)
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
COSC 6335 Data Mining Fall 2009: Assignment3a Post Analysis
Analyzing Reliability and Validity in Outcomes Assessment Part 1
THE STAGES FOR STATISTICAL THINKING ARE:
Group 9 – Data Mining: Data
Analyzing Reliability and Validity in Outcomes Assessment
Chapter 2 The Incidence of Crime
STATISTICS derived from the Latin word STATUS, Italian word STATISTA, German word STATISTIK, and French word STATISTIQUE which express one meaning “ Political.
Presentation transcript:

Applied Kernel Density Estimation: Dynamic Spatiotemporal Analysis of Density Maps on Crime Data David G. Smalling

My Project Entails A detailed study of crime movement in New Haven, Connecticut and Kingston, Jamaica, in which I will analyze the effects of sociological trends such as political upheavals and placement of landmarks (police stations), and etc. The development of a system which utilizes latitude and longitude data to generate and image sequence of density maps which will ultimately be used to construct a video of the study region. Detailed study of the interactions between topographical changes and cluster movement. An attempt to quantify Spatiotemporal Correlation between multiple clusters. Development of algorithmic heuristics relative to the dataset in order to reduce the computational complexity of density map generation.

Previous Work Prof. Parris Lyew Ayee of the University of the West Indies in Kingston Jamaica has provided me with a dataset containing very detailed accounts of violent (non- fatal) crimes in Kingston, Jamaica for the year Professor Ayee is himself an expert in spatial modeling and analysis. After having completed groundbreaking research and University of Oxford in Geographic Information Systems, he has supervised numerous studies for the Jamaica Constabulary Force and has significantly optimized the positioning of police efforts in Jamaica.

Other uses of KDE Recent work done by the University of Ljubljana demonstrated methods by which physical laws may be extracted from experimental data with unprecedented accuracy. Kernel Density Estimation is also often used by crime prevention agencies such as the Federal Bureau of Investigation (FBI) to analyze subtle trends in criminal movement and to predict the movement of repeat offenders. The U.S. Department of Education has utilizes a software package under a program called Project Strike Back student financial aid records for the FBI.

Design Definition Frame A frame of size f is a density map derived from a collection of latitude-longitude data from f different rows.

Design The traversal of the aforementioned frame through the dataset is at the underpinning of this method of analysis. For a specified collection of data points in a Euclidean environment such as latitude-longitude data, we take data from the mth row to the (m+f-1)th row and derive a density map corresponding to f elements inclusive. The construction of the necessary image sequence involves continuing this process, taking data from (m+1)th row to the (m+f)th row up to the frame spanning (D-f+1)th row to the Dth row. This technique is depicted graphically below. Where D = dimrow(dataset).

Design

Calibration In order to accurately pin the generated density maps to the locations, a plot is exported that contains the bounds of our study region and some sample points. These points are then loaded into a map system with verified accuracy (eg: Google Earth). Using these images, accurate scaling of the density plot to the map of the study region may be achieved.

Calibration

Notes This method of analysis proves useful in the study of non-organized crime environments and observation of aggregate movement of petty crimes that are unplanned and usually quasi- random in nature. Due to the computational complexity of the algorithms described above it was difficult to run comprehensive tests on the datasets.

Questions?