DATA MINING APPLICATION IN CRIME ANALYSIS AND CLASSIFICATION

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
DATA WAREHOUSING.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Introduction to Machine Learning Approach Lecture 5.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining: A Closer Look
Business Intelligence
Enterprise systems infrastructure and architecture DT211 4
King Fahd University of Petroleum & Minerals Department of Management and Marketing MKT 345 Marketing Research Dr. Alhassan G. Abdul-Muhmin Secondary Data.
CSCI 347 / CS 4206: Data Mining Module 06: Evaluation Topic 07: Cost-Sensitive Measures.
Evaluating Classifiers
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Repository Method to suit different investment strategies Alma Lilia Garcia & Edward Tsang.
Chapter 4 Pattern Recognition Concepts continued.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
Characterizing Model Errors and Differences Stephen D. Bay and Michael J. Pazzani Information and Computer Science University of California, Irvine
Evaluation – next steps
1 Evaluating Model Performance Lantz Ch 10 Wk 5, Part 2 Right – Graphing is often used to evaluate results from different variations of an algorithm. Depending.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
How Science Works The following PowerPoint is aimed at enhancing skills learnt at GCSE when performing experiments. Pupils must commit the terminology.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
DATA MINING By Cecilia Parng CS 157B.
ASSESSING LEARNING ALGORITHMS Yılmaz KILIÇASLAN. Assessing the performance of the learning algorithm A learning algorithm is good if it produces hypotheses.
ASSESSING LEARNING ALGORITHMS Yılmaz KILIÇASLAN. Assessing the performance of the learning algorithm A learning algorithm is good if it produces hypotheses.
Practical Issues of Classification Underfitting and Overfitting –Training errors –Generalization (test) errors Missing Values Costs of Classification.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall Chapter 5: Credibility: Evaluating What’s Been Learned.
Machine Learning Tutorial-2. Recall, Precision, F-measure, Accuracy Ch. 5.
Data Mining Basics. “Copyright and Terms of Service Copyright © Texas Education Agency. The materials found on this website are copyrighted © and trademarked.
Chapter 5: Credibility. Introduction Performance on the training set is not a good indicator of performance on an independent set. We need to predict.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Evaluating Classifiers Reading: T. Fawcett, An introduction to ROC analysis, Sections 1-4, 7 (linked from class website)An introduction to ROC analysis.
Information Retrieval (based on Jurafsky and Martin) Miriam Butt October 2003.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.
Big Data Quality Panel Norman Paton University of Manchester.
RESEARCH METHODOLOGY Research and Development Research Approach Research Methodology Research Objectives Engr. Hassan Mehmood Khan.
Introducing Precictive Analytics
Etuk, S. O. , Oyefolahan, I. O. , Zubair, H. A, Babakano,F. J
Chapter Three MaxIT WiMax.
Evolving Decision Rules (EDR)
Using Big Data clustering for improved security intelligence
SNS COLLEGE OF TECHNOLOGY
Secondary Data Searches
Evaluation – next steps
Introduction to Data Mining
Chapter 6 Classification and Prediction
Data Mining 101 with Scikit-Learn
Module 02 Research Strategies.
Introduction Data Mining for Business Analytics.
Data Mining Classification: Alternative Techniques
Features & Decision regions
OCR Level 3 Cambridge Technicals in IT
Sampling Design.
A Unifying View on Instance Selection
Prepared by: Mahmoud Rafeek Al-Farra
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
Introduction of Week 9 Return assignment 5-2
How Science Works The following PowerPoint is aimed at enhancing skills learnt at GCSE when performing experiments. Pupils must commit the terminology.
Comparative Evaluation of SOM-Ward Clustering and Decision Tree for Conducting Customer-Portfolio Analysis By 1Oloyede Ayodele, 2Ogunlana Deborah, 1Adeyemi.
Assignment 1: Classification by K Nearest Neighbors (KNN) technique
Peter E, Ayemholan1, Garba, Suleiman2 and Osaigbovo Timothy3
Presentation transcript:

DATA MINING APPLICATION IN CRIME ANALYSIS AND CLASSIFICATION Obuandike Georgina N. 1. John Alhasan 2, M. B. Abdullahi 3 1. Department of Mathematical Sciences and IT, Federal University, Dutsinma, Katsina state, Nigeria. 2,3. Department of Computer Science, Federal University of Technology, Minna, Niger State, Nigeria 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria

Introduction What is data mining Data Mining and Data Warehouse: a close ally, takes 80% of time. Big Data: it come more quickly and in different format, it is characterized by 3Vs, volume, variety and velocity Big Data Analytic: This is the application of advanced analytic techniques to very big data sets. Place of data mining in crime analysis 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 1

Research Questions Is data mining able to unravel hidden patterns from the crime data Is the technique selected appropriate for the analysis 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 2

Methodology Figure 1: Work Methodology Flowchart 3 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 3

Evaluation Metrics Sensitivity: It is a statistics that shows the records that are correctly labelled by the classifier. Specificity: It is simply a report of instances incorrectly labelled as correct instances Precision: Simply measures exact relevant data retrieved. High precision means the model returns more relevant data than irrelevant data. Kappa: measures the relationship between classified instances and true classes. It usually lies between [0, 1], the value of 1 means perfect relationship while 0 means random guessing. Accuracy: this shows the percentage of correctly classified instances in each classification model Time: Implies time taken to perform the 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 4

Exploratory Analysis Result Figure 2: Association of age and educational qualification 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 5

Exploratory Analysis Result cont. Figure 3: Association of offence and educational qualification 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 6

Classification Result Evaluation Metrics NB C4.5 Time 0.05 secs 1.06 secs Accuracy 93 97 TP Rate 0.935 0.971 FP Rate 0.067 0.027 Kappa 0.8696 0.9409 Precision Recall ROC curve 0.989 0.986 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 7

Conclusion Data mining has the capability that makes it simple convenient and suitable for data extraction from large databases. It employs different mining algorithms for it’s work. Many agencies gather data for its operational purposes, such data can be mined to discover some relevant patterns that can aid in decision making. The analysis of crime data will help to unravel crime pattern and nature of those who commits such crime so that appropriate strategies and rules will be put in place to control such crimes. 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 8

Conclusion cont The work reveals that the majority of the inmates that commit crime are between the ages of 18 to 34 and have low or no educational qualification and are either self employed or not doing anything at all. The classification result reveals that 98 percent of these groups of people are threat to the society. Thus, the researchers are of the opinion that government should encourage education and our youths should be gainfully employed. 3rd Big Data Analytics and Innovation Conference, 22-25 November, NDC, Abuja, Nigeria 9