Uzair Bhatti Dan Diecker Puji Bandi Latoya Lewis IS 6833 ANALYTICS ASSIGNMENT PREDICTING HOMICIDE RATE IN ST. LOUIS CITY FOR 2013.

Slides:



Advertisements
Similar presentations
Data Analysis: Relationships Continued Regression
Advertisements

Simple Regression Model
Introduction Correlation of Socioeconomic Factors with Cardiovascular Disease Burden in Maryland Xiao-Ying Yu, Maria Prince, Audrey Regan Maryland Department.
Causal Forecasting by Gordon Lloyd. What will be covered? What is forecasting? What is forecasting? Methods of forecasting Methods of forecasting What.
Linear Regression Using Excel 2010 Linear Regression Using Excel ® 2010 Managerial Accounting Prepared by Diane Tanner University of North Florida Chapter.
Homicide Prediction Robert Dobynes Ken Lynch Misty Schutzius Danelle Tate.
Multiple Regression Analysis
Group A: Dan Diecker, Uzair Bhatti, Puji Bandi, Latoya Lewis.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Chapter 14 Introduction to Linear Regression and Correlation Analysis
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Homicide Predictions : Saint Louis City IS6833 February 28, 2010 Group B: Chris Gaynor Robert Jones Kevin Prinke Poernomo Wikandjojo.
St. Louis City Crime Analysis 2015 Homicide Prediction Presented by: Kranthi Kancharla Scott Manns Eric Rodis Kenneth Stecher Sisi Yang.
Chapter 12 Simple Regression
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
More about Correlations. Spearman Rank order correlation Does the same type of analysis as a Pearson r but with data that only represents order. –Ordinal.
Multiple Regression and Correlation Analysis
Linear Regression Example Data
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Statistics: Communicating with Numbers By Sanjiv Jaggia.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
MANAGERIAL ECONOMICS 12th Edition
Analytics for Homicides St. Louis City
Quantitative Demand Analysis
Predicting 2013 Trend of Homicide Rate in St. Louis City By Shih-Hua Chen Yao Zhang Teng ma Group E URL:
Predicting Homicides in St. Louis City for 2013 Chad Iseman - Geoff Hickman - Jon Perkins - Yanhui Long - Mustafa Khalili.
Lessons Learned Discover 101 -D Theresia Wansi 1/08/09.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Extending that Line into the Future St. Louis CMG February 12, 2008 Wayne Bell – UniGroup, Inc.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
PREDICTING THE 2013 SAINT LOUIS CITY HOMICIDE RATE SPENCER SCHNEIDENBACH SHAILESH LANJEWAR XUN ZHOU BEN HOLTMAN.
Ch4 Describing Relationships Between Variables. Pressure.
St. Louis Homicide Analysis Nikolay, Melis, Divya, Ankit.
You want to examine the linear dependency of the annual sales of produce stores on their size in square footage. Sample data for seven stores were obtained.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Project By Vishnu Narasimhan Elizabeth Stillwell Aditya Dhirani Unemployment in the United States.
Regression Analysis of Temporary Assistance for Needy Families/Aid to Families with Dependant Children for By: Ryan Rafacz.
Transportation Engineering (CIVTREN) notes of AM Fillone, DLSU-Manila
ANOVA for Regression ANOVA tests whether the regression model has any explanatory power. In the case of simple regression analysis the ANOVA test and the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Shashin Amatya Yi Gao Lauren Reuther INFSYS-6833 Group B Homicide.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
11/26/2015 Crimes Against Persons Part One Copyright, 2000 Charles L. Feer.
 Looked at Regression Too Small Sample Size  Sought Correlations Too Many  Looked at Linear Trend Lines  Neighborhood Statistics What We Tried.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Lecture 10: Correlation and Regression Model.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
PLC Group: Mr. Keefe Mr. Brewer Mr. Skramstad Student Reading Habits and its Impact on CST.
 Input parameters 1, 2, …, n  Values of each denoted X 1, X 2, X n  For each setting of X 1, X 2, X n observe a Y  Each set (X 1, X 2, X n,Y) is one.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Yvette Garcia, PharmD, BCPS 1 st Year Executive Administration Program.
Houston, Texas FAT CITY, USA Gloria Lobo-Stratton Sharon Lovdahl Dennis Glendenning.
Real Estate Sales Forecasting Regression Model of Pueblo neighborhood North Elizabeth Data sources from Pueblo County Website.
Regression Modeling Applications in Land use and Transport.
IS 6833 ANALYTICS ASSIGNMENT Ying Chen, Sri Murali, John Powell, Scott Weber.
Construction Engineering 221 Probability and Statistics.
REGRESSION REVISITED. PATTERNS IN SCATTER PLOTS OR LINE GRAPHS Pattern Pattern Strength Strength Regression Line Regression Line Linear Linear y = mx.
Chapter 12 Simple Regression Statistika.  Analisis regresi adalah analisis hubungan linear antar 2 variabel random yang mempunyai hub linear,  Variabel.
Chapter 13 Simple Linear Regression
Multi Linear Regression Lab
Presentation transcript:

Uzair Bhatti Dan Diecker Puji Bandi Latoya Lewis IS 6833 ANALYTICS ASSIGNMENT PREDICTING HOMICIDE RATE IN ST. LOUIS CITY FOR 2013

 Homicide is killing of one human being by another. Homicide is a general term; it includes murder, manslaughter, and other criminal homicides as well as noncriminal killings. Murder is the crime of intentionally and unjustifiably killing another. In the U.S., first- degree murder is a homicide committed with premeditation or in the course of a serious felony.  The first type encompasses any homicide resulting from an intentional act done without malice or premeditation and while in the heat of passion or on sudden provocation.  The second type is variously defined in different jurisdictions but often includes an element of unlawful recklessness or negligence.  Noncriminal homicides include killings committed in defense of oneself or another and deaths resulting from accidents caused by persons engaged in lawful acts. DEFINITION

 2008: 167 Total Murder for the Year  2009: 143 Total Murder for the Year  2010: 144 Total Murder for the Year  2011: 113 Total Murder for the Year  2012: 113 Total Murder for the Year  St. Louis is ranked fourth dangerous city in the US for Murders HOMICIDE OVERVIEW IN ST. LOUIS

Data Segmentation We collected data by neighborhoods and districts St. Louis city consists of 9 districts, 79 neighborhoods, 3 Patrol Zones Data analysis Formulated four variables that correlate with the homicide rates in neighborhoods and districts Analyze and depict the relation between these four variables and the homicide occurrence Variables Organized data in excel using pivots tables Analyze data based on year, month and zip codes Built a regression analysis from all the data collected to predict the murder rate for 2013 Conclusion The ultimate goal is to predict number of homicides and the determined location of unlawful homicides in St. Louis city for OUR APPROACH/OBJECTIVE

MURDER FOR PAST FOUR YEARS

MURDER DISTRIBUTION BY ZIP CODE

MURDERS BY MONTH

 Group A as a Team considered many variables to determine potential relationships to homicide.  Due to randomness of Homicides, variables only help determine potential relationships but are no means of causality  Variables  Time  Year, Month,  Education (High School Diploma)  Home / Renter vacancy  Income  Unemployment  Age / Gender  Race  Location: Districts, Zip code, Neighborhoods, and Streets  Poverty  Drugs  Gangs/ Violence VARIABLES CONSIDERED

 Variables used to develop the Regression Model  Median Household Income  Determined median household income by Zip code  Educational  Determined by average high school graduation rate by Zip code  Vacancy percentage of Rented/Owned Houses  Determined average home vacancy by Zip code  Unemployment Rate VARIABLES USED TO PREDICT NUMBER OF HOMICIDES AND LOCATION

 Based on available data we have chosen to use regression model to establish a correlation between data gathered on St. Louis city and the number of homicides  Variables used have established potential relationship with number of homicides. (Source 5)  Used regression analysis to show the relationship between significant variables, and build regression model to predict future homicides PREDICTION APPROACH

 Inconsistent data availability  Data compatibility issues converting zip codes to districts, districts to neighborhoods  Inadequate data for the required variables  Lack of current data  Each department collects data based on different geographic specifications CONSTRAINTS FACING THE MODEL

REGRESSION OUTPUT WITH ALL VARIABLES  The regression output indicates a correlation for number of homicides with fluctuations in High school graduation rates  Correlation of homicides to Mean Income, Unemployment and number of vacant dwellings is weak

Standard Error Observations95 ANOVA dfSSMSFSignificance F Regression Residual Total Coefficien ts Standard Errort StatP-valueLower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept E Mean Household Income-8E Graduation Rate Unemployment Rate Vacancy

REGRESSION OUTPUT WITH DROPPED VARIABLES  More accurate estimate of homicide numbers using stronger correlating data: SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations95 ANOVA dfSSMSF Significance F Regression E-10 Residual Total Coefficients Standard Errort StatP-valueLower 95%Upper 95%Lower 95.0%Upper 95.0% Intercept E Graduation Rate E

 Number of homicides to be predicted in year 2013 can be referred by the statistical model illustrating,  Combination of variables can be used to predict number of homicides based on high school graduation rate, Home / Rent vacancy, Unemployment rate,  Because significance F is less than.05 we can still claim the combination of variables can be used to predict 2013 homicides.  The past 5 year prediction for High school degree attainment is 26.5%. Where as the past 3 year prediction is 26.6%. So we predict that the number of homicides are going to be 109. REGRESSION MODEL EQUATION

 Based on current trends in education levels of people living in these areas, this model predicts a decrease in the number of homicides for 2013  Studies show that the graduation rate for the St. Louis City has gone up significantly (at a current rate of 26.5%)  Based on the past observations of the murder occurrence we predict that Zip code is going to have highest murder rate followed by and respectively PREDICTION

 Education level is a well-recorded data source and can be used for estimation of future trends in homicides.  High school graduation rate has an inverse relation with the homicide rate.  Future data-gathering should be limited to data points that are strongly correlated with homicides and easy to gather. Benefits:  Ease of data maintenance  Easier ‘What if?’ functionality if there are fewer data to consider  Ease of use and timeliness of predictions – quicker to respond and deploy resources where needed. RECOMMENDATIONS

 y_facts.xhtml y_facts.xhtml   Missouri.html (homicide overview in St. Louis) Missouri.html  (4 th dangerous city in the US for Murders)  tes.pdf   REFERENCES