APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.

Perspective
Research Techniques
Accessing, Examining and Saving Data
Univariate Analysis – Descriptive Statistics
Constructing (Manipulating) Variables
Association – Bivariate Analysis
Association – Multivariate Analysis
Comparing Group Means – Bivariate
Multivariate Analysis – Regression

Lecture 7 Multivariate Analysis With Linear Regression

Lectures 5 and 6 examined methods for testing relationships between two variables (bivariate analysis). Many projects, however, require testing the association of multiple independent variables with a dependent variable (multivariate analysis). Multivariate analysis is performed after the researchers understand the characteristics of the individual variables (univariate) and the relationships between any two variables (bivariate).

Reasons for Multivariate Analysis
Social behavior is usually associated with many factors and cannot be explained by its association with just one variable. By including more than one variable in the statistical model, the researcher can create a more accurate model to predict or explain social behavior.

Reasons for Multivariate Analysis
Multivariate analysis can account for the influence of spurious factors by introducing control variables.

Linear Regression
Used when an increase in an independent variable is associated with a consistent and constant change in the dependent variable. The dependent variable should be numeric and approximately normally distributed; strictly, it is the residuals of the model that must follow a normal distribution.

LR: A Bivariate Example
Using the States data, we will study the relationship between poverty and teen births.

LR: A Bivariate Example
The graph indicates that teenage births seem to increase with the poverty rate. Using linear regression, we will create an equation that describes this tendency.
Load the States dataset.

LR: A Bivariate example

R² measures the usefulness of the model. A value of 1 indicates that 100% of the variation in the dependent variable is explained by variation in the independent variable. Here, a value of 0.455 indicates that 45.5% of the variation in the teenage birth rate from state to state can be explained by variation in poverty rates. The remaining 54.5% is due to other factors not included in the model.
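To make the "share of variation explained" reading concrete, here is a minimal sketch of how R² can be computed from observed and predicted values. The numbers are made up for illustration and are not the States data; the function name is hypothetical.

```python
def r_squared(observed, predicted):
    """Share of the variation in `observed` that is explained by `predicted`."""
    mean_y = sum(observed) / len(observed)
    # Total variation of the dependent variable around its mean.
    ss_total = sum((y - mean_y) ** 2 for y in observed)
    # Variation left unexplained by the model (squared residuals).
    ss_resid = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    return 1 - ss_resid / ss_total

# Made-up values for illustration only (not the States data).
observed = [30.0, 42.0, 51.0, 65.0]
predicted = [32.0, 40.0, 53.0, 63.0]
r2 = r_squared(observed, predicted)
print(round(r2, 3))
```

A perfect model (predicted equal to observed) gives exactly 1, matching the "100% of the variation" case described above.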

LR: A Bivariate Example
The ANOVA tests whether the model fits the data. The results indicated that the variation explained by the regression model was about 41 times larger than the variation due to other factors. The P value fell below the significance threshold, indicating that a ratio this large is very unlikely to arise by random chance, i.e. the model fits the data.
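The ratio reported by the ANOVA can be related to R² through the standard identity F = (R²/k) / ((1 − R²)/(n − k − 1)). The sketch below applies it to the R² of 0.455 from this example. The number of cases, n = 51, is an assumption (50 states plus the District of Columbia), not a figure stated in the lecture.

```python
def f_from_r_squared(r2, n, k):
    """F ratio for a regression with k independent variables fitted to n cases."""
    explained_per_df = r2 / k                      # explained variation per model df
    unexplained_per_df = (1 - r2) / (n - k - 1)    # residual variation per residual df
    return explained_per_df / unexplained_per_df

# 0.455 is the R-squared from the bivariate example.
# n = 51 is an ASSUMPTION (50 states plus DC), not taken from the lecture.
f_ratio = f_from_r_squared(0.455, n=51, k=1)
print(round(f_ratio, 1))
```

Under that assumption the result is close to the "about 41" quoted on the slide.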

LR: A Bivariate Example
B (the slope) is the size of the difference in the dependent variable corresponding to a change of one unit in the independent variable. The value of 2.735 in this model indicates that for every 1 percentage point increase in the poverty rate there is a predicted increase in the teen birth rate of nearly 3 births (2.735). The significance score indicates that there is a significant association between teen birth rate and poverty.

LR: A Bivariate Example
The constant (intercept) is the predicted value of the dependent variable when the independent variable is zero. In this case, the constant indicates that the model predicts about 15 teen births per 1,000 teenage women even if there were no poor people in a state.
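As a rough illustration of where the slope and the constant come from, the following sketch fits a least-squares line by hand. The data are hypothetical, chosen only to give round results, and `fit_line` is an illustrative helper rather than anything used in the lecture (which relies on SPSS).

```python
def fit_line(x, y):
    """Least-squares constant and slope for one independent variable."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Co-variation of x and y, and variation of x, around their means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    constant = mean_y - slope * mean_x
    return constant, slope

# Hypothetical poverty rates (%) and teen birth rates (per 1,000 women).
poverty = [8.0, 10.0, 12.0, 14.0, 16.0]
teen_births = [38.0, 41.0, 49.0, 52.0, 60.0]
a, b = fit_line(poverty, teen_births)
print(f"constant = {a:.2f}, slope = {b:.2f}")
```

The slope is read as the predicted change in teen births for a one-unit change in poverty, and the constant as the predicted value when poverty is zero.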

Making Predictions
The linear regression equation is: Y’ = a + bX
Y’ is the predicted value of the dependent variable
a is the constant
b is the slope
X is the value of the independent variable

Making Predictions
In our case, the regression equation is approximately: Y’ = 15 + 2.735X (using the rounded constant of 15 and the slope of 2.735).
If we wanted to predict the teenage birth rate for a poverty rate of 20%: Y’ = 15 + 2.735 × 20 ≈ 69.7 births per 1,000 teenage women.
Predictions should be limited to the available range of values of the independent variable (in our case, between 1% and 22%).
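The prediction step, including the advice to stay inside the observed range, can be sketched as follows. The constant of 15 is the rounded value quoted in the lecture and the slope is 2.735; the function name and the choice to raise an error outside the range are illustrative assumptions.

```python
CONSTANT = 15.0            # rounded constant quoted in the lecture
SLOPE = 2.735              # slope reported in the lecture
X_MIN, X_MAX = 1.0, 22.0   # observed range of poverty rates (%)

def predict_teen_birth_rate(poverty_rate):
    """Predicted teen births per 1,000 women for a poverty rate in the observed range."""
    if not X_MIN <= poverty_rate <= X_MAX:
        # Extrapolating beyond the data is not supported by the model.
        raise ValueError("poverty rate is outside the range used to fit the model")
    return CONSTANT + SLOPE * poverty_rate

print(round(predict_teen_birth_rate(20), 1))
```

Because the constant is rounded, the result is approximate; a poverty rate of 30% would be rejected rather than extrapolated.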

Graphing Bivariate Regression lines

Multiple Linear Regression
The regression model includes more than one independent variable. We’ll look at some factors affecting the teenage birth rate:
Poverty (PVS500)
Expenditures per pupil (SCS141)
Unemployment rate (EMS171)
Amount of welfare a family gets (PVS526)

Multiple Linear Regression

MLR: Coefficients
Looking at the significance tests for the coefficients, only 2 are significant:
States with higher poverty rates have higher teenage birth rates: 1.506 more births per 1,000 women for every 1 percentage point rise in the poverty rate.
States that give more welfare aid have lower teen birth rates for every $1 given as welfare aid (a negative coefficient).

MLR: R-Squared
MLR uses the Adjusted R² instead of R². R² never decreases when a variable is added, so the adjusted value penalizes variables that do not contribute to the model. The Adjusted R² in this case, 0.594, indicates that the model accounts for 59.4% of the variation in the teenage birth rate.
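The usual adjustment is Adjusted R² = 1 − (1 − R²)(n − 1)/(n − k − 1), where n is the number of cases and k the number of independent variables. The sketch below uses hypothetical inputs, since the unadjusted R² and the number of cases are not given here.

```python
def adjusted_r_squared(r2, n, k):
    """Adjust R-squared for k independent variables and n cases."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# HYPOTHETICAL inputs: R-squared of 0.60, 51 cases, 4 independent variables.
adj = adjusted_r_squared(0.60, n=51, k=4)
print(round(adj, 3))
```

The adjusted value is always at most the unadjusted one, and the gap grows as more variables are added relative to the number of cases.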

MLR: R-Squared
The ANOVA indicates that the variation explained by the variables considered is about 19 times the variation due to other causes. The P value (P < 0.001) indicates that the model is a good fit to the data.

Multiple Regression Equation
The equation is: Y’ = a + 1.506 X1 + b2 X2 + 2.515 X3 + b4 X4
(a, b2 and b4 are read from the coefficients output)
X1: Poverty rate in 1998 (PVS500)
X2: Expenditures per pupil (SCS141)
X3: Unemployment rate (EMS171)
X4: Amount of welfare received (PVS526)

Graphing the Multiple Regression
The multiple regression equation is: Y’ = a + b1 X1 + b2 X2 + b3 X3 + b4 X4
Y’ is the predicted value of the dependent variable
a is the constant
bi is the slope for variable i
Xi is the value of independent variable i

Graphing the Multiple Regression
The dependent variable is plotted against one independent variable at a time. The other variables are held constant, at any value, but usually at their mean values.
We will graph the association between welfare benefits and teenage birth rates, holding poverty rates, school expenditures and unemployment rates at their mean values.
This requires computing TEENPRE, the predicted value of the teen birth rate according to the equation.

Graphing the Multiple Regression
Transform > Compute
Target Variable: TEENPRE
Numeric Expression: a + (1.506*12.73) + (b2 * mean of SCS141) + (2.515*4.16) + (b4 * PVS526), with a, b2, b4 and the mean of SCS141 read from the SPSS output
Type and Label > Label: Predicted Teenage Birth Rate
Continue > OK
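The same computation can be sketched outside SPSS. The poverty and unemployment coefficients and means below are the ones given in the lecture. The constant, the school-expenditure coefficient and mean, and the welfare coefficient are not available here, so they are passed in as parameters and filled with hypothetical placeholder values in the example. Including the constant in the expression is also an assumption, based on the regression equation.

```python
B_POVERTY, MEAN_POVERTY = 1.506, 12.73            # from the lecture
B_UNEMPLOYMENT, MEAN_UNEMPLOYMENT = 2.515, 4.16   # from the lecture

def teenpre(pvs526, constant, b_school, mean_school, b_welfare):
    """Predicted teen birth rate with every variable except welfare held at its mean."""
    return (constant
            + B_POVERTY * MEAN_POVERTY
            + b_school * mean_school
            + B_UNEMPLOYMENT * MEAN_UNEMPLOYMENT
            + b_welfare * pvs526)

# HYPOTHETICAL placeholder values, chosen only so that the example runs.
example = dict(constant=40.0, b_school=-0.001, mean_school=6000.0, b_welfare=-0.05)
for welfare in (200.0, 400.0, 600.0):
    print(welfare, round(teenpre(welfare, **example), 2))
```

Only the welfare term varies from row to row, which is why plotting TEENPRE against PVS526 gives a straight line whose slope is the welfare coefficient.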

Graphing the Multiple Regression

Linear Regression Concerns
Linear relationships
A numerical dependent variable
Normality of residuals: the residuals should follow a normal distribution with a mean of 0. Check whether this is the case by saving and plotting the residuals when running the MLR.
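One way to examine saved residuals is a normal quantile-quantile comparison, in which each sorted residual is set against the value a normal distribution would put at the same rank. The sketch below uses hypothetical residuals and the Python standard library. The plotting position (i + 0.5)/n is one common convention and an assumption here, not something prescribed by the lecture.

```python
from statistics import NormalDist, mean, stdev

def normal_qq_pairs(residuals):
    """Pair each sorted residual with the normal quantile expected at its rank."""
    n = len(residuals)
    # Reference distribution: mean 0, same spread as the residuals.
    reference = NormalDist(mu=0.0, sigma=stdev(residuals))
    ordered = sorted(residuals)
    # (i + 0.5) / n is one common choice of plotting position.
    expected = [reference.inv_cdf((i + 0.5) / n) for i in range(n)]
    return list(zip(expected, ordered))

# HYPOTHETICAL residuals (observed minus predicted); not from the States data.
residuals = [-4.1, -2.3, -1.0, -0.4, 0.2, 0.9, 1.8, 4.9]
print("mean of residuals:", round(mean(residuals), 3))
for expected, actual in normal_qq_pairs(residuals):
    print(f"{expected:7.2f} {actual:7.2f}")
```

If the residuals are roughly normal, the two columns track each other closely; large gaps at the ends suggest heavy tails or outliers.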

Normality of Residuals