Department of Statistics University of South Carolina

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

EMNLP, June 2001Ted Pedersen - EM Panel1 A Gentle Introduction to the EM Algorithm Ted Pedersen Department of Computer Science University of Minnesota.
Brief introduction on Logistic Regression
Physical Aggression and Self-Injury in Juvenile Delinquent Nikki J. Deaver University of Nebraska-Lincoln Methods Participants: Participants were 43 youths.
Parametric Inference.
Finding Data for Quantitative Analysis Lecture 11.
Introduction to Multilevel Modeling Using SPSS
Chapter 13: Inference in Regression
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Bivariate Poisson regression models for automobile insurance pricing Lluís Bermúdez i Morata Universitat de Barcelona IME 2007 Piraeus, July.
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
A generalized bivariate Bernoulli model with covariate dependence Fan Zhang.
An ecological analysis of crime and antisocial behaviour in English Output Areas, 2011/12 Regression modelling of spatially hierarchical count data.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Logistic Regression Analysis Gerrit Rooks
1 Goal Setting as Motivational tool in Student’s Self-regulated 指導教授: Chen, Ming- Puu 報告者 : Chang, Chen-Ming 報告日期: Cheug, E. (2004). Goal setting.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Comparison of discriminant vs. logistic regression analyses for predicting consumer acceptance and purchase decision Darryl Holliday, Ashley Bond, Witoon.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Can Pretty People Have Their Cake and Eat it Too? Positive and Negative Effects of Physical Attractiveness. Megan M. Schad, David E. Szwedo, Joanna M.
Setting Control Limits Using Bayesian Methods When Most Observations Are Below the Limit of Quantitation Steven Novick, Harry Yang, and Wei Zhao May 18,
Kaitlyn Patterson & Wendy Wolfe
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Predicting Brain Size Samantha Stanley Jasmine Dumas Christopher Lee.
Multilevel modelling: general ideas and uses
Missing data: Why you should care about it and what to do about it
BINARY LOGISTIC REGRESSION
Figure 5: Change in Blackjack Posterior Distributions over Time.

Selecting the Best Measure for Your Study
20th EBES Conference – Vienna
Negative Binomial Regression
A Comparison of Two Nonprobability Samples with Probability Samples
Notes on Logistic Regression
Logistic Regression CSC 600: Data Mining Class 14.
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Poverty, Gender and Well-Being: An Urban-Rural Perspective
Ch3: Model Building through Regression
Eastern Michigan University
Antidepressant Use Among Working Age Canadians:
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Chapter 5 STATISTICS (PART 4).
Introduction Results Hypotheses Discussion Method
Shudong Wang NWEA Liru Zhang Delaware Department of Education
Understanding Standards Event Higher Statistics Award
Andres F. Rengifo Christine S. Scott-Hayward Vera Institute of Justice
Research Methods - Descriptive
Matching Methods & Propensity Scores
Catherine Comiskey and Karen Galligan Date 24h /10/2017
Inference about the Slope and Intercept
Matching Methods & Propensity Scores
Inferences and Conclusions from Data
Tabulations and Statistics
Multiple Regression Models
Inference about the Slope and Intercept
What is Regression Analysis?
Craig Dowden and D.A. Andrews Maria Giovenco Radford University
Chapter 2: Steps of Econometric Analysis
Perpetrator Programs: What we know about completion and re-offending
Data Analytic Strategy and Results
Multiple Regression – Split Sample Validation
Korey F. Beckwith & David E. Szwedo James Madison University
Financial Econometrics Fin. 505
Chapter 2: Steps of Econometric Analysis
Examining Deprivation and Threat Dimensions of Trauma Exposure with Recidivism Outcomes and Risk Among Justice-Involved Youth Becca K. Bergquist, M. A.,
The Effect of Instagram on Text Messaging, Age, and Pinterest
Factor Analysis.
Zhen Zhao, PhD and Holly A. Hill, MD, PhD
Presentation transcript:

Department of Statistics University of South Carolina Zero-Inflated Generalized Poisson Regression Model with an Application to Domestic Violence Data Yang He Department of Statistics University of South Carolina 4/26/2019

1. Introduction In 1989, the Portland Police Bureau in collaboration with the Family Violence Intervention Steering Committee of Multnomah County in Oregon developed a plan to reduce domestic violence in Portland. A special police unit called Domestic Violence Reduction Unit was created for accomplishing two goals: (i) Increasing the sanctions for batterers; (ii) Empowering victims. 3 min; 2 page

2. Design of Study and Data To achieve these two goals, the researchers collected data from official records on batterers and from surveys on victims for 1996-1997. For more details : Annette et al1. (1998), ICPSR 3353. Sample size: 214 cases. Based on these data, different models are utilized to explore the relationship between the number of violent behavior of batterer towards victim and some variables. 2 minutes; 1 page

Data Preprocessing Dependent variable: Violence: # of violent behavior of batterer towards victim. Independent variables: level of education: ordinal 1,2,3 employment status: 1(yes), 0 (no) level of income: ordinal 1-5 having family interaction: 1(yes), 0 (no) belonging to a club: 1(yes), 0 (no) having drug problem: 1(yes), 0 (no) Each variable was measured for both victim and batterer.

4. Statistical Models Generalized Poisson Regression Model (GP) Zero-Inflated Generalized Poisson Regression Model (ZIGP)​ 3 min; 2 page

Different models considered Standard Poisson Regression model (P) 𝛼=0 ; 𝜑=0 Generalized Poisson Regression model (GP) 𝛼≠0 ; 𝜑=0 Zero-Inflated Poisson Regression model (ZIP) 𝛼=0 ; 𝜑≠0 ​Zero-Inflated Generalized Poisson Regression model (ZIGP) 𝛼≠0 ; 𝜑≠0 Zero-inflated Negative Binomial Regression model (ZINB)

5. Data Analysis:  (1). Parameter Estimation: Newton –Raphson Algorithm for ZIGP regression model Estimate 𝛽 and 𝛼 for GP- Use these estimates as the initial estimate for ZIGP The final estimate of τ in ZIP(τ ) can be taken as an initial guess for τ in ZIGP. The Newton-Raphson algorithm: Parameter estimation for ZIGP. SPLUS function ‘nlminb’: to obtain the maximum likelihood estimates. The algorithm converged in less than 20 iterations. (ZINB model did not converge) 3 min; 3 pages

Education level The victim’s education is negatively related to the level of violence. The batterer’s education level is not significant. Income Significant negative relationship between the victim’s income and the level of violence. Thus, victims with high income tend to receive lower number of violence. Only the ZIP model, but not the ZIGP model gave a similar conclusion for the batterer’s income. Employment status of neither victim nor batterer is significant. Having family interaction Family interaction for victim is not significant in both models. Batters having family interaction tend to be less violent in ZIP model.

Belonging to a club Significant positive relationship between the victim’s belonging to a club and the level of violence. However, it is a significant negative relationship for the batterer. Drug problem Negative and significant relationship between the victim having drug problem and the level of violence in both models. Significant positive relationship for batterer. This indicates that more drug problems the batterer has, more violent the batterer becomes. Overall, all six independent variables are significant at 1% level under the ZIP model whereas only three are significant at 1% level under the ZIGP model.

5. Data Analysis 2. Score Test : GP VS ZIGP Target: test whether the number of zeros is too large for a Generalized Poisson model to adequately fit the data. Result: score statistic = 20.02; significant at 5% level when compared to 𝜒 1 2 Conclusion: data have too many zeros  GP regression model is not an appropriate model. ZIGP regression model is more appropriate than GP Regression model. 3. Goodness-of-fit test: ZIP VS ZIGP H0 : α = 0 (ZIP) Ha : α ≠ 0. (ZIGP) Test statistic used: asymptotic Wald statistic Result: α is significantly different from zero. Conclusion: ZIP regression model is not an appropriate model. ZIGP regression model fits the data better than ZIP regression model .

6. Conclusion The application of the ZIGP regression model to the domestic violence data illustrates the usefulness of the model. Iterative technique to estimate the parameters of ZINB regression model did not converge. Even though the ZIGPR model is a good competitor of ZINB regression model, we do not know under what conditions, if any, which one will be better. 2 min; 1 page

Thank you!