Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.

Slides:



Advertisements
Similar presentations
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
AP Statistics Tuesday, 15 April 2014 OBJECTIVE TSW (1) identify the conditions to use a chi-square test; (2) examine the chi-square test for independence;
A Model to Evaluate Recreational Management Measures Objective I – Stock Assessment Analysis Create a model to distribute estimated landings (A + B1 fish)
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Loglinear Models for Independence and Interaction in Three-way Tables Veronica Estrada Robert Lagier.
Chapter 13: The Chi-Square Test
Generalized Linear Mixed Model English Premier League Soccer – 2003/2004 Season.
Loglinear Models for Contingency Tables. Consider an IxJ contingency table that cross- classifies a multinomial sample of n subjects on two categorical.
Log-Linear Models & Dependent Samples Feng Ye, Xiao Guo, Jing Wang.
Adjusting for extraneous factors Topics for today Stratified analysis of 2x2 tables Regression Readings Jewell Chapter 9.
Linear statistical models 2009 Models for continuous, binary and binomial responses  Simple linear models regarded as special cases of GLMs  Simple linear.
Modeling Wim Buysse RUFORUM 1 December 2006 Research Methods Group.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Spotting pseudoreplication 1.Inspect spatial (temporal) layout of the experiment 2.Examine degrees of freedom in analysis.
Final Review Session.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Linear statistical models 2008 Count data, contingency tables and log-linear models Expected frequency: Log-linear models are linear models of the log.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
OLS versus MLE Example YX Here is the data:
Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Today: Lab 9ab due after lecture: CEQ Monday: Quizz 11: review Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Discrete Multivariate Analysis Analysis of Multivariate Categorical Data.
4-Oct-07GzLM PresentationBIOL The GzLM and SAS Or why it’s a necessary evil to learn code! Keith Lewis Department of Biology Memorial University,
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
1 STA 617 – Chp11 Models for repeated data Analyzing Repeated Categorical Response Data  Repeated categorical responses may come from  repeated measurements.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
Sigmoidal Response (knnl558.sas). Programming Example: knnl565.sas Y = completion of a programming task (1 = yes, 0 = no) X 2 = amount of programming.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Log-linear Models Please read Chapter Two. We are interested in relationships between variables White VictimBlack Victim White Prisoner151 (151/160=0.94)
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Cross Tabulation with Chi Square
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Introduction The two-sample z procedures of Chapter 10 allow us to compare the proportions of successes in two populations or for two treatments. What.
Basic Estimation Techniques
Generalized Linear Models
Active Learning Lecture Slides
Basic Estimation Techniques
The Chi-Square Distribution and Test for Independence
ביצוע רגרסיה לוגיסטית. פרק ה-2
Econ 3790: Business and Economics Statistics
Chi-square test or c2 test
Joyful mood is a meritorious deed that cheers up people around you
Chapter Outline Goodness of Fit test Test of Independence.
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Modeling Ordinal Associations Bin Hu
Presentation transcript:

Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression

Linear statistical models 2009 Contingency tables and log-linear models Expected frequency: Log-linear models are linear models of the log expected frequency (log is used as link function)

Linear statistical models 2009 A log-linear model for independence The last parameter of each kind can be set to zero

Linear statistical models 2009 The saturated log-linear model Independence can be tested by relating the difference in deviance D 2 – D 1 to a  2 distribution with df 2 – df 1 degrees of freedom. What is D 1 and df 1 for the saturated model?

Linear statistical models 2009 Analysis of example data (1) proc genmod data=linear.snoring; class snore heart; model count = snore heart/link=log dist=Poisson; run; Can a Poisson distribution be justified?

Linear statistical models 2009 Analysis of example data (2) Analysis Of Parameter Estimates Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 Snore Often <.0001 Snore Seldom Heart No <.0001 Heart Yes Scale Estimates of log(  )

Linear statistical models 2009 Contingency table with one response variable Consider the example data written in the following form proc genmod data=linear.snoring2; class snore; model heart/total = snore/link=logit dist=binomial; run;

Linear statistical models 2009 Analysis of example data (2) Analysis Of Parameter Estimates Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 Snore No <.0001 Snore Yes Scale log(p/(1- p)) p Yes No

Linear statistical models 2009 The multinomial distribution Consider a nominal random variable that takes k distinct values with probabilities p 1, p 2, …, p k Assume that have made n independent observations of that variable Then where n j is the number of times the j th value is observed Note that n is fixed in a multinomial distribution. If the observations arrive randomly, a Poisson distribution is usually preferable.

Linear statistical models 2009 Higher order tables Consider the following data on drug use Model:

Linear statistical models 2009 Terminology A = alcoholC = cigaretteM = marijuana Model A C M: mutual independence model Model A C M A*C A*M C*M: homogeneous association model Model A C M A*C A*M: Model in which C and M are mutually independent when controlling for A

Linear statistical models 2009 Poisson regression I Poisson distribution Log link where x is a covariate

Linear statistical models 2009 Poisson regression II Poisson distribution Log link where the parameters are row, column and treatment effects