Chapter 2 Describing Contingency Tables Reported by Liu Qi.

Slides:



Advertisements
Similar presentations
Contingency Table Analysis Mary Whiteside, Ph.D..
Advertisements

NORMAL OR GAUSSIAN DISTRIBUTION Chapter 5. General Normal Distribution Two parameter distribution with a pdf given by:
Special random variables Chapter 5 Some discrete or continuous probability distributions.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Loglinear Models for Contingency Tables. Consider an IxJ contingency table that cross- classifies a multinomial sample of n subjects on two categorical.
Introduction to Categorical Data Analysis
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Basic Statistical Review
Chapter 14 Analyzing Quantitative Data. LEVELS OF MEASUREMENT Nominal Measurement Nominal Measurement Ordinal Measurement Ordinal Measurement Interval.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Review Chapter 1-3. Exam 1 25 questions 50 points 90 minutes 1 attempt Results will be known once the exam closes for everybody.
1 Econ 240A Power Outline Review Projects 3 Review: Big Picture 1 #1 Descriptive Statistics –Numerical central tendency: mean, median, mode dispersion:
Log-linear and logistic models
Overview of STAT 270 Ch 1-9 of Devore + Various Applications.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
Simple Linear Regression and Correlation
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
Generalized Linear Models
AS 737 Categorical Data Analysis For Multivariate
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
AP Statistics Chapter 15 Notes. Inference for a Regression Line Goal: To determine if there is a relationship between two quantitative variables. –i.e.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
Determination of Sample Size: A Review of Statistical Theory
Contingency Tables 1.Explain  2 Test of Independence 2.Measure of Association.
Introduction to Statistics Santosh Kumar Director (iCISA)
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 4.
Logistic Regression. Linear Regression Purchases vs. Income.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Copyright © 2010 Pearson Addison-Wesley. All rights reserved. Chapter 9 One- and Two-Sample Estimation Problems.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free.
Categorical Data Analysis
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Howard Community College
Cross Tabulation with Chi Square
BINARY LOGISTIC REGRESSION
Statistical Modelling
LEVELS of DATA.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
Chapter 4. Inference about Process Quality
Chapter Six Normal Curves and Sampling Probability Distributions
Generalized Linear Models
Analysis of Data Graphics Quantitative data
Introduction to logistic regression a.k.a. Varbrul
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Maximum Likelihood Find the parameters of a model that best fit the data… Forms the foundation of Bayesian inference Slide 1.
SA3202 Statistical Methods for Social Sciences
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Review for Exam 2 Some important themes from Chapters 6-9
Confidence Intervals Chapter 10 Section 1.
Categorical Data Analysis
CHAPTER 6 Statistical Inference & Hypothesis Testing
Joyful mood is a meritorious deed that cheers up people around you
Statistics Review (It’s not so scary).
Introductory Statistics
Presentation transcript:

Chapter 2 Describing Contingency Tables Reported by Liu Qi

Review of Chapter 1 Categorical variable Response-Explanatory variable Nominal-Ordinal-Interval variable Continuous-Discrete variable Quantitative-Qualitative variable

Review(cont.) Use binomial, multinomial and Poisson distribution Not normality distribution Tow most used models: logistic regression(logit) log linear

Binomial distribution

Multinomial distribution

Poisson distribution

Poisson Multinomial

Something unfamiliar Maximum likelihood estimation Confidence intervals Statistical inference for binomial parameters multinomial parameters ……

Terminology and notation Cell Contingency table

Terminology and notation Subjective Sensitivity and Specificity Conditional distribution Joint distribution Marginal distribution Independence =>

Sampling Scheme Poisson the joint probability mass function: Multinomial independent/product multinomial Hyper geometric

Example for sampling

Types of studies Retrospective: case-control Prospective: – Clinical trial observational study – Cohort study Cross-sectional: experimental study

Comparing two proportions Difference Relative risk Odds ratio – Odds defined as – For a 2*2 table, odds ratio – Another name: cross-product ratio

Properties of the Odds Ratio 0=<θ <, θ=1 means independence of X and Y the farther from 1.0, the stronger the association between X and Y. log θ is convenient and symmetric Suitable for all direction No change when any row/column multiplied by a constant.

Aspirin and Heart Attacks Revisited 189/11034= /11037= Relative risk: /0.0094=1.82 Odds ratio: (189*10933)/(10845*1 04)=1.83

Case-Control Studies and the Odds Ratio

However(cont.)

Partial association in stratified 2*2 tables Experimental studies We hold other covariates constant to study the effect of X on Y. Observational studies Control for a possibly confounding variable Z Partial tables=>conditional association Marginal table

Death penalty example

Death penalty example(cont.)

Simpsons paradox

Conditional and marginal odds ratios Conditional Marginal

Conditional independence Conditional independence: Joint probability:

Marginal independence

Marginal versus Conditional

Marginal versus Conditional(cont.) Marginal conditional

Homogeneous Association For a 2*2*K table, homogeneous XY association defined as: A symmetric property: – Applies to any pair of variables viewed across the categories of the third. – No interaction between two variables in their effects on the other variable.

Homogeneous Association(cont.) Suppose: – X=smoking(yes, no) – Y=lung cancer(yes, no) – Z=age( 65) – And Age is an Effect Modifier

Extensions for i*j Tables For a 2*2 table Odds ratio An i*j table Odds ratios

Representation methods Method 1

Method 2

For I*J tables (I-1)*(J-1) odds ratios describe any association All 1.0s means INDEPENDENCE! Three-way I*J*K tables, Homogeneous XY association means: any conditional odds ratio formed using two categories of X and Y each is the same at each category of Z.

Measures of Association Two kinds of variables: – Nominal variables – Ordinal variables Nominal variables: Set a measure for X and Y: – V(Y),V(Y|X) Proportional reduction:

Measures of variation Entropy: Goodman and Kruskal(1954) (tau) Lambda:

About Entropy Uncertainty coefficient: U=0=>INDEPENDENCE U=1=>π(j|i)=1 for each i, some j. Drawbacks: No intuition for such a proportional reduction.

Ordinal Trends An example:

Three kinds of relationship Concordant Discordant Tied

Example(cont.) D=849 Define (C-D)/(C+D) as Gamma measure. Here, A weak tendency for job satisfaction to increase as income increases.

Generalized

Properties of Gamma Measure Symmetric Range [-1,1] Absolute value of 1 means perfect linear Monotonicity is required for Independence =>,not vice-versa.