March 30 More examples of case-control studies General I x J table Chi-square tests.

Slides:



Advertisements
Similar presentations
2013/12/10.  The Kendall’s tau correlation is another non- parametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.
Advertisements

Tutorial: Chi-Square Distribution Presented by: Nikki Natividad Course: BIOL Biostatistics.
Simple Logistic Regression
1 Contingency Tables: Tests for independence and homogeneity (§10.5) How to test hypotheses of independence (association) and homogeneity (similarity)
Categorical Data. To identify any association between two categorical data. Example: 1,073 subjects of both genders were recruited for a study where the.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Header= Verdana 28 pt., Red 1 STA 517 – Chapter 3: Inference for Contingency Tables 3. Inference for Contingency Tables 3.1 Confidence Intervals for Association.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
PSYC512: Research Methods PSYC512: Research Methods Lecture 19 Brian P. Dyre University of Idaho.
Categorical Data Analysis: Stratified Analyses, Matching, and Agreement Statistics Biostatistics March 2007 Carla Talarico.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Logistic Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
Confounding, Effect Modification, and Stratification.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Xuhua Xia Smoking and Lung Cancer This chest radiograph demonstrates a large squamous cell carcinoma of the right upper lobe. This is a larger squamous.
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
Analysis of Categorical Data
September 15. In Chapter 18: 18.1 Types of Samples 18.2 Naturalistic and Cohort Samples 18.3 Chi-Square Test of Association 18.4 Test for Trend 18.5 Case-Control.
Amsterdam Rehabilitation Research Center | Reade Testing significance - categorical data Martin van der Esch, PhD.
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
1 Applied Statistics Using SAS and SPSS Topic: Chi-square tests By Prof Kelly Fan, Cal. State Univ., East Bay.
Analyses of Covariance Comparing k means adjusting for 1 or more other variables (covariates) Ho: u 1 = u 2 = u 3 (Adjusting for X) Combines ANOVA and.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Lesson 5 - Topics Formatting Output Working with Dates Reading: LSB:3:8-9; 4:1,5-7; 5:1-4.
Analysis of Categorical Data. Types of Tests o Data in 2 X 2 Tables (covered previously) Comparing two population proportions using independent samples.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Introduction to Biostatistics (ZJU 2008) Wenjiang Fu, Ph.D Associate Professor Division of Biostatistics, Department of Epidemiology Michigan State University.
Contingency Tables 1.Explain  2 Test of Independence 2.Measure of Association.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
More Contingency Tables & Paired Categorical Data Lecture 8.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
WINKS 7 Tutorial 3 Analyzing Summary Data (Using Student’s t-test) Permission granted for use for instruction and for personal use. ©
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Types of Categorical Data Qualitative/Categorical Data Nominal CategoriesOrdinal Categories.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Describing Association for Discrete Variables. Discrete variables can have one of two different qualities: 1. ordered categories 2. non-ordered categories.
Lecture 4 Ways to get data into SAS Some practice programming
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Exact Logistic Regression
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Categorical Data Analysis
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Analysis of matched data Analysis of matched data.
Chapter 4 Selected Nonparemetric Techniques: PARAMETRIC VS. NONPARAMETRIC.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc Tests for Homogeneity and Independence in a Two-Way Table Data resulting from observations.
Lesson 10 - Topics SAS Procedures for Standard Statistical Tests and Analyses Programs 19 and 20 LSB 8:16-17.
How to Carry Out Nonparametric Tests and Construct Contingency Tables 21 January 2015.
March 28 Analyses of binary outcomes 2 x 2 tables
Lesson 10 - Topics SAS Procedures for Standard Statistical Tests and Analyses Programs 19 and 20 LSB 9:4-7;12-13 Welcome to lesson 10. In this lesson.
Chapter 18 Cross-Tabulated Counts
Examples and SAS introduction: -Violations of the rare disease assumption -Use of Fisher’s exact test January 14, 2004.
If we can reduce our desire,
Categorical Data Analysis
Applied Statistics Using SPSS
Applied Statistics Using SPSS
Presentation transcript:

March 30 More examples of case-control studies General I x J table Chi-square tests

Exam Question   proportion who smoke cigarettes n = 10 x = 3 report smoking p = 3/10 = 0.30 is estimate of  SE(p) == % CI for  0.30 ± 1.96 (0.14) 0.30 ± 0.28 or (0.02 – 0.58)

HRT Use and Colonic Polyps Case-control study conducted at Digestive Healthcare (DH) in Minneapolis ( ) Patients underwent colonoscopy –Cases were patients with adenomatous polyps –Controls were patients without polyps Questionnaires completed –Use of hormone replacement therapy ascertained Published CEBP 1996 (Potter et al) Since patients undergoing colonoscopy were not a random sample RR of polyps cannot be estimated

HRT Use and Polyps Case (Polyps)Control (No Polyps) HRT Use RO = 72/ /114 = 0.46 No HRT Use 247 RO HRT Use (Case v Control)    463 ) (RO) 2  174) (289) (247) (216) =

USING SAS DATA hrt; INFILE DATALINES; INPUT hrt ccstatus count; DATALINES; ; PROC FREQ DATA=hrt; TABLES hrt*ccstatus/CHISQ RELRISK; WEIGHT count; RUN; ccstatus =1 is case ccstatus = 2 is control

hrt ccstatus Frequency| Percent | Row Pct | Col Pct | 1| 2| Total | 72 | 175 | 247 | | | | | | | | | | 102 | 114 | 216 | | | | | | | | | Total Estimates of the Relative Risk (Row1/Row2) Type of Study Value 95% Confidence Limits Case-Control (Odds Ratio) Cohort (Col1 Risk) Cohort (Col2 Risk)

Interpretation (RO = 0.46) Women who use HRT are at a 54% decreased risk of developing polyps compared to women who do not use HRT. The odds of developing polyps is 54% lower in women who use HRT compared to women who do not use HRT.

Analyses of I x J Tables Two factors, one with I levels, second with J levels I rows, J columns General hypothesis: –Are rows independent of columns? –Ho:    2  3 (when J=2) Chi-square  2 ) test with (I-1 x J-1 df)  Sum over all cells (IxJ)

Example TOMHS Rows – 6 treatment groups Columns – 4 levels of side-effect –1 = none; 2=mild, 3=moderate, 4=severe Do side effect distribution depend on group? –Group 1 = diuretic –Group 2 = ACE –Group 3 = placebo –Side effects: 1=none, 2=mild, 3 = moderate/severe Look at cough

Cough Diuretic ACE Placebo NoneMild Mod/Sev Expected Frequencies if Ho is true: Cell (1,1) = 359*133/498 = 95.9 Cell (1,2) = 109*133/498 = 29.1 Compute for all 9 cells 498

Cough- Expected Frequencies 99 (95.9)28 (29.1)6 (8.0) 86 (95.9)36 (29.1)11 (8.0) 174 (167.2)45 (50.8)13 (14.0) Diuretic ACE Placebo NoneMild Mod/Sev    4DF)   / ( )2 / … + (13-14) 2 /14 = 5.41 P = 0.25

USING SAS DATA cough; INPUT group cough count; DATALINES; ; PROC FREQ DATA=hrt; TABLES group*cough/CHISQ CMH; WEIGHT count; RUN; cough =1 (none) cough = 2 (mild) cough = 3 (mod/severe)

GROUP cough Frequency| Expected | Percent | Row Pct | Col Pct | 1| 2| 3| Total | 99 | 28 | 6 | 133 | | | | | | 5.62 | 1.20 | | | | 4.51 | | | | | | 86 | 36 | 11 | 133 | | | | | | 7.23 | 2.21 | | | | 8.27 | | | | | | 174 | 45 | 13 | 232 | | | | | | 9.04 | 2.61 | | | | 5.60 | | | | | Total

Statistic DF Value Prob Chi-Square Likelihood Ratio Chi-Square Mantel-Haenszel Chi-Square Phi Coefficient Contingency Coefficient Cramer's V Effective Sample Size = 498 Frequency Missing = 7 Cochran-Mantel-Haenszel Statistics (Based on Table Scores) Statistic Alternative Hypothesis DF Value Prob Nonzero Correlation Row Mean Scores Differ General Association Takes into account the ordering of the row categories

1-Sample Z-Test: Matched Pair Data ab cd Control PositiveControl Negative Case Pos Neg Ho:  0.5 where n=b+c and x=b Z = (b/(b+c) – 0.5)/sqrt(.5*.5/(b+c)) Z = (b-c)/sqrt(b+c)  2 = (b-c) 2 /(b+c) Analyses is done on discordant pairs b and c Called McNemar’s chi-square

Example – Vitamin Use/Disease (440 Pairs) Control Vitamin +Control Vitamin - Case Vit + Vit - Ho:  0.5 where n=140 and b = 50  2 = (50-90) 2 /(50+90) = (p=.0007)

Example of Matched Case-control Study DATA vitamin; INFILE DATALINES; INPUT v_case $ v_cont $ count; LABEL v_case = 'Case Use of Vitamins'; LABEL v_cont = 'Control Use of Vitamins'; DATALINES; 1-YES 1-YES YES 2-NO 50 2-NO 1-YES 90 2-NO 2-NO 200 ; 100 pairs where both case and control took vitamins

PROC FREQ DATA=vitamin; TABLES v_case*v_cont/AGREE; WEIGHT COUNT; TITLE 'Matched Case-control Study'; RUN;

v_case(Case Use of Vitamins) v_cont(Control Use of Vitamins) Frequency| Percent | Row Pct | Col Pct |1-YES |2-NO | Total YES | 100 | 50 | 150 | | | | | | | | | NO | 90 | 200 | 290 | | | | | | | | | Total Only the off diagonals give information Table gives evidence that vitamins were protective.

v_case(Case Use of Vitamins) v_cont(Control Use of Vitamins) Frequency| Percent | Row Pct | Col Pct |1-YES |2-NO | Total YES | 100 | 50 | 150 | | | | | | | | | NO | 90 | 200 | 290 | | | | | | | | | Total Statistics for Table of v_case by v_cont McNemar's Test Statistic (S) DF 1 Pr > S Obtained with AGREE Option  2 = (50-90) 2 /(50+90) = 11.43