1 2 Two-samples tests, X 2 Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
2013/12/10.  The Kendall’s tau correlation is another non- parametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Bivariate Analysis Cross-tabulation and chi-square.
Chapter 13: The Chi-Square Test
Statistical Tests Karen H. Hagglund, M.S.
Analysis of frequency counts with Chi square
Introduction to Statistics Dr Linda Morgan Clinical Chemistry Division School of Clinical Laboratory Sciences.
Cross Tabulation and Chi Square Test for Independence.
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Statistics 303 Chapter 9 Two-Way Tables. Relationships Between Two Categorical Variables Relationships between two categorical variables –Depending on.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
BCOR 1020 Business Statistics
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Analysis of Categorical Data
Gerstman Case-Control Studies 1 Epidemiology Kept Simple Section 11.5 Case-Control Studies.
September 15. In Chapter 18: 18.1 Types of Samples 18.2 Naturalistic and Cohort Samples 18.3 Chi-Square Test of Association 18.4 Test for Trend 18.5 Case-Control.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Statistics for clinical research An introductory course.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
T- T EST Prof. Dr. Ramez Bedwany. C OMPARISON OF TWO INDEPENDENT SAMPLES T - T EST = mean of the first simple = mean of the second sample = pooled Variance.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
The binomial applied: absolute and relative risks, chi-square.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
© Copyright McGraw-Hill 2000
Contingency Tables 1.Explain  2 Test of Independence 2.Measure of Association.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
More Contingency Tables & Paired Categorical Data Lecture 8.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Chi Square Test Dr. Asif Rehman.
Cross Tabulation with Chi Square
CHI-SQUARE(X2) DISTRIBUTION
Student’s t-distribution
Chapter 9: Non-parametric Tests
Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016
Unit 8 Section 7.5.
Objectives (BPS chapter 23)
The binomial applied: absolute and relative risks, chi-square
Association between two categorical variables
Hypothesis Testing Review
Chapter 18 Cross-Tabulated Counts
The Chi-Square Distribution and Test for Independence
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Day 67 Agenda: Submit THQ #6 Answers.
Risk ratios 12/6/ : Risk Ratios 12/6/2018 Risk ratios StatPrimer.
A paired-samples t-test compares the means of two related sets of data to see if they differ statistically. IQ Example We may want to compare the IQ scores.
Student’s t-distribution
SENIORS: Final transcript request must be made by Friday.
Correlation & Regression
Presentation transcript:

1

2 Two-samples tests, X 2 Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University

3 Z-test (two independent proportions) P1= proportion in the first group P2= proportion in the second group n1= first sample size n2= second sample size

4 Critical z = 1.96 at 5% level of significance 2.58 at 1% level of significance

5 Researchers wished to know if urban and rural adult residents of a developing country differ with respect to prevalence of a certain eye disease. A survey revealed the following information Total Eye disease Residence NoYes Rural Urban Test at 5% level of significance, the difference in the prevalence of eye disease in the 2 groups Example

6 P1 = 24/300 = 0.08 p2 = 15/500 = > Z* The difference is statistically significant Answer

7 t-Test (two independent means) = mean of the first group = mean of the second group S 2 p = pooled variance

8 Critical t from table is detected at degree of freedom = n 1 + n level of significance 1% or 5%

9 Sample of size 25 was selected from healthy population, their mean SBP =125 mm Hg with SD of 10 mm Hg. Another sample of size 17 was selected from the population of diabetics, their mean SBP was 132 mmHg, with SD of 12 mm Hg. Test whether there is a significant difference in mean SBP of diabetics and healthy individual at 1% level of significance Example

10 S 1 = 12 S 2 =11 State H0 H 0 :  1 =  2 State H1 H 1 :  1   2 Choose α α = 0.01 Answer

11 Critical t at df = 40 & 1% level of significance = 2.58 Decision: Since the computed t is smaller than critical t so there is no significant difference between mean SBP of healthy and diabetic samples at 1 %. Answer

12 Degrees of freedom Probability (p value) 

13 Paired t- test (t- difference) Uses: To compare the means of two paired samples. Example, mean SBP before and after intake of drug.

14 di = difference (after-before) Sd = standard deviation of difference n = sample size Critical t from table at df = n-1

15 The following data represents the reading of SBP before and after administration of certain drug. Test whether the drug has an effect on SBP at 1% level of significance. Example

16 SBP (After) SBP (Before) Serial No

17 di 2 di After-Before BP After BP Before Serial No Total ∑ di2∑di Answer

18 Answer

19 Critical t at df = 6-1 = 5 and 1% level of significance = Decision: Since t is < critical t so there is no significant difference between mean SBP before and after administration of drug at 1% Level. Answer

20 Degrees of freedom Probability (p value) 

21 Chi-Square test It tests the association between variables... The data is qualitative. It is performed mainly on frequencies. It determines whether the observed frequencies differ significantly from expected frequencies.

22 Where E = expected frequency O = observed frequency

23  Critical X 2 at df = (R-1) ( C -1) Where R = raw C = column  I f 2 x 2 table X 2* = 3.84 at 5 % level of significance X 2* = 6.63 at 1 % level of significance

24

25 In a study to determine the effect of heredity in a certain disease, a sample of cases and controls was taken: Total Disease Family history ControlCases Positive Negative Total Using 5% level of significance, test whether family history has an effect on disease Example

26 X 2 = (80-88) 2 /88 + ( ) 2 /112 + ( ) 2 /132 + ( ) 2 /168 = < 3.84 Association between the disease and family history is not significant Total Disease Family history ControlCases positive O E Negative O E Total Answer

27 The odds ratio was developed to quantify exposure – disease relations using case- control data Once you have selected cases and controls  ascertain exposure Then, cross-tabulate data to form a 2-by-2 table of counts

28 2-by-2 Crosstab Notation Disease +Disease -Total Exposed +ABA+B Exposed -CDC+D TotalA+CB+DA+B+C+D Disease status A+C = no. of cases B+D = no. of non-cases Exposure status A+B = no. of exposed individuals C+D = no. of non-exposed individuals

29 Disease +Disease - Exposed +AB Exposed -CD Cross-product ratio

30 Exposure variable = Smoking Disease variable = Hypertension D+D- E+3071 E-122 Total3193 Example

31 Interpretation of the Odds Ratio Odds ratios are relative risk estimates Relative risk are risk multipliers The odds ratio of 9.3 implies 9.3× risk with exposure

32 No association OR < 1 OR = 1 OR > 1 Positive association Higher risk Negative association Lower risk (Protective)

33 In the previous example OR = % CI is 1.20 – 72.14

34 Multiple Levels of Exposure Smoking levelCasesControls Heavy smokers Moderate smokers61147 Light smokers1482 Non-smokers8115 Total296618

35 Multiple Levels of Exposure k levels of exposure  break up data into (k – 1) 2  2 tables Compare each exposure level to non-exposed e.g., heavy smokers vs. non-smokers CasesControls Heavy smokers Non-smokers8115

36 Multiple Levels of Exposure Smoking levelCasesControls Heavy smokers213274OR 3 =(213)(115)/(274)(8)=11.2 Moderate smokers61147OR 2 =(61)(115)/(147)(8) = 6.0 Light smokers1482OR 1 =(14)(115)/(82)(8) = 2.5 Non-smokers8115 Total Notice the trend in OR (dose-response relationship)

37 Small Sample Size Formula For the Odds Ratio It is recommend to add ½ to each cell before calculating the odds ratio when some cells are zeros D+D- E+3171 E-022 Total3193 OR Small Sample = (A+0.5)(D+0.5) (B+0.5)(C+0.5) OR Small Sample = (31+0.5)(22+0.5) =19.8 (71+0.5)(0+0.5)

38