Biostatistics course Part 14 Analysis of binary paired data

Slides:

Advertisements

Similar presentations

1 Radio Maria World. 2 Postazioni Transmitter locations.

Advertisements

EcoTherm Plus WGB-K 20 E 4,5 – 20 kW.

Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.

Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.

AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory

Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.

SKELETAL QUIZ 3.

PDAs Accept Context-Free Languages

Reflection nurulquran.com.

EuroCondens SGB E.

Slide 1Fig 26-CO, p.795. Slide 2Fig 26-1, p.796 Slide 3Fig 26-2, p.797.

Sequential Logic Design

STATISTICS Linear Statistical Models

STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.

Addition and Subtraction Equations

Division ÷ 1 1 ÷ 1 = 1 2 ÷ 1 = 2 3 ÷ 1 = 3 4 ÷ 1 = 4 5 ÷ 1 = 5 6 ÷ 1 = 6 7 ÷ 1 = 7 8 ÷ 1 = 8 9 ÷ 1 = 9 10 ÷ 1 = ÷ 1 = ÷ 1 = 12 ÷ 2 2 ÷ 2 =

1 When you see… Find the zeros You think…. 2 To find the zeros...

Western Public Lands Grazing: The Real Costs Explore, enjoy and protect the planet Forest Guardians Jonathan Proctor.

EQUS Conference - Brussels, June 16, 2011 Ambros Uchtenhagen, Michael Schaub Minimum Quality Standards in the field of Drug Demand Reduction Parallel Session.

Add Governors Discretionary (1G) Grants Chapter 6.

CHAPTER 18 The Ankle and Lower Leg

Summative Math Test Algebra (28%) Geometry (29%)

ASCII stands for American Standard Code for Information Interchange

The 5S numbers game..

突破信息检索壁垒－SciFinder Scholar 介绍

A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.

Break Time Remaining 10:00.

The basics for simulations

© 2010 Concept Systems, Inc.1 Concept Mapping Methodology: An Example.

PP Test Review Sections 6-1 to 6-6

MM4A6c: Apply the law of sines and the law of cosines.

Biostatistics course Part 13 Effect measures in 2 x 2 tables Dr. Sc. Nicolas Padilla Raygoza Department of Nursing and Obstetrics Division Health Sciences.

Figure 3–1 Standard logic symbols for the inverter (ANSI/IEEE Std

TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

Dynamic Access Control the file server, reimagined Presented by Mark on twitter 1 contents copyright 2013 Mark Minasi.

TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.

Progressive Aerobic Cardiovascular Endurance Run

Biology 2 Plant Kingdom Identification Test Review.

MaK_Full ahead loaded 1 Alarm Page Directory (F11)

TCCI Barometer September “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

When you see… Find the zeros You think….

2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.

Before Between After.

2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.

2.10% more children born Die 0.2 years sooner Spend 95.53% less money on health care No class divide 60.84% less electricity 84.40% less oil.

Foundation Stage Results CLL (6 or above) 79% 73.5%79.4%86.5% M (6 or above) 91%99%97%99% PSE (6 or above) 96%84%100%91.2%97.3% CLL.

Subtraction: Adding UP

Numeracy Resources for KS2

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Static Equilibrium; Elasticity and Fracture

Converting a Fraction to %

Resistência dos Materiais, 5ª ed.

& dding ubtracting ractions.

Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.

UNDERSTANDING THE ISSUES. 22 HILLSBOROUGH IS A REALLY BIG COUNTY.

A Data Warehouse Mining Tool Stephen Turner Chris Frala

1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.

Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Introduction Embedded Universal Tools and Online Features 2.

What impact does the address have on the tribe?

úkol = A 77 B 72 C 67 D = A 77 B 72 C 67 D 79.

Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.

Presentation transcript:

Biostatistics course Part 14 Analysis of binary paired data Dr. Sc. Nicolas Padilla Raygoza Department of Nursing and Obstetrics Division Health Sciences and Engineering University of Guanajuato Campus Celaya-Salvatierra

Biosketch Medical Doctor by University Autonomous of Guadalajara. Pediatrician by the Mexican Council of Certification on Pediatrics. Postgraduate Diploma on Epidemiology, London School of Hygiene and Tropical Medicine, University of London. Master Sciences with aim in Epidemiology, Atlantic International University. Doctorate Sciences with aim in Epidemiology, Atlantic International University. Associated Professor B, School of Nursing and Obstetrics of Celaya, university of Guanajuato. padillawarm@gmail.com

Competencies The reader will know how show paired binary data. He (she) will apply hypothesis test for paired binary data – McNemar’s Chi-squared. He (she) will calculate confidence interval for paired binary data. He (she) will obtain Odds Ratio and confidence interval for cases-controls paired studies.

Introduction In Parts 12 and 13 of the biostatistics course, we knew, the methods for comparing two proportions estimated from independent samples. If the observations in a study are not independent, we need to use different methods. Often we use two types of studies that give rise to observations that are not independent: Repeated observations in the same individual Matched case-control studies

Example Tuberculosis can be diagnosed to use a culture media and looking if Mycobacterium tuberculosis is growing. In a experiment to compare two culture medias for the tuberculosis diagnosis, samples of expectoration from 100 patients were planted in the two medias. The half of the sample was planted in media A and another half of sample, planted in media B. In this study, the results of an individual are two observations that are matched to one another. Each result can be + or - to the tuberculosis bacillus.

Example In a study, to examine the relation between breast cancer and oral contraceptives, women with a breast cancer were matched with women without breast cancer, selected from electoral registries. This is an example of cases-controls study, where each individual with breast cancer is matched with an individual with similar age, for control the potential effect counding, of age.

Showing categorical paired data To use Z test or Chi squared test with paired data is a mistake, because we do not take into account the paired nature of data. Patient Culture A Culture B 1 - 2 3 + 4 5 6 7 8 9 10 11 12 13 14 15 Patient Culture A Culture B 16 + 17 18 - 19 20 21 22 23 24 25 26 27 28 29 30 Patient Culture A Culture B 31 - 32 + 33 34 35 36 37 38 39 40 41 42 43 44 45 Patient Culture A Culture B 46 + 47 - 48 49 50

Showing categorical paired data Patient Culture A Culture B 51 - 52 53 + 54 55 56 57 58 59 60 61 62 63 64 65 Patient Culture A Culture B 66 + 67 68 - 69 70 71 72 73 74 75 76 77 78 79 80 Patient CultureA Culture B 81 - 82 + 83 84 85 86 87 88 89 90 91 92 93 94 95 Patient Culture A Culture B 96 + 97 - 98 99 100

Showing categorical paired data The experiment compared the capacity of culture media to detect Mycobacterium tuberculosis. The results were positive (+) or negative (-). We have interest in to compare the samples positives of both culture media. The table summarize the results Culture media + - Total A 64 36 100 B 44 56

Showing categorical paired data From this, do you think that media A is better that media B to detect the tuberculosis bacilli? To make an adequate analysis, we need to compare the results with both media in each subject. There are four combinations of results that can occur in each subject: Combination Media A Media B Pairs 1 + k 2 - r 3 s 4 m We signalize: • The number of times that both media are positive = k • The number of times that A is positive and B is negative = r • The number of times that a is negative and B is positive = s • The number of times that both media are negative = m

Showing categorical paired data To compare the results of each subject, we need count how many times occur each combination. AN easy form to show the calues is tabulate the results from a sample against another sample. Media B + - A + k r k + r A - s m s + m k + s r + m N We signalize: • The number of times that both media are positive = k • The number of times that A is positive and B is negative = r • The number of times that a is negative and B is positive = s • The number of times that both media are negative = m

Showing categorical paired data The pairs with same result are pairs with agreement, and they do not give any information on what media is better to detect bacilli. Of the remaining results were different between the two media: 24 were positive for the A and negative for B. 4 were negative for A and positive for the B. The pairs whose results were different between both media, are called discordant pairs. Media B + - A + 40 24 64 A - 4 32 36 44 56 100 Where N is total number of pairs, N = 100 in this table. The 72 pairs that agree do not give us any information on what media is better to detect the bacilli. For this reason, only we take into account pairs that do not agree, in the analysis. The difference between paired proportions is from 64% - 44% = 20%. This mean that media A had 20% more of positive results than media B. The next step is to comprobe the hypothesis that ther is not difference between paired proportions, taking into account the paired nature of data.

Hypothesis test for binary paired data If there were no difference between the medias, we should expect similar numbers r and s, r ≈ s We can use a call McNemar test to assess whether the difference between the numbers of discordant pairs is greater than what you would expect by chance. To test the null hypothesis that there is no difference between the two proportions, we used the McNemar test: (|r-s|-1)2 X2paired= ----------------- r + s Subtracting 1 gives us a continuous correction. The value obtained is refered to X2 distribution table, with one degree of freedom.

Hypothesis test for binary paired data In the study of two culture media for tuberculosis bacilli: 24 were positives in media A and negatives in media B 4 were negatives in media A and positives in media B (|r-s|-1)2 (|24-4|-1)2 361 X2paired=---------------= --------------- = -------- = 12.81 p<0.05 r + s 24 + 4 28 Rejected the null hypothesis of non-difference between media. The obtained value is refered to X2 distribution table with one degree of freedom.

Confidence intervals for the difference of two paired proportions We know thta the difference between proportions of paired data can be calculate by: r – s / N Where: r and s are the number of discordant pairs N is the total number of pairs Standard error from the difference between paired proportions is: √r +s SE(p1-p2) = ----------- N

Confidence intervals from difference of two paired proportions General formula to calculate 95% confidence interval is: Estimate ± 1.96 x SE From the table of results of cultures from expectoration, with medias, A and B, we are using r and s values, and can calculate 95% confidence interval for paired proportions: r-s / N ± √r +s/N = 24-4/100±1.96 √24+4/100 = 0.2±0.10 = 0.1 a 0.3 = 10% a 30%- Confidence intervals from 0.1 to 0.3 mean that the percentage of positive cultures for the bacilli could be between 10% and 30% higher in media A than media B, in the population.

Odds Ratio for paired data In case-control studies, usually, we want to evaluate the risk with the exposure at a risk factor; for these studies, we need an effect measure. In case-control studies, we are using OR, that is a Ratio between odds of the exposure in the cases divided by odds of the exposure in controls. Calculate of OR with matched data, is based in discordant pairs, the same that the difference between proportions od paired data.

Odds Ratios for paired data Table of exposure in cases against exposure in controls Controls Cases Exposed Non-exposed Exposed k r Non-exposed s m k = number of pairs where the case and control were exposed r = number of pairs where the case was exposed and the control was not exposed s = number of pairs where the case was not exposed and the control was exposed. m = number of pairs where cases and controls were not exposed. Pairs of cases and controls that agree, they do not give any information on the risk associated with exposure. This information is given by pairs where exposure between cases and controls,differ.

Odds Ratio for paired data Odds Ratio is calculate as the Ratio of two groups of discordant pairs. r cases exposed controls not exposed OR = ---- = ------------------------------------------------- s cases not exposed controls exposed

Odds Ratio for paired data The table show the results of a matched case-control study, designed to investigate the association between the use of oral contraceptive (OCC) and thromboembolism. Controls Cases Use OCC Not use OCC Use OCC 10 57 Not use OCC 13 95 OR = 57/13 = 4.38

Confidence intervals for paired OR To calculate confidence intervals is a little more complicated. It is calculate using square root of the value of McNemar X2 test, instead of standard error. 95% confidence intervals for OR from apired data is: OR1±1.96/ X

Odds Ratio for paired data X²paired = 26.41 Xpaired = 5.14 Then, 95% confidence interval is: From 4.41-1.96/ 5.14 to 4.41+1.96/ 5.14 4.40.62 to 4.41.38 2.5 to 7.7 In this study, the probability of case exposed to oral contraceptive i 4.4 times higher that the probability of controls exposed to oral contraceptive. We have 95% of confidence that in the population, the probability of cases exposed is between 2.5 and 7.7 times higher that the probability of controls exposed. If the lower limit was below 1, the data should be consistent with the probability of exposure was higher in controls than in cases.

Bibliografía 1.- Last JM. A dictionary of epidemiology. New York, 4ª ed. Oxford University Press, 2001:173. 2.- Kirkwood BR. Essentials of medical ststistics. Oxford, Blackwell Science, 1988: 1-4. 3.- Altman DG. Practical statistics for medical research. Boca Ratón, Chapman & Hall/ CRC; 1991: 1-9.