Understanding Statistics in Research Articles Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management Assistant Professor,

Slides:



Advertisements
Similar presentations
How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
Advertisements

II. Potential Errors In Epidemiologic Studies Random Error Dr. Sherine Shawky.
Statistical Analysis and Data Interpretation What is significant for the athlete, the statistician and team doctor? important Will Hopkins
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Critically Evaluating the Evidence: diagnosis, prognosis, and screening Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management.
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
Find the Joy in Stats ? ! ? Walt Senterfitt, Ph.D., PWA Los Angeles County Department of Public Health and CHAMP.
Introduction to Risk Factors & Measures of Effect Meg McCarron, CDC.
Journal Club Alcohol and Health: Current Evidence March-April 2007.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Statistics By Z S Chaudry. Why do I need to know about statistics ? Tested in AKT To understand Journal articles and research papers.
Statistics for Health Care
Basic Statistical Concepts Donald E. Mercante, Ph.D. Biostatistics School of Public Health L S U - H S C.
Today Concepts underlying inferential statistics
By Dr. Ahmed Mostafa Assist. Prof. of anesthesia & I.C.U. Evidence-based medicine.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
The Bahrain Branch of the UK Cochrane Centre In Collaboration with Reyada Training & Management Consultancy, Dubai-UAE Cochrane Collaboration and Systematic.
Inference for regression - Simple linear regression
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Multiple Choice Questions for discussion
Evidence-Based Medicine 4 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Estimation of Various Population Parameters Point Estimation and Confidence Intervals Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Basic statistics 11/09/13.
Confidence Intervals Nancy D. Barker, M.S.. Statistical Inference.
Health and Disease in Populations 2001 Sources of variation (2) Jane Hutton (Paul Burton)
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
FRAMING RESEARCH QUESTIONS The PICO Strategy. PICO P: Population of interest I: Intervention C: Control O: Outcome.
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
Measures of Association
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
Beak of the Finch Natural Selection Statistical Analysis.
EBCP. Random vs Systemic error Random error: errors in measurement that lead to measured values being inconsistent when repeated measures are taken. Ie:
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Average Arithmetic and Average Quadratic Deviation.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Understanding Medical Articles and Reports Linda Vincent, MPH UCSF Breast SPORE Advocate September 24,
Measuring associations between exposures and outcomes
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
Lecture & tutorial material prepared by: Dr. Shaffi Shaikh Tutorial presented by: Dr. Rufaidah Dabbagh Dr. Nurah Al-Amro.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Design and Analysis of Clinical Study 7. Analysis of Case-control Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Diagnostic Test Characteristics: What does this result mean
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Chapter 13 Understanding research results: statistical inference.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
EBM --- Journal Reading Presenter :黃美琴 Date : 2005/10/27.
Direct method of standardization of indices. Average Values n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the.
2 3 انواع مطالعات توصيفي (Descriptive) تحليلي (Analytic) مداخله اي (Interventional) مشاهده اي ( Observational ) كارآزمايي باليني كارآزمايي اجتماعي كارآزمايي.
Dr.Theingi Community Medicine
Methods of Presenting and Interpreting Information Class 9.
HelpDesk Answers Synthesizing the Evidence
Lecture 9-I Data Analysis: Bivariate Analysis and Hypothesis Testing
The binomial applied: absolute and relative risks, chi-square
Diagnosis II Dr. Brent E. Faught, Ph.D. Assistant Professor
SAMPLE SIZE DETERMINATION
Evaluating the Role of Bias
Interpreting Basic Statistics
Measurements of Risk & Association …
Research Techniques Made Simple: Interpreting Measures of Association in Clinical Research Michelle Roberts PhD,1,2 Sepideh Ashrafzadeh,1,2 Maryam Asgari.
Risk Ratio A risk ratio, or relative risk, compares the risk of some health-related event such as disease or death in two groups. The two groups are typically.
Presentation transcript:

Understanding Statistics in Research Articles Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management Assistant Professor, Library

Statistics – definition and concepts Statistics are used to describe something, or to examine differences among groups, or relationships among characteristics – Descriptive Statistics Mean and median Standard deviation – Inferential Statistics Statistical significance – p-value Confidence intervals Odds ratio Relative Risk Sensitivity/Specificity Positive/Negative Predictive Values

Mean and Median What’s the average cost of a house in this neighborhood?

Mean and Median What’s the average cost of a house in this neighborhood? Mean value: $1,009,000

Mean and Median What’s the average cost of a house in this neighborhood? Median value: $10,000

Standard Deviation How spread out is the data from the mean?

The P value Taking statistics to the next level… “factors that raise your chance of divorce include living in a red state, having twins, and contracting cervical or testicular cancer…” differences between groups relationships between things

Testing for significance Sample size Findings Characteristics of population

Testing for significance Sample size Findings Characteristics of population p < 0.05

Confidence Intervals: another (and maybe better?) test for statistical significance Confidence intervals provide information about a range in which the true value lies with a certain degree of probability

Risk Factors for Deep Vein Thrombosis and Pulmonary Embolism (Heit et al., 2000) Objective: To identify independent risk factors for deep vein thrombosis and pulmonary embolism and to estimate the magnitude of risk for each. Results: “Independent risk factors for VTE included surgery (odds ratio [OR], 21.7; 95% confidence interval [CI], ), ….”

Interpreting the Results What does odds ratio 21.7 (95% CI ) mean? – We can be 95% confident that the odds ratio will fall between 9.4 and 49.9 if the study were replicated – OR if we performed the study 100 times, the odds ratio would be between 9.4 and 49.9 in 95 of the studies

P-values vs. Confidence Intervals P-valuesConfidence Intervals Clearer than confidence intervals Result given directly at level of data measurement Allow for rapid decision as to whether a value is statistically significant (binary response) Provide info about statistical significance as well as direction and STRENGTH of effect May be overly simplistic (really much difference between 0.04 and 0.06???) Allow for assessment of clinical relevance

Statistical significance and clinical relevance: one in the same?

Odds ratio compares whether the odds of a certain event happening is the same for two groups The odds of an event happening is found by taking the odds the event will happen/odds the event will not happen – An odds ratio of 1 implies the event is equally likely in both groups – An odds ratio > 1 implies the event is more likely in the first group – An odds ratio < 1 implies that the event is less likely in the first group

Males and Females on the Titanic AliveDeadTotal Female Male Total The odds ratio compares the relative odds of death in each group. For females the odds were 154/308=0.5 (or 2 to 1 against dying). For males the odds were almost 5 to 1 in favor of death (709/142=4.993). The odds ratio then is 4.993/0.5= There is a 10 fold greater odds of death for males than for females.

Relative Risk (sometimes called the risk ratio) compares the probability of death in each group AliveDeadTotal Female Male Total In the case of our Titanic example, the probability of death for females is 154/462= For males the probability is 709/851= The RR is then /0.3333= 2.5. There is a 2.5 greater probability of death for males than females. Relative Risk comes closer to what most people think of when they compare the relative likelihood of events, but sometimes it is not possible to compute RR in a research design.

Relative risk=1 When the relative risk is one, the risk in the exposed group is the same as the risk in the unexposed group. There is indication of neither benefit nor harm. Relative risk<1 When the relative risk is less than one then the exposure is associated with a protective effect. Relative risk>1 When the relative risk is greater than one, then the exposed group have greater risk of contracting the disease, so the exposure is associated with harm. Interpreting Relative Risk

Huh? Odds and Probability Explained Example: for every 3 attempts there will be one successful outcome The language differs: “one to two” is an odds; expressed as the number; 0.5 “one in three” is a probability; expressed as a fraction; 1/3

Risk Factors for Deep Vein Thrombosis and Pulmonary Embolism (Heit et al., 2000) Objective: To identify independent risk factors for deep vein thrombosis and pulmonary embolism and to estimate the magnitude of risk for each. Results: “Independent risk factors for VTE included surgery (odds ratio [OR], 21.7; 95% confidence interval [CI], ), ….”

Interpreting the Results What does (OR 21.7, 95% CI 9.4 – 49.9) mean? – Patients who have had surgery have a 21.7 to 1 odds of developing a venous thromboembolism, compared to patients who have not undergone surgery – We can be 95% confident that the odds ratio would be between 9.4 and 49.9 if the study were repeated

Sensitivity and Specificity Sensitivity is the proportion of true positives that are correctly identified by a test or measure (e.g., percent of sick people correctly identified as having the condition) Ex: If 100 patients known to have a disease were tested, and 43 test positive, then the test has 43% sensitivity. Specificity is the proportion of true negatives that are correctly identified by the test (e.g., percent of healthy people correctly identified as not having the condition) Ex: If 100 patients with no disease are tested and 96 return a negative result, then the test has 96% specificity.

Relationship between results of liver scan and correct diagnosis: sensitivity/specificity AbnormalNormal Liver Scan(+)(-)Total Abnormal Normal Total How good (sensitive/specific) is the liver scan at diagnosing abnormal pathology? There are 258 true positives and 86 true negatives. The proportions of these two groups that were correctly diagnosed by the scan were 231/258=0.90 and 54/86=0.63. We can expect that 90% of patients with abnormal pathology to have abnormal (positive) liver scans: 90% sensitivity. We can expect that 63% of the patients with normal pathology to have normal (negative) liver scans.: 63% specificity.

Patients and clinicians have a different question… Positive and Negative Predictive Values Positive predictive value is the probability that a patient with a positive test result really does have the condition for which the test was conducted. Negative predictive value is the probability that a patient with a negative test result really is free of the condition for which the test was conducted Predictive values give a direct assessment of the usefulness of the test in practice – influenced by the prevalence of disease in the population that is being tested

Relationship between results of liver scan and correct diagnosis: +/- predictive values AbnormalNormal Liver Scan(+)(-)Total Abnormal Normal Total Of the 263 patients with abnormal liver scans 231 had abnormal pathology, giving the proportion of correct diagnoses as 231/263 = Similarly, among the 81 patients with normal liver scans the proportion of correct diagnoses was 54/81 = 0.67.

Prevalence, Predictive Values and Sensitivity/Specificity

Acknowledgements Dr. Charles Macias, lecture, Evidence-based medicine: why does it matter? Texas Children’s Hospital Evidence-Based Outcomes Center Evidence-Based Medicine course handouts Texas Children’s Hospital Lean Six Sigma Green Belt Certification material Craig Hospital, Those Scary Statistics!