LIES, MORE LIES AND STATISTICS

Slides:



Advertisements
Similar presentations
To Select a Descriptive Statistic
Advertisements

CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Departments of Medicine and Biostatistics
Statistical Tests Karen H. Hagglund, M.S.
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
QUANTITATIVE DATA ANALYSIS
Chapter Seventeen HYPOTHESIS TESTING
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Chapter 14 Analyzing Quantitative Data. LEVELS OF MEASUREMENT Nominal Measurement Nominal Measurement Ordinal Measurement Ordinal Measurement Interval.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
 Mean: true average  Median: middle number once ranked  Mode: most repetitive  Range : difference between largest and smallest.
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Analyzing and Interpreting Quantitative Data
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Analyzing and Interpreting Quantitative Data
Chapter 6: Analyzing and Interpreting Quantitative Data
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Easy (and not so easy) questions to ask about adolescent health data J. Dennis Fortenberry MD MS Indiana University School of Medicine.
Chapter 13 Understanding research results: statistical inference.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Hypothesis Testing and Statistical Significance
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Dr Hidayathulla Shaikh. Objectives At the end of the lecture student should be able to – Discuss normal curve Classify parametric and non parametric tests.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
Outline Sampling Measurement Descriptive Statistics:
Statistics & Evidence-Based Practice
Inferential Statistics
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
BIOSTATISTICS Qualitative variable (Categorical) DESCRIPTIVE
CHAPTER 4 Research in Psychology: Methods & Design
Non-Parametric Tests 12/1.
Understanding Results
Non-Parametric Tests 12/6.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Analyzing and Interpreting Quantitative Data
Statistics for Psychology
Non-Parametric Tests.
USING DATA TO COMMUNICATE
Applied Statistical Analysis
Inferential statistics,
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Medical Statistics Dr. Gholamreza Khalili
SDPBRN Postgraduate Training Day Dundee Dental Education Centre
SA3202 Statistical Methods for Social Sciences
NURS 790: Methods for Research and Evidence Based Practice
1.3 Data Recording, Analysis and Presentation
Unit XI: Data Analysis in nursing research
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Experimental Design Data Normal Distribution
15.1 The Role of Statistics in the Research Process
Quantitative Data Analysis
Interpreting Epidemiologic Results.
InferentIal StatIstIcs
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Introductory Statistics
Examine Relationships
Presentation transcript:

LIES, MORE LIES AND STATISTICS AUTHORSHIP WORKSHOP: TRANSLATING YOUR THESIS INTO A PUBLICATION 7th December 2016 Dr James Kigera University of Nairobi

Data Flow Data management Study data Analysis base Clinical data Trial report Study data base Clinical data Source document CRF Publication Analysis Data base closure Checked & cleaned

Useful description Some people are young and some people are a bit older, and other people are even older and very few people are extremely old, and I forgot .. Some people are extremely young when they finish medical school ???

VARIABLES DISCRETE CONTINOUS BINARY/DICHOTOMOUS CONSTANT SCALES OF MEASUREMENT NOMINAL ORDINAL RATIO INTERVAL

DESCRIPTIVE CENTRAL TENDANCY MODE MEDIAN MEAN VARIATION RANGE QUARTILES VARIANCE STANDARD DEVIATION SHAPE SKEWNESS KURTOSIS

Epidemiological measures Measures of frequency Prevalence (point prevalence, period prevalence) Incidence (risk / cumulative incidence, odds, incidence rate) Quantifying the size of the problem, descriptive Measures of association Absolute measures (attributable risk, population attributable risk) Quantifying public health impact Relative measures (risk ratio, rate ratio, odds ratio) Quantifying strength of association between exposure and outcome, use for evaluation of causality

INFERENTIAL STATITICS CONFIDENCE INTERVALS HYPOTHESIS TESTING RELATIONSHIP BETWEEN GROUPS T TEST ANOVA CHI SQUARE FISCHERS EXACT SIGN TEST RANK SUM TEST SIGNED RANK TEST MANN-WHITENEY KRUSKAL-WALLIS

Choice of test Fisher’s exact test if exp value < 5

It must be alpha! Beautiful knight alpha? Ugly knight beta? He saw a relationship where there was none –alpha error He failed to see a relation where there was one – beta error It must be alpha!

P Hypothesis testing The P- value, what is it? The probability of obtaining a value of the test statistic that is at least as extreme as the one that was actually observed, given that the null hypothesis is true [Cave : in essence, the p-value is an attribute of the test statistic, NOT of the effect measure. This is often wrongly explained in reference text books , including Fletcher & Fletcher, the LSHTM intro book and the Gao Smith & Smith book. @check what Petrie and Hennekens say@ Who can tell me what it is? For example, the OR for the association between crowding and TB acquisition is 3.5, the associated X2 is 4.1 and the p-value is 0.10. What does this p-value of 0.10 mean? That if in reality there is no association between crowding and TB, the probability of obtaining a X2 of 4.1 or higher is 0.10. Or: if we would repeat the study an innumerable amount of times, in 1 out of 10 times we will incorrectly conclude that there is a difference in TB acquisition between the crowding groups while in reality there isn’t any difference. Notice that the size of the effect measure is not at all considered with hypothesis testing. This is a major objection against the use of p-values alone. 12 12

Applied Clinical Research Course, M2, Kigali, 7-11 July 2008 Hypothesis testing Caution with p-values! Black /white , yes / no result Always state the probability , NEVER write P <0.05 or P >0.05 Lends a false air of authority or objectivity If results are “not significant “ they may not be published Confusion of statistical significance with clinical significance Hypothesis testing is appealing because it seems to simplify the hard work of interpreting data . However, there are a number of poblems involved with hypothesis testing: [see slide] Applied Clinical Research Course, M2, Kigali, 7-11 July 2008 13 13

COMPARATIVE STATISTICS RELATIONSHIP BETWEEN VARIABLES VISUAL DIRECTION SHAPE CORRELATION COEFICIENT

Case study Location: KNH Study type: Intervention study Randomized clinical trial Subjects: consecutive CHOLELITHIASIS PATIENTS aged >=18 Outcome: PAIN AND SCAR APPEARANCE ASSESSED BY BLINDED NURSE

Two Interventions Laparascopic cholecystectomy and scar massage Randomization Patients underwent Open Cholecystectomy Patients underwent Laparascopic Cholecystectomy Patients underwent Open Cholecystectomy and Scar Massage

Study 3 N = 412 patients randomized Results: 223 patients had cholelithiasis Pain and Scar Composite Scores at 6 months follow-up Open group (n=75)  7200 Median (IQR: 1200,0000 to 21120) Laparascopic Group (n =75) 28900 Median (IQR: 15312 to 43812) Laparascopy and Massage group (n= 73)  34800 Median (IQR 18800 to 53350)

Study 3: Question 1 How many groups do we have? Is data normally distributed? No How many groups do we have? 3 groups. Is the data paired or unpaired (independent)? unpaired groups – independent from each other What type of data is this? Numerical/ continuous Is the sample size large (>20)? Yes (n = 223)

Choice of test Fisher’s exact test if exp value < 5

SUMMARY DATA ANALYSIS Collect Useful Data Process Appropriately Understand your Variables Use the correct Test Statistical vs Clinical Significance