Some Simple Statistical Slip- ups (and how to avoid them) Andrew Vickers Department of Epidemiology and Biostatistics Memorial Sloan-Kettering Cancer Center.

Slides:



Advertisements
Similar presentations
Inferential Statistics 1: Basic Concepts Fortrose Academy Geography Department Advanced Higher.
Advertisements

Regression and correlation methods
The %LRpowerCorr10 SAS Macro Power Estimation for Logistic Regression Models with Several Predictors of Interest in the Presence of Covariates D. Keith.
Statistical Analysis and Data Interpretation What is significant for the athlete, the statistician and team doctor? important Will Hopkins
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
Departments of Medicine and Biostatistics
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
1 Multiple Regression Interpretation. 2 Correlation, Causation Think about a light switch and the light that is on the electrical circuit. If you and.
Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Planning Your Study Statistical Issues.
Statistical Tests Karen H. Hagglund, M.S.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Statistical Methods Chichang Jou Tamkang University.
Confidence Interval for Relative Risk
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Basic Statistics for Research: Choosing Appropriate Analyses and Using SPSS Dr. Beth A. Bailey Dr. Tiejian Wu Department of Family Medicine.
Using Statistics in Research Psych 231: Research Methods in Psychology.
5-3 Inference on the Means of Two Populations, Variances Unknown
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Regression and Correlation
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
Biostat Didactic Seminar Series Analyzing Binary Outcomes: Analyzing Binary Outcomes: An Introduction to Logistic Regression Robert Boudreau, PhD Co-Director.
Hypothesis Testing:.
Comparing Two Population Means
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
COMM 250 Agenda - Week 12 Housekeeping RP2 Due Wed. RAT 5 – Wed. (FBK 12, 13) Lecture Experiments Descriptive and Inferential Statistics.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Introduction to Statistics Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Inference and Inferential Statistics Methods of Educational Research EDU 660.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Reasoning in Psychology Using Statistics
Project 1 FINA B. Group of 5. Due by 18/09/ parts. Each worth 50% of total. Need to provide 1 excel workbook for part 1 and part 2. This.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Smith/Davis (c) 2005 Prentice Hall Chapter Nine Probability, the Normal Curve, and Sampling PowerPoint Presentation created by Dr. Susan R. Burns Morningside.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Chapter Eight: Using Statistics to Answer Questions.
Framework of Preferred Evaluation Methodologies for TAACCCT Impact/Outcomes Analysis Random Assignment (Experimental Design) preferred – High proportion.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
1 Chapter 15 Data Analysis: Basic Questions © 2005 Thomson/South-Western.
More Contingency Tables & Paired Categorical Data Lecture 8.
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Instrument design Essential concept behind the design Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics and Demography Faculty of Public.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
LESSON 4.1. MULTIPLE LINEAR REGRESSION 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Dr.Theingi Community Medicine
Statistical analysis Christian Gratzke LMU Munich
Common Statistical Errors (and how to avoid them)
Hypothesis Testing.
What do you need to know? Introduction to the use of inferential statistics: the Sign Test What inferential tests allow us to do (that descriptive stats.
Applied Statistical Analysis
Inferential Statistics
Comparing Several Means: ANOVA
Statistical Inference about Regression
Inference for Relationships
Reasoning in Psychology Using Statistics
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Some Simple Statistical Slip- ups (and how to avoid them) Andrew Vickers Department of Epidemiology and Biostatistics Memorial Sloan-Kettering Cancer Center

Pop quiz p values

Perhaps the only slip up you need to avoid Not having a statistician

Statistics is essentially a straightforward issue of using computer software and can be done by a reasonably intelligent amateur

Anesthesia literature 9% of the 722 descriptive statistics had major errors 78% of inferential statistics had errors

An experiment Let’s choose the first paper from the Journal Urology Who did the stats? Were they any good?

*start with a "table 1" showing characteristics * we don't want list out all number of positive nodes, cap at 3 replace totalpos=3 if totalpos>3 *no positive nodes if no dissection! replace totalpos=. if lnd==0 *now create the categorical variable for number of positive nodes tab totalpos, g(posnoded) tempfile temp save `temp' *print out table 1 forvalues i=1(1)1{ quietly count disp "Total number of patients&", r(N) table1 lnd, type(cat) label(Lymph node dissection) table1 totalnodes if lnd==1, type(con) label(Lymph nodes removed) disp "Number of positive nodes" table1 posnoded1, type(cat) label(0) table1 posnoded2, type(cat) label(1) table1 posnoded3, type(cat) label(2) table1 posnoded4, type(cat) label(3+) }

g higleason=(bxggscat>6) g Stage_T2b=clinstagecat>2 *show multivariable model ** type in the rounding: n is how many significant figures local n=3 *** which type of estimate? *** answer Odds Ratio, Hazard Ratio or oefficient local q="Odds Ratio“ ***fixed number of decimal places? ***say yes or no local fixed="yes“ *** say how many places (ignored if "no") local d=2 ** type in the dependent variable for linear or logistic regression local dep = "lnd“ ** type in the name of the predictor variables local vars = " higleason psa" local vars = " higleason Stage_T2b psa" parmby "logistic `dep' `vars'", saving(results, replace) *

foreach v of local vars { quietly sum p if parm=="`v'" local ptemp=r(mean) if `ptemp'>=.95{ quietly replace pf="p=1" if parm=="`v'" } if `ptemp'>=0.2 & `ptemp'<0.95{ quietly replace pf="0"+string(round(`ptemp',.1)) if parm=="`v'" } if `ptemp' =0.1{ quietly replace pf="0"+string(round(`ptemp',.01)) if parm=="`v'" } if `ptemp' =0.001{ quietly replace pf="0"+string(round(`ptemp',.001)) if parm=="`v'" } if `ptemp' =0.0005{ quietly replace pf="0"+string(round(`ptemp',.0001)) if parm=="`v'" } if `ptemp'<0.0005{ quietly replace pf="<0.0005" if parm=="`v'" }

* establish variables which will contain the appropriate amount of rounding for each predictor local list = "estimate min95 max95" foreach l of local list { g `l'roundd =. g `l'roundf =. } * run this for each predictor foreach v of local vars { *this loop searches for how many decimal places are in the value forvalues i=`n'(-1)-8 { local decimals=10^(`i'-`n') *run this for each estimate foreach l of local list { quietly sum `l' if parm=="`v'" local e = r(mean) if abs(`e') = 10^(`i'-1) { quietly replace `l'roundd =`n'-`i' if parm=="`v'" }

Result? Predictor&Odds Ratio&95% C.I.&P Value Gleason 7+&42.81&16.54, &< Stage_T2b&2.10&0.52, 8.55&0.3 PSA&1.17&1.04, 1.32&0.01

Take home message Incorporation of biostatistical help is cited by experienced investigators as one of the key determinants of the success or failure of a research program

A quick tour of some assorted statistical slip ups

Slip up 1 Statisticians aren’t machines for producing p values

Statistical methods Inference –Is something there? –Hypothesis testing: p values Estimation –How big is it? –E.g. means, correlations, proportions, differences between groups

Statisticians can also help with… Thinking through the scientific question Experimental design Data collection Data quality assurance

Statistical slip up 2 I shoot penalties with Zlatan He scores 6 in a row I score 2 out of 6 P = 0.06 by Fisher’s exact

Zlatan won’t accept the null hypothesis I could play football in the Swedish national team

Inference 101 State a null hypothesis

Inference 101 State a null hypothesis Get your data, calculate p value

Inference 101 State a null hypothesis Get your data, calculate p value If p<5%, reject null hypothesis If p ≥5%, don’t reject null hypothesis

Statistical slip up 2 Don’t accept the null hypothesis In a court case: guilty or not guilty In a statistical test: reject or don’t reject

Statistical slip up 3 RESULTS: Compared with a BMI of 18.5 to 21.9 kg/m2 at age 18 years, the hazard ratio for premature death was 2.79 (CI, 2.04 to 3.81) for a BMI of 30 kg/m2 or greater. CONCLUSION: Moderately higher adiposity at age 18 years is associated with increased premature death in younger and middle-aged U.S. women

Biostatistics Biology Math Biology

Statistical slip up 3 A result isn’t a conclusion

Statistical slip up 4 Mean gestational time was weeks in the experimental group compared to weeks in controls (p=0.6945).

Statistical slip 4 Every number you write down means something

Statistical slip up 5 Whereas Erk3, ECAD, P21, P53, Cadherin, il 6, il12 and Jak had no association with outcome (p>0.2 for all), Ki67 was a predictor of recurrence (p=0.03). We recommend that Ki67 be measured to determined eligibility for adjuvant chemotherapy.

Statistical slip up 5 Multiple testing. Looked at 9 different biomarkers. 35% chance of at least one marker with p<0.05. A statistical association isn’t grounds for a change in practice.