University of Sydney Statistics 101: Power, p-values and ………... publications. Dr. Gordon S Doig, Senior Lecturer in Intensive Care, Northern Clinical School.

Presentation transcript:

Analysis 101: The basic tests
- t-test
- paired t-test
- Wilcoxon Rank Sum test (Mann-Whitney U test)
- Wilcoxon Signed Rank Sum test
- Kolmogorov-Smirnov (one and two sample test)
- Chi-square test
- Fisher’s Exact test
- ANOVA
- Kruskal-Wallis rank test
- repeated measures ANOVA

Why do we need statistics???
When we conduct any type of research, we can make at least two major types of errors when we draw our conclusions:
I) we claim to have found an important treatment effect when in reality there is no treatment effect (a Type I error).
II) we claim that no treatment effect exists when in reality there is an important treatment effect (a Type II error).

Why do we need statistics??? Some important definitions:
What is a p-value? The probability of observing a difference at least as large as the one we observed if chance alone were at work, i.e. if there were truly no treatment effect.
What is power? The probability that, if there is a real difference, our experiment will find it.
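
The p-value definition above can be made concrete with a small permutation test. This is only a sketch: the function name `permutation_p_value` and the outcome values are hypothetical and not part of the original slides. It counts how often relabelling the pooled values into two groups at random gives a difference in means at least as large as the observed one.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(group_a, group_b):
    """Two-sided exact permutation p-value for a difference in means."""
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))

    extreme = 0
    total = 0
    # Enumerate every way of relabelling the pooled values into two groups.
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating point round-off.
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical outcome values for two small groups.
control = [1, 2, 3]
treated = [4, 5, 6]
print(permutation_p_value(control, treated))  # 2 of 20 relabellings -> 0.1
```

Full enumeration is only practical for small samples; with larger groups a random subset of relabellings is normally used instead.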

Sample size calculations: The use of Power
Every experiment should start with a sample size calculation.
- Having adequate power protects us from Type II errors.
- Forces us to consider a primary outcome for our experiment.
  - primary outcomes can be continuous, categorical (interval, ordered, unordered), dichotomous
- Should consider issues of design in order to simplify analysis.
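
As a rough illustration of a sample size calculation for a continuous primary outcome compared between two groups, the sketch below uses the standard normal approximation n = 2((z_alpha + z_power) * sd / delta)^2 per group. The function name `n_per_group`, its defaults and the example numbers are assumptions for illustration; the slides do not prescribe a formula, and a statistics package would usually refine this with the t distribution.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(delta, sd, alpha=0.05, power=0.80):
    """Approximate sample size per group for comparing two means.

    delta: smallest difference in means worth detecting (assumed by the user)
    sd:    standard deviation of the outcome (assumed equal in both groups)
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided Type I error rate
    z_power = NormalDist().inv_cdf(power)          # power = 1 - Type II error rate
    return ceil(2 * ((z_alpha + z_power) * sd / delta) ** 2)


# Hypothetical: detect a 5 unit difference when the SD is 10 units.
print(n_per_group(delta=5, sd=10))              # 63 per group at 80% power
print(n_per_group(delta=5, sd=10, power=0.90))  # 85 per group at 90% power
```

Halving the difference to be detected roughly quadruples the required sample size, which is why the primary outcome and the target difference need to be fixed before the experiment starts.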

Analysis 101: The use of P???
Selection of appropriate study design / analytic technique:
- protects from Type I errors.
- is driven by a combination of study outcome and study design.

Analysis 101: Basics of experimental design
1) Before and after trial
- physiological parameter/outcome measured
- intervention delivered
- physiological parameter/outcome measured again
- compare measurement before with measurement after, usually in same subject
2) Comparison between two groups
- subjects are randomly assigned to one of two groups
- one group receives intervention
- compare outcome between two groups after intervention
3) Comparison between more than two groups
- as above but subjects are assigned to more than two groups
- could compare 3 different drugs or 3 different doses

Analysis 101: Outcome identification
Primary outcomes can be 1) continuous, 2) categorical (interval, ordered, unordered), 3) dichotomous
1) Continuous outcomes:
- most physiological parameters (Hb, pressures, biochemistry)
- usually involves a direct measurement
- often Normally distributed

Analysis 101: Outcome identification
2) Categorical outcomes:
a) interval
- equal unit change between each ordered category
- length of stay, age, time to event, some scoring systems
- may be Normally distributed
b) ordered
- unequal unit change between each ordered category
- most scoring systems, tumor stage or grade, low-medium-high
- not usually Normally distributed
c) unordered
- no sequential order to categories
- type of tumor, location, diagnosis
- re-think outcome selection!!!!

Analysis 101: Outcome identification
3) Dichotomous outcomes:
- only two possible outcome states
- tumor / no tumor
- dead / alive
- follows Binomial distribution

Analysis 101: Design and Analysis
1) Before and after trial (same subjects, continuous and interval)
Step 1: Determine if outcome is Normally distributed
- plot histogram with density function line
- could ‘formally’ test using Shapiro-Wilk statistic
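
The histogram check in Step 1 can be sketched without any plotting library by comparing the count in each bin with the count expected under a Normal curve fitted to the same data. The function name `histogram_vs_normal`, the bin count and the measurements are hypothetical; a formal Shapiro-Wilk test would normally come from a statistics package rather than hand-written code.

```python
from statistics import NormalDist, mean, stdev


def histogram_vs_normal(values, n_bins=5):
    """Return (bin_start, bin_end, observed, expected) for equal-width bins.

    Assumes the values are not all identical, so the bin width is positive.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        # The maximum value is placed in the last bin.
        counts[min(int((v - lo) / width), n_bins - 1)] += 1
    # Normal curve with the sample mean and standard deviation.
    fitted = NormalDist(mean(values), stdev(values))
    rows = []
    for k, observed in enumerate(counts):
        start, end = lo + k * width, lo + (k + 1) * width
        expected = len(values) * (fitted.cdf(end) - fitted.cdf(start))
        rows.append((start, end, observed, expected))
    return rows


# Hypothetical measurements of a continuous outcome.
values = [4.1, 4.8, 5.0, 5.2, 5.3, 5.5, 5.6, 5.9, 6.1, 6.8]
for start, end, observed, expected in histogram_vs_normal(values):
    print(f"{start:5.2f}-{end:5.2f} {'#' * observed:<6} expected {expected:4.1f}")
```

Large gaps between the observed bars and the expected counts, especially a long tail on one side, suggest the outcome is not Normally distributed.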

Analysis 101: Design and Analysis
1) Before and after trial (same subjects, continuous and interval)
- Normally distributed: paired t-test
- not Normally distributed: Wilcoxon Signed Rank Sum test
NB - if ordered categorical outcome, use one sample Kolmogorov-Smirnov test
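
For a before and after design, both tests work on the within-subject differences. The sketch below computes the paired t statistic and an exact two-sided Wilcoxon signed rank p-value by enumerating sign patterns. The function names and data are hypothetical, and the signed rank code assumes no zero or tied differences and a small sample, since it enumerates 2^n patterns.

```python
from itertools import product
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(before, after):
    """t statistic for paired data; compare with a t distribution on n - 1 df."""
    diffs = [a - b for a, b in zip(after, before)]
    return mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))


def signed_rank_exact_p(before, after):
    """Two-sided exact Wilcoxon signed rank p-value (no zero or tied differences)."""
    diffs = [a - b for a, b in zip(after, before)]
    ordered = sorted(diffs, key=abs)          # rank differences by absolute size
    ranks = range(1, len(ordered) + 1)
    total = sum(ranks)
    w_plus = sum(r for r, d in zip(ranks, ordered) if d > 0)
    w_obs = min(w_plus, total - w_plus)
    # Under the null hypothesis every pattern of signs is equally likely.
    hits = 0
    for signs in product((0, 1), repeat=len(ordered)):
        w = sum(r for r, s in zip(ranks, signs) if s)
        if min(w, total - w) <= w_obs:
            hits += 1
    return hits / 2 ** len(ordered)


# Hypothetical measurements on the same five subjects.
before = [10, 12, 9, 11, 13]
after = [11, 14, 12, 15, 18]
print(paired_t_statistic(before, after))
print(signed_rank_exact_p(before, after))  # 2 of 32 sign patterns -> 0.0625
```

With only five subjects the smallest possible two-sided signed rank p-value is 0.0625, so the non-parametric test can never reach p < 0.05 here, which ties back to the need for a sample size calculation.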

Analysis 101: Design and Analysis
2) Comparison between two groups (continuous and interval)
Step 1: Determine if outcome is Normally distributed
- plot histogram (use all data) with density function line
- could ‘formally’ test using Shapiro-Wilk statistic

Analysis 101: Design and Analysis
2) Comparison between two groups (continuous and interval)
- Normally distributed: t-test
- not Normally distributed: Wilcoxon Rank Sum test
NB - if ordered categorical outcome, use two sample Kolmogorov-Smirnov test
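
The Wilcoxon Rank Sum (Mann-Whitney U) test replaces the outcome values with their ranks in the pooled sample. A minimal sketch, assuming no tied values and small groups so that every possible allocation of ranks can be enumerated; the function name `rank_sum_exact_p` and the data are hypothetical.

```python
from itertools import combinations


def rank_sum_exact_p(group_a, group_b):
    """Return (Mann-Whitney U for group A, two-sided exact p-value).

    Assumes no tied values across the pooled sample.
    """
    pooled = sorted(list(group_a) + list(group_b))
    rank = {v: i + 1 for i, v in enumerate(pooled)}
    n_a, n = len(group_a), len(pooled)
    w_obs = sum(rank[v] for v in group_a)   # observed rank sum of group A
    centre = n_a * (n + 1) / 2              # expected rank sum under the null
    hits = total = 0
    # Under the null every set of n_a ranks is equally likely for group A.
    for ranks in combinations(range(1, n + 1), n_a):
        total += 1
        if abs(sum(ranks) - centre) >= abs(w_obs - centre):
            hits += 1
    u_a = w_obs - n_a * (n_a + 1) / 2
    return u_a, hits / total


# Hypothetical outcome values for two independent groups.
group_a = [1.1, 2.3, 3.0]
group_b = [4.2, 5.5, 6.1, 7.4]
u, p = rank_sum_exact_p(group_a, group_b)
print(u, p)  # U = 0.0, p = 2/35
```

Statistics packages switch to a Normal approximation and apply a tie correction for larger samples; this sketch does neither.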

Analysis 101: Design and Analysis
3) Comparison between more than two groups
Step 1: Determine if outcome is Normally distributed
- plot histogram (use all data) with density function line
- could ‘formally’ test using Shapiro-Wilk statistic

Analysis 101: Design and Analysis
3) Comparison between more than two groups
- Normally distributed: ANOVA
- not Normally distributed: Kruskal-Wallis rank test
NB - could transform (calculate the log or ln) each outcome value and redo histogram. If transformed values are Normally distributed, can now use ‘more powerful’ ANOVA (or t-test if 2 samples).
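
For more than two groups, the two test statistics can be sketched as follows. The function names and data are hypothetical, the Kruskal-Wallis code assumes no tied values, and neither function returns a p-value: the F statistic is referred to an F distribution on (k - 1, N - k) degrees of freedom and H to a chi-square distribution on k - 1 degrees of freedom, which a statistics package would look up.

```python
from statistics import mean


def anova_f(groups):
    """One-way ANOVA F statistic with (k - 1, N - k) degrees of freedom."""
    all_values = [v for g in groups for v in g]
    grand = mean(all_values)
    k, n = len(groups), len(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    # Variation of the values around their own group mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic, assuming no tied values."""
    pooled = sorted(v for g in groups for v in g)
    rank = {v: i + 1 for i, v in enumerate(pooled)}
    n = len(pooled)
    term = sum(sum(rank[v] for v in g) ** 2 / len(g) for g in groups)
    return 12 / (n * (n + 1)) * term - 3 * (n + 1)


# Hypothetical outcome values for three doses.
doses = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(anova_f(doses))           # 27.0
print(kruskal_wallis_h(doses))  # approximately 7.2
```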

Analysis 101: Dichotomous outcomes
1) Before and after trial
- rate before intervention compared to rate after intervention
- McNemar’s chi-square
2) Comparison between two groups
- create 2x2 table, calculate rate for each

  Group      Dead    Alive
  Group A                     % mortality
  Group B                     % mortality

- compare using chi-square test
NB - if any one cell contains < 5 counts, use Fisher’s Exact test
3) Comparison between more than two groups
- undertake a series of comparisons via 2x2 tables as above
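
The three tests for dichotomous outcomes can be sketched with the standard library alone. The function names and counts are hypothetical. The chi-square version here applies no continuity correction, and the two-sided Fisher's Exact p-value uses the common convention of summing the probabilities of all tables no more likely than the observed one. Note that the usual rule of thumb for switching to Fisher's Exact test is stated in terms of expected cell counts below 5.

```python
from math import comb, erfc, sqrt


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return stat, erfc(sqrt(stat / 2))  # p-value on 1 degree of freedom


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's Exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):  # hypergeometric probability of x in the top-left cell
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of every table that is no more likely than the observed one.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


def mcnemar(b, c):
    """McNemar's chi-square from the two discordant before/after counts."""
    stat = (b - c) ** 2 / (b + c)
    return stat, erfc(sqrt(stat / 2))


# Hypothetical counts: rows are Group A and Group B, columns are dead and alive.
print(chi_square_2x2(10, 20, 20, 10))
print(fisher_exact_2x2(3, 1, 1, 3))
# Hypothetical discordant pairs: 10 changed one way, 2 changed the other way.
print(mcnemar(10, 2))
```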

Analysis 101: Special considerations
Transformations:
- Sometimes it’s possible to ‘transform’ a long tailed distribution to a normal distribution.
- Calculate the log or ln of each outcome value and redo histogram.
- Allows us to apply ‘more powerful’ tests based on assumption of Normality (paired t-test, t-test, ANOVA).
- Try non-parametric test first <- fewer assumptions!!!!
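
A quick way to see what a log transformation does to a long tailed outcome is to compare the sample skewness before and after. This is a sketch with hypothetical length-of-stay values; `skewness` is a helper defined here, not a standard library function, and values must be strictly positive for the log to exist.

```python
from math import log
from statistics import mean, pstdev


def skewness(values):
    """Sample skewness: about 0 for symmetric data, positive for a long right tail."""
    m, s = mean(values), pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)


# Hypothetical length of stay in days, with a long right tail.
length_of_stay = [1, 2, 2, 3, 3, 4, 5, 8, 13, 30]
logged = [log(v) for v in length_of_stay]  # natural log (ln) of each value

print(skewness(length_of_stay))  # strongly positive
print(skewness(logged))          # much closer to 0
```

After transforming, the histogram check should be repeated on the transformed values before relying on a test that assumes Normality.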

Analysis 101: Special considerations
The t-test has 3 basic, fundamental underlying assumptions:
1) Outcomes are Normally distributed
- test assumptions of Normality
- if violated, use non-parametric tests
2) Outcomes are independent
- if outcomes are from same subjects, use paired t-test
3) The variance of each group is similar
- stats package should formally test equality of variances
- it reports a different p-value for each condition (equal and unequal variances)
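
The third assumption can be illustrated by computing the t statistic both ways. In this sketch the unequal-variance version is Welch's t statistic, which is an assumption about what a given package reports under its "unequal variances" heading; the function names and data are hypothetical, and `variance_ratio` is only an informal check, not a formal test of equality of variances.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(a, b):
    """Student's t statistic, assuming both groups share a common variance."""
    na, nb = len(a), len(b)
    sp2 = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(sp2 * (1 / na + 1 / nb))


def welch_t(a, b):
    """Welch's t statistic, which does not assume equal variances."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def variance_ratio(a, b):
    """Larger sample variance divided by the smaller one."""
    va, vb = variance(a), variance(b)
    return max(va, vb) / min(va, vb)


# Hypothetical outcome values; group_b is far more spread out than group_a.
group_a = [1, 2, 3, 4, 5]
group_b = [2, 4, 6, 8, 10]
print(variance_ratio(group_a, group_b))  # 4.0
print(pooled_t(group_a, group_b), welch_t(group_a, group_b))
```

With equal group sizes the two statistics coincide and only the degrees of freedom differ; with unequal group sizes and unequal variances they can diverge noticeably, which is why the package reports both.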

Analysis 101: Summary
- t-test (two groups, Normally distributed)
- paired t-test (before/after, Normally distributed)
- Wilcoxon Rank Sum test (two groups, non-parametric)
- Wilcoxon Signed Rank Sum test (before/after, non-parametric)
- Kolmogorov-Smirnov (before/after, two groups, ordered categorical)
- Chi-square test (dichotomous outcome)
- Fisher’s Exact test (dichotomous outcome, any cell size < 5)
- ANOVA (more than two groups, Normally distributed)
- Kruskal-Wallis rank test (more than two groups, non-parametric)
- repeated measures ANOVA
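
The summary can be written as a small lookup from study design and outcome type to the suggested test. The key names (`before_after`, `two_groups`, `more_than_two_groups`, `normal`, `non_normal`, `ordered_categorical`, `dichotomous`) and the function `choose_test` are hypothetical labels introduced for this sketch; the mapping itself follows the slides, and combinations the slides do not cover are deliberately left out.

```python
TESTS = {
    ("before_after", "normal"): "paired t-test",
    ("before_after", "non_normal"): "Wilcoxon Signed Rank Sum test",
    ("before_after", "ordered_categorical"): "one sample Kolmogorov-Smirnov test",
    ("before_after", "dichotomous"): "McNemar's chi-square",
    ("two_groups", "normal"): "t-test",
    ("two_groups", "non_normal"): "Wilcoxon Rank Sum test",
    ("two_groups", "ordered_categorical"): "two sample Kolmogorov-Smirnov test",
    ("two_groups", "dichotomous"): "chi-square test (Fisher's Exact test if any cell < 5)",
    ("more_than_two_groups", "normal"): "ANOVA",
    ("more_than_two_groups", "non_normal"): "Kruskal-Wallis rank test",
    ("more_than_two_groups", "dichotomous"): "series of 2x2 chi-square comparisons",
}


def choose_test(design, outcome):
    """Look up the test suggested in the slides; raise if the slides give none."""
    try:
        return TESTS[(design, outcome)]
    except KeyError:
        raise ValueError(f"no test listed for {design!r} with {outcome!r}") from None


print(choose_test("two_groups", "non_normal"))  # Wilcoxon Rank Sum test
```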