+ Gladstone Bioinformatics Core Kirsten E. Eilertson + Statistics in Science: Best Practices.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Inference Sampling distributions Hypothesis testing.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
AP Statistics – Chapter 9 Test Review
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
Chapter 10 Simple Regression.
Today Today: Chapter 10 Sections from Chapter 10: Recommended Questions: 10.1, 10.2, 10-8, 10-10, 10.17,
The Basics of Regression continued
Today Concepts underlying inferential statistics
Richard M. Jacobs, OSA, Ph.D.
Introduction to Regression Analysis, Chapter 13,
Correlation & Regression
Are the results valid? Was the validity of the included studies appraised?
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Overview Definition Hypothesis
Introductory Statistical Concepts. Disclaimer – I am not an expert SAS programmer. – Nothing that I say is confirmed or denied by Texas A&M University.
Analysis & Interpretation: Individual Variables Independently Chapter 12.
Copyright © 2010 Pearson Education, Inc. Chapter 23 Inference About Means.
Copyright © 2009 Pearson Education, Inc. Chapter 23 Inferences About Means.
Slide 23-1 Copyright © 2004 Pearson Education, Inc.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Statistical Techniques I EXST7005 Review. Objectives n Develop an understanding and appreciation of Statistical Inference - particularly Hypothesis testing.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
T tests comparing two means t tests comparing two means.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Quantitative Research Design and Statistical Analysis.
CHAPTER 18: Inference about a Population Mean
A Broad Overview of Key Statistical Concepts. An Overview of Our Review Populations and samples Parameters and statistics Confidence intervals Hypothesis.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Issues concerning the interpretation of statistical significance tests.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: One-way ANOVA Marshall University Genomics Core.
The Single-Sample t Test Chapter 9. t distributions >Sometimes, we do not have the population standard deviation. (that’s actually really common). >So.
Chapter 10 The t Test for Two Independent Samples
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 22, Slide 1 Chapter 22 Inferences about Means.
 Assumptions are an essential part of statistics and the process of building and testing models.  There are many different assumptions across the range.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
AP STATISTICS LESSON 11 – 1 (DAY 2) The t Confidence Intervals and Tests.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
CHAPTER 9 Testing a Claim
Wednesday, March 15, 2017 Warm-up 80% 85% 92%
Chapter 23 Comparing Means.
CHAPTER 9 Testing a Claim
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 22 Inference About Means.
Inferences About Means
Chapter 23 Inference About Means.
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

+ Gladstone Bioinformatics Core Kirsten E. Eilertson + Statistics in Science: Best Practices

+ Our Goal “thoughtful” “insightful” “rigorous” statistical analyses Meaningful and solid inference which can be the basis of future work Our Challenges Every application is different Precedents can be a problem P-value centric publication system Reproducibility

+ Reporting and Reproducibility Reproducibility Crisis!!! Dr. Ioannidis (2005) PLoS Medicine

+ Our Goal “thoughtful” “insightful” “rigorous” statistical analyses Meaningful and solid inference which can be the basis of future work Our Challenges Every application is different Precedents can be a problem P-value centric publication system Reproducibility Discussion Today: Reporting results Power and Experimental Design Outliers

+ Guidelines for reporting Resources: Annals of Internal Medicine statistical_reporting.htm statistical_reporting.htm American Physiological Society ‘Describe the statistical methods with enough detail to enable a knowledgeable reader with access to the original data to verify the reported results’ (Bailar & Mosteller, 1988, p. 266)

+ Guidelines for reporting ‘The design of an experiment, the analysis of its data, and the communication of the results are intertwined. In fact, design drives analysis and communication.’ Always report the test statistic, the degrees of freedom, the test value, and the P-value that the result occurred at chance under the null hypothesis. Report how assumptions were checked (e.g. histograms of residuals, tests of normality, etc.) Provide a clear description of the design of your study or experiment; state the null hypothesis and alternative

+ Guidelines for reporting Control for multiple comparisons. Report variability using a standard deviation (not standard error). Avoid sole reliance on statistical hypothesis testing, such as the use of P values, which fails to convey important quantitative information. Report uncertainty about scientific importance using a confidence interval. Interpret each main result by assessing the numerical bounds of the confidence interval and by considering the precise p value.

+ Experimental Design& Power The appropriate analysis depends on the design! Can I peek at my data? Can I add more samples later? Sequential or Adaptive design Multiple testing/Gambler’s ruin Prism Graphpad Example: es/prism/6/statistics/index.ht m?stat_why_you_shouldnt_rec ompute_p_v.htm

+ A stochastic process Error Statistics Blog Discussion Not an “Argument from intentions” but really a “Probabative capacity of the test”.

+ Power analysis Uses: Pilot studies! (don’t forget to control for multiple comparisons) Detectable Effect Size (when non-significant result) Consider confidence intervals instead From the American Statistician (2001)

+ Outliers measurement or model error? Reasons for concern Increases the estimated standard deviation May indicate the model (e.g. assumption of normality) is not correct May lead to model misspecification Biased parameter estimation

+ Methods Detection Visual inspection Grubbs Test (assumes Normality) Chauvenet’s criterion (assumes Normality) Dixon’s Q test (assumes Normality) Based on interquartile range measure 2 standard deviations

+ Methods Analysis Delete the outlier Trimmed Mean/Winsorized Mean Weighted regression techniques Do nothing Report with & without outliers Arguments for keeping Methods for identification does not make the practice of deleting scientifically or methodologically sound Minimal effect on estimates/model (low influence)

+ Influential Points Outlier Leverage Influence

+ Outliers: Decide whether or not deleting data points is warranted: Do not delete data points just because they do not fit your preconceived model. You must have a good, objective reason for deleting data points. Implausible; inaccuracy in measurement; from a different population If you delete any data after you've collected it, justify and describe it in your reports. If you are not sure what to do about a data point, analyze the data twice — once with and once without the data point — and report the results of both analyses.