Synthesis and Review 3/26/12 Multiple Comparisons Review of Concepts Review of Methods - Prezi Essential Synthesis 3 Professor Kari Lock Morgan Duke University.

Slides:



Advertisements
Similar presentations
STAT 101 Dr. Kari Lock Morgan
Advertisements

Introducing Hypothesis Tests
Hypothesis Testing, Synthesis
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 101 Dr. Kari Lock Morgan SECTION 4.3, 4.5 Significance level (4.3)
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Today’s Agenda Review of ANOVA Module 9 Review for Exam 2 Please log in with your UMID and your participation will be graded by the number of questions.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Connecting Simulation- Based Inference with Traditional Methods Kari Lock Morgan, Penn State Robin Lock, St. Lawrence University Patti Frazer Lock, St.
Chapter 9 Hypothesis Testing.
Multiple Regression III 4/16/12 More on categorical variables Missing data Variable Selection Stepwise Regression Confounding variables Not in book Professor.
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Inference for Categorical Variables 2/29/12 Single Proportion, p Distribution Intervals and tests Difference in proportions, p 1 – p 2 One proportion or.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Confidence Intervals and Hypothesis Tests
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
ANOVA 3/19/12 Mini Review of simulation versus formulas and theoretical distributions Analysis of Variance (ANOVA) to compare means: testing for a difference.
Hypothesis Testing III 2/15/12 Statistical significance Errors Power Significance and sample size Section 4.3 Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Randomization Tests Dr. Kari Lock Morgan PSU /5/14.
More Randomization Distributions, Connections
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
Essential Synthesis SECTION 4.4, 4.5, ES A, ES B
Using Lock5 Statistics: Unlocking the Power of Data
Comparing Two Proportions
Statistics: Unlocking the Power of Data Lock 5 Afternoon Session Using Lock5 Statistics: Unlocking the Power of Data Patti Frazer Lock University of Kentucky.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Topic 7 - Hypothesis tests based on a single sample Sampling distribution of the sample mean - pages Basics of hypothesis testing -
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Cautions STAT 250 Dr. Kari Lock Morgan SECTION 4.3, 4.5 Type I and II errors (4.3) Statistical.
Multiple Regression I 4/9/12 Transformations The model Individual coefficients R 2 ANOVA for regression Residual standard error Section 9.4, 9.5 Professor.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Inference after ANOVA, Multiple Comparisons 3/21/12 Inference after ANOVA The problem of multiple comparisons Bonferroni’s Correction Section 8.2 Professor.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Cautions STAT 250 Dr. Kari Lock Morgan SECTION 4.3, 4.5 Errors (4.3) Multiple testing.
CHAPTER 9 Testing a Claim
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
Introducing Inference with Bootstrapping and Randomization Kari Lock Morgan Department of Statistical Science, Duke University with.
The z test statistic & two-sided tests Section
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Bayesian Inference, Review 4/25/12 Frequentist inference Bayesian inference Review The Bayesian Heresy (pdf)pdf Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Synthesis and Review for Exam 2.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Statistics: Unlocking the Power of Data Lock 5 Section 4.1 Introducing Hypothesis Tests.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Randomization Tests PSU /2/14.
Introducing Hypothesis Tests
Testing Hypotheses About Proportions
When we free ourselves of desire,
Stat 217 – Day 28 Review Stat 217.
Introducing Hypothesis Tests
Testing Hypotheses About Proportions
CHAPTER 18: Inference about a Population Mean
Presentation transcript:

Synthesis and Review 3/26/12 Multiple Comparisons Review of Concepts Review of Methods - Prezi Essential Synthesis 3 Professor Kari Lock Morgan Duke University

Study and prepare for Exam 2 (Wednesday and Thursday) To Do

An exam absence is only excused if a short term illness form is submitted before the exam In this case, your final exam grade will be substituted Keep in mind that you will be responsible for a LOT more material on the final exam, and it is already worth 25% of your grade You can ONLY take the lab exam during your designated section. Set two alarms if needed. Exam Policies

Any cheating (either on the in-class exam or the lab exam) will result in an automatic 0, and will be treated as a serious case of academic misconduct This includes, but is not limited to, Looking at someone else’s exam or computer screen For the in-class exam, using pages of notes prepared by someone else Communicating (in any form) with anyone besides myself or your TAs during the exam Communicating (in any way) with any classmates about the lab exam, or sharing any code or materials related to the lab exam, before 4pm on Thursday, 3/29 Exam Policies

Analytic Approaches to Basketball Mike Zarren (Boston Celtics) Tuesday, 3/27, 5pm in 2231 French Family Science Michael Zarren is the Boston Celtics’ Assistant General Manager and Associate Team Counsel. Mike is widely recognized as one of the leaders in the field of advanced statistical analysis of basketball players and teams, and is an important part of the team’s strategic planning and player personnel evaluation processes. Mike is also the team’s salary cap expert and lead in- house counsel, and is responsible for the development of new technologies for team use, including the team’s statistical database and video archive/delivery system. Read more here: Talk

You have LOTS of opportunities for help! Monday, 3 – 4 pm (Prof Morgan) Monday, 4 – 6 pm (Christine) Tuesday, 3 – 6 pm (Prof Morgan) Tuesday, 6 – 8 pm (Yue) Office Hours before Exam

RStudio no longer supports importing data from a google doc  Importing from a Google Doc

Extrasensory Perception Is there such a thing as ESP? Let’s find out by conducting our own experiments!

Extrasensory Perception Get into pairs. “Randomly” choose A, B, C, or D, and write it down Try to transmit this information to your partner, without communicating the letter in any way that can be perceived by any of the five senses! Partner: guess the letter. Repeat this 10 times each, and keep track of the number of correct guesses. Once you have n = 20, come to the board and plot your sample proportion Test whether your experiment provides evidence of ESP

Extrasensory Perception Did your experiment provide evidence of extrasensory perception, using  = 0.05? (a) Yes (b) No

Test for a Proportion Which of the following ways are appropriate to test whether your sample proportion is significantly different from p = ¼? a)Randomization Test (only) b)Normal distribution (only) c)t-distribution (only) d)Either (a) or (b) e)Either (a), (b), or (c)

Randomization Distribution IF there is no such thing as ESP, then you all just created a randomization distribution.

Extrasensory Perception If there is no such thing as ESP, what percentage of experiments on ESP will get results that are significant, using  = 0.05? (a) None (b) All of them (c) 95% (d) 5%

Author: JB Landers

Author: JB Landers

Author: JB Landers

Multiple Comparisons Consider a topic that is being investigated by research teams all over the world  5% of teams are going to find something significant, even if the null is true

Multiple Comparisons Consider a research team/company doing many hypothesis tests  5% of tests are going to be significant, even if the nulls are all true

Multiple Comparisons Consider an experiment that randomizes units to treatment groups, and then looks at many response variables  5% of variables are going to be significantly different between the groups, just by random chance

Pairwise Comparisons Consider a study with many different treatment groups, and so many possible pairwise comparisons  5% of comparisons are going to be significantly different, even if no differences actually exist (This is the main reason for only testing pairwise comparisons if the overall ANOVA is found to be significant)

Publication Bias publication bias: usually, only the significant results get published The one study that turns out significant gets published, and no one knows about all the insignificant results

Jelly Beans Cause Acne!

This is a serious problem The most important thing is to simply be aware of this issue, and not to trust claims that are obviously one of many tests (unless they specifically mention an adjustment for multiple testing) Multiple Comparisons

REVIEW

Was the sample randomly selected? Possible to generalize to the population Yes Should not generalize to the population No Was the explanatory variable randomly assigned? Possible to make conclusions about causality Yes Can not make conclusions about causality No Data Collection

Confidence Interval A confidence interval for a parameter is an interval computed from sample data by a method that will capture the parameter for a specified proportion of all samples A 95% confidence interval will contain the true parameter for 95% of all samples

How unusual would it be to get results as extreme (or more extreme) than those observed, if the null hypothesis is true? If it would be very unusual, then the null hypothesis is probably not true! If it would not be very unusual, then there is not evidence against the null hypothesis Hypothesis Testing

The p-value is the probability of getting a statistic as extreme (or more extreme) as that observed, just by random chance, if the null hypothesis is true The p-value measures evidence against the null hypothesis p-value

Hypothesis Testing 1.State Hypotheses 2.Calculate a test statistic, based on your sample data 3.Create a distribution of this test statistic, as it would be observed if the null hypothesis were true 4.Use this distribution to measure how extreme your test statistic is

Distribution of the Sample Statistic 1.Sampling distribution: distribution of the statistic based on many samples from the population 2.Bootstrap Distribution: distribution of the statistic based on many samples with replacement from the original sample 3.Randomization Distribution: distribution of the statistic assuming the null hypothesis is true 4.Normal, t,  2, F: Theoretical distributions used to approximate the distribution of the statistic

Sample Size Conditions For large sample sizes, either simulation methods or theoretical methods work If sample sizes are too small, only simulation methods can be used

For confidence intervals, you find the desired percentage in the middle of the distribution, then find the corresponding value on the x-axis For p-values, you find the value of the observed statistic on the x-axis, then find the area in the tail(s) of the distribution Using Distributions

Confidence Intervals

Return to original scale with

Hypothesis Testing

General Formulas When performing inference for a single parameter (or difference in two parameters), the following formulas are used:

Standard Error The standard error is the standard deviation of the sample statistic The formula for the standard error depends on the type of statistic (which depends on the type of variable(s) being analyzed)

Multiple Categories These formulas do not work for categorical variables with more than two categories, because there are multiple parameters For one or two categorical variables with multiple categories, use  2 tests For testing for a difference in means across multiple groups, use ANOVA

Inference Methods