Class 09 Exam1 Prep Things you should know. Exam Details 75 minutes Allowed – Any book, the course website and all files linked to it, your notes, excel,

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Block 4 Seats.
The Behavior of Sums and Averages Amore Frozen Foods
Case: The Roulette Wheel Goodness of Fit Tests
Class 05. Using the Normal Intro to Descriptive Statistics
Thursday, September 12, 2013 Effect Size, Power, and Exam Review.
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
Normal Distribution The Normal Distribution is a density curve based on the following formula. It’s completely defined by two parameters: mean; and standard.
Statistics Are Fun! Analysis of Variance
LARGE SAMPLE TESTS ON PROPORTIONS
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Lecture Slides Elementary Statistics Twelfth Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chapter 8 Introduction to Hypothesis Testing
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Discrete and Continuous Probability Distributions.
Class 04. Wunderdog and the Normal Distribution
Chi-Square Tests and the F-Distribution
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Presentation 12 Chi-Square test.
How Can We Test whether Categorical Variables are Independent?
CHAPTER 6 Random Variables
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
8 - 1 © 2003 Pearson Prentice Hall Chi-Square (  2 ) Test of Variance.
Fundamentals of Hypothesis Testing: One-Sample Tests
Statistical Analysis Statistical Analysis
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.
More About Significance Tests
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Chapter 19 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A dichotomous outcome is one that has only two possibilities (e.g., pass or fail;
1 Statistical Inference. 2 The larger the sample size (n) the more confident you can be that your sample mean is a good representation of the population.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Chapter 21: More About Tests “The wise man proportions his belief to the evidence.” -David Hume 1748.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Chapter 20 Testing hypotheses about proportions
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Copyright © 2010 Pearson Education, Inc. Slide
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the.
Slide 21-1 Copyright © 2004 Pearson Education, Inc.
AP Statistics Section 11.1 B More on Significance Tests.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 6 Random Variables 6.3 Binomial and Geometric.
Dan Piett STAT West Virginia University Lecture 12.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
© Copyright McGraw-Hill 2004
URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Section 10.2: Tests of Significance Hypothesis Testing Null and Alternative Hypothesis P-value Statistically Significant.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
1 Outline 1.Count data 2.Properties of the multinomial experiment 3.Testing the null hypothesis 4.Examples.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Chap 5-1 Discrete and Continuous Probability Distributions.
Chapter 7 (Probability)
Presentation transcript:

Class 09 Exam1 Prep Things you should know

Exam Details 75 minutes Allowed – Any book, the course website and all files linked to it, your notes, excel, word, calculators. Not Allowed – Communication with others (of any kind using any device) – Websites other than the course. Short answers, lots of partial credit. Some easy. Some difficult.

Class 01 Jefferson Had Girls We used BINOMDIST(false) to find the probabilities of 0,1,2,3,4 girls in four children IF the coin flip model is true. We constructed a table of Observed vs Expected Counts for the 31,595 Danish Families. You should be able to finish the hypothesis test. You should be able to do the hypothesis test for a p other than 0.5.

Class 02 Prob, Pmfs, Binomial The first probability problem – What if the situation were slightly different? How probability works for two either/or events. – Athlete and Gender, Disease and Test. – Either build a 2x2 table of counts…or draw a prob tree…or use formulas. – What if one of the events had three outcomes? Positive, Negative, Inconclusive Strain A, Strain B, no disease University Athlete, Intramural Athlete, Non-Athlete

Class 02 Prob, Pmfs, Binomial BINOMDIST(X,n,p,false) – Applying the rules of probability to calculate the probability of X successes in n independent trials. – This is a PMF – Characteristics of Pmfs A schedule assigning the unit of prob to the possible values of X. Mean, Mode, Median, Std Deviation, Variance. – Binomial Mean is n*p – Binomial Standard Deviation is [n*p*(1-p)]^.5 – Most of our problems had p=.5. What if p was different?

Class 03 Lady Tasting Tea Hypothesis Testing – Formulate H0 and Ha – Pick alpha (usually 0.05) – Identify and calculate the test statistics So far it has either been number correct (proportion correct) or calculated chi-squared. – Calculate the p-value…the prob of observing a test statistic more extreme than the one observed if H0 is true. So far we’ve used the binomial, the normal, the chi-squared to calculate p-values. – Reject H0 (in favor of Ha) if p-value is less than Say the result is statistically significant. (Our result is rare if H0 is true).

Class 04 Wunderdog and Normal As n gets big, the binomial looks like the normal. NORMDIST(X,μ,σ,TRUE) – As an approximation to the binomial set μ=n*p and σ=[n*p*(1-p)]^.5 – Normal is a PDF – Normal is a family of distributions….but all have identical properties…see next slide.

EMBS Fig 6.4, p 249 You can use Normdist(X,0,1,true) Norminv(p,0,1) For more detail

Normal continued NORMDIST(X,μ,σ,false) – The height of the normal density curve. We have never used it…and never will. NORMINV(p,μ,σ) – finds the x value such that P(X<x) = p Lots of variations of questions you can use the normal to answer. – What is p? – What X gives a specified p? – What is σ? Lots of decisions to make (what target lorex should use)

Class 06 Descriptive Statistics Each of the characteristics of a prob distribution (mean, median, mode, var, stdev, skew) has a corresponding summary descriptive statistic Know how to calculate summary descriptive statistics and what they tell you. – They are only useful for NUMERICAL (not categorical) scaled variables. EXCEPTION: If there are only two categories (Male/Female), (Right/Wrong), (Athlete/Not) then you can change to a 1/0 number and descriptive statistics make sense. Descriptive statistics make some sense for birth months…especially in the soccer context where birth month number predicts tournament participation.

Class 07 Roulette and GOF Observations of a categorical variable with multiple categories (n=904) are summarized using a table of counts. – Use Countif or pivottable to calculate the counts. If we have an H0 that leads to expected counts, then we can perform a chi-squared GOF test. – H0: All categories are equally probable (roulette and soccer birth month) Know how to perform and interpret a GOF test.

Class 07 Lorex GOF Start with n=144 observations of a numerical variable. Create (somewhat arbitrary) BINS, and a table of summary counts. H0: the data came from N(10.2,.16) – Under H0, we can calculate the expected counts in each bin. This is another use of the chi-squared GOF test. – Numerical data, bins and observed counts, H0 and expected counts, GOF test.

Class 08 2-tailed tests and sample proportions P-value is always the probability of observing a test statistic AS EXTREME as ours under H0. – The interpretation of “extreme” depends on Ha. Ha: p>.5, extreme means X ≥ 8. Ha: p≠.5, extreme means X ≥ 8 or X ≤ 2 – The calculation of the p-value depends on Ha. The p-value for a 2-tailed test will usually be twice that from a one-tailed test. – It is more difficult to find statistical significance when you are less sure what you are looking for.

There is often another way… X is Binomial(n,p) X is N(n*p,[n*p*(1-p)]^.5) = X/n is N(p,[p*(1-p)/n]^.5) – Going from binomial to normal is an approximation good when n is big – The last two ARE EXACLTY THE SAME.

Suppose n=100, p=.5 X is N(50,5) X is N(0.5,0.05) [n*p*(1-p)]^.5

When interpreting Statistics…. It’s all about the n! – 60% correct is unimpressive if n=10 and very impressive if n=100. – Even with the chi-squared GOF test, n matters Distance = (15-10)^2/10 = 2.5 Distance = ( )^2/100 = 25 n is the total count (907 for roulette, 288 for soccer, 144 for Lorex)

ExampleH0HaTest Statistic Distribution of test statistic given H0 p-valuepvalueconclusion LTTp=0.5p>0.5X=number correctBinomial=1-BINOMDIST(7,10,0.5,TRUE)0.055fail to reject Wunderdogp=0.5p>0.5X=number correctBinomial=1-BINOMDIST(86,149,0.5,TRUE)0.024reject Wunderdogp=0.5p>0.5X=number correctNormal=1-NORMDIST(87,74.5,6.10,TRUE)0.020reject Wunderdogp=0.5p>0.5 pbar = sample proportion correct Normal=1-NORMDIST(0.586,0.5,0.041,TRUE)0.020reject Buttered toast p=0.5p ≠.5X=number butter upBinomial =1-binomdist(28,48,.5,true) + binomdist(19,48,.5,true) 0.097fail to reject Roulette Wheel All 38 p's = 1/38 they do not calculated chi-squared chi-squared with 37 dof =CHIDIST(31.2,37)0.737fail to reject Lorex Data came from N(10.2,.16) they did not calculated chi-squared chi-squared with 7 dof =CHIDIST(7.577,7)0.371fail to reject [149*.5*.5]^.5 [.5*.5/149]^.5 Hypothesis Tests we have come to love…….

Pfeifer’s Pfoibles How many will she get correct? Do I have the disease? – Give me probabilities as answers even though I did not explicitly ask for them. H0 is a statement about the process, not the outcomes. If p-value < 0.05 say …. – The result is statistically significant – We reject the null hypothesis. If p-value > 0.05 say … – The result is NOT statistically significant. – We fail to (or cannot) reject H0.