Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Inference in the Simple Regression Model
SADC Course in Statistics (Session 09)
Tests of Hypotheses Based on a Single Sample
Chapter 7 Sampling and Sampling Distributions
Tests of Significance and Measures of Association
“Students” t-test.
Chi-square and F Distributions
Statistical Inferences Based on Two Samples
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Quantitative Skills 4: The Chi-Square Test
Chapter 7 Sampling and Sampling Distributions
Inferences About Process Quality
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
Chapter 9 Hypothesis Testing.
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 11 Inferences About Population Variances n Inference about a Population Variance n.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Lecture 13 Chi-square and sample variance Finish the discussion of chi-square distribution from lecture 12 Expected value of sum of squares equals n-1.
The binomial applied: absolute and relative risks, chi-square.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Chapter 7 Inferences Based on a Single Sample: Tests of Hypotheses.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
The p-value approach to Hypothesis Testing
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Hypothesis Testing: One-Sample Inference
Statistical Inferences for Population Variances
Inference concerning two population variances
Chapter 9 Hypothesis Testing.
Lecture Nine - Twelve Tests of Significance.
One-Sample Tests of Hypothesis
Hypothesis Testing I The One-sample Case
8-1 of 23.
P-values.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
John Loucks St. Edward’s University . SLIDES . BY.
Hypothesis Testing: Hypotheses
Week 11 Chapter 17. Testing Hypotheses about Proportions
Introduction to Inference
Chapter 9 Hypothesis Testing.
Chapter 11 Inferences About Population Variances
Chapter 9 Hypothesis Testing.
Introduction to Inference
Statistical Process Control
Presentation transcript:

Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis Measuring the Strength of evidence by P-value Pre-setting significance level Conclusion Confidence interval

Some general thoughts about hypothesis testing A claim is any statement made about the truth; it could be a theory made by a scientist, or a statement from a prosecutor, a manufacture or a consumer Data cannot prove a claim however, because there May be other data that could contradict the theory Data can be used to reject the claim if there is a contradiction to what may be expected Put any claim in the null hypothesis H 0 Come up with an alternative hypothesis and put it as H 1 Study data and find a hypothesis testing statistics which is an informative summary of data that is most relevant in differentiating H 1 from H 0.

Testing statistics is obtained by experience or statistical training; it depends on the formulation of the problem and how the data are related to the hypothesis. Find the strength of evidence by P-value : from a future set of data, compute the probability that the summary testing statistics will be as large as or even greater than the one obtained from the current data. If P-value is very small, then either the null hypothesis is false or you are extremely unlucky. So statistician will argue that this is a strong evidence against null hypothesis. If P-value is smaller than a pre-specified level (called significance level, 5% for example), then null hypothesis is rejected.

Back to the microarray example H o : true SD denote 0.1 by 0 ) H 1 : true SD > 0.1 (because this is the main concern; you dont care if SD is small) Summary : Sample SD (s) = square root of ( sum of squares/ (n-1) ) = 0.18 Where sum of squares = ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 = 0.1, n=4 The ratio s/ is it too big ? The P-value consideration: Suppose a future data set (n=4) will be collected. Let s be the sample SD from this future dataset; it is random; so what is the probability that s/ will be As big as or bigger than 1.8 ? P(s/ 0 >1.8)

P(s/ 0 >1.8) But to find the probability we need to use chi- square distribution : Recall that sum of squares/ true variance follow a chi-square distribution ; Therefore, equivalently, we compute P ( future sum of squares/ 0 2 > sum of squares from the currently available data/ 0 2 ), (recall 0 is The value claimed under the null hypothesis) ;

Once again, if data were generated again, then Sum of squares/ true variance is random and follows a chi-squared distribution with n-1 degrees of freedom; where sum of squares= sum of squared distance between each data point and the sample mean Note : Sum of squares= (n-1) sample variance = (n-1)(sample SD) 2 For our case, n=4; so look at the chi-square distribution with df=3; from table we see : The value computed from available data =.10/.01=10 (note sum of squares=.1, true variance =.1 2 P-value = P(chi-square random variable> computed value from data)=P (chisquare random variable > 10.0) P-value is between.025 and.01, reject null hypothesis at 5% significance level

Confidence interval A 95% confidence interval for true variance 2 is (Sum of squares/C 2, sum of squares/C 1 ) Where C 1 and C 2 are the cutting points from chi- square table with d.f=n-1 so that P(chisquare random variable > C 1 )=.975 P(chisquare random variable>C 2 )=.025 This interval is derived from P( C 1 < sum of squares/ 2 <C 2 )=.95 For our data, sum of squares=.1 ; from d.f=3 of table, C1=.216, C2=9.348; so the confidence interval of 2 is to.4629; how about confidence interval of