Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Hypothesis Testing Steps in Hypothesis Testing:
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Hypothesis Testing IV Chi Square.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 7-3 Estimating a Population Mean:  Known Created by.
8-4 Testing a Claim About a Mean
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Chapter 7 Hypothesis Testing 7-1 Overview 7-2 Fundamentals of Hypothesis Testing.
Hypothesis Testing with One Sample
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Overview Definition Hypothesis
Presented by Mohammad Adil Khan
Hypothesis Testing.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 8-6 Testing a Claim About a Standard Deviation or Variance.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Business Research Methods William G. Zikmund
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimating a Population Mean: σ Known 7-3, pg 355.
Say not, „I have found the truth," but rather, „I have found a truth.„ Kahlin Gibran “The Prophet” Hypothesis testing.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides 11 th Edition Chapter 7.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Section 8-5 Testing a Claim about a Mean: σ Not Known.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-4 Estimating a Population Mean:  Not Known.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
© Copyright McGraw-Hill 2004
Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Sec 8.5 Test for a Variance or a Standard Deviation Bluman, Chapter 81.
Chapters 6 & 7 Overview Created by Erin Hodgess, Houston, Texas.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population.
Section 8-6 Testing a Claim about a Standard Deviation or Variance.
Lecture Slides Elementary Statistics Twelfth Edition
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4: Estimation of a population mean   is not known  This section.
Chapter 13 Understanding research results: statistical inference.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Created by Erin Hodgess, Houston, Texas Section 7-1 & 7-2 Overview and Basics of Hypothesis Testing.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter Hypothesis Testing with One Sample 1 of © 2012 Pearson Education, Inc. All rights reserved.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
Lecture Nine - Twelve Tests of Significance.
Assumptions For testing a claim about the mean of a single population
Chapter 9 Hypothesis Testing.
Presentation transcript:

Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical hypothesis which specifies the population completely (i.e. the form of probability distribution and all parameters are known) is called a simple hypothesis. 2) Composite hypothesis : A statistical hypothesis which does not specifies the population completely (i.e. the form of probability distribution or some parameters remain unknown) is called a composite hypothesis.

Hypothesis Testing or Test of Hypothesis or Test of Significance : Hypothesis testing is a process of making a decision on whether to accept or reject an assumption about the population parameter on the basis of sample information at a given level of significance. Null Hypothesis : It is the assumption which we wish to test and whose validity is tested for possible rejection on the basis of sample information. It is denoted by Ho Alternative Hypothesis : It is the hypothesis which differs from the null hypothesis. It is not tested It is denoted by H1 or Ha Level of significance : It is the maximum probability of making a type I error when the null hypothesis is true as an equality.

For test of hypothesis involving large samples i.e. > 30 t-test It is usually expressed as % and is denoted by symbol α( called alpha) It is used as a guide in decision making. It is used to indicate the upper limit of the size of the critical region TEST STATISTIC Test Statistic Used for Z test For test of hypothesis involving large samples i.e. > 30 t-test For test of hypothesis involving small samples i.e. < 30 And if σ is unknown χ2 For testing the discrepancy between observed frequency and expected frequency without any reference to population parameter

Critical Region or Rejection Region : Critical region is the region which corresponds to a pre-determined level of significance. The set of value of the test statistic which leads to rejection of the null hypothesis is called region of rejection or Critical region of the test. Conversely the set of values of the test statistic which leads to the acceptance of Ho is called region of acceptance. Critical Value : It is that value of statistics which separate the critical region from the acceptance region. It lies at the boundary of the regions of acceptance and rejection. Size of critical region : The probability of rejecting a true null hypothesis is called as size of critical region.

Type I and type II error: The decision to accept or reject null hypothesis Ho is made on the basis of the information supplied by the sample data. There is always a chance of committing an error Type I error: This is an error committed by the test in rejecting a true null hypothesis. The probability of committing type I error is denoted by ‘α’ , the level of significance. Type II error : This is an error committed by the test in accepting a false null hypothesis. The probability of committing type II error is denoted by ‘β’.

Statistical decision of the test Ho is true Ho is false True situation Statistical decision of the test Ho is true Ho is false Correct decision Type I error Type II error Degrees of freedom: It means the number of variables for which one has freedom to choose. Need for degree of freedom: Influence the value of discrepancy between the observed and expected values

Case Formula for d.o.f Example In case of one sample [ say n=50] Sample size (n)-1 50-1=49 In case of two sample [say n1 =50,n2=60] (n1 - 1) + (n2 -1 ) Or n1 + n2 – 2 (50-1)+(60-1) = 108 (50+60-2) = 108 P –value : It is the probability , computed using the test statistic, that measure the support ( or lack of support) provided by the sample for the null hypothesis.

Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region

Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region

Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Regions

Critical Value Value (s) that separates the critical region from the values that would not lead to a rejection of H 0 Critical Value ( z score )

Left-tailed Test H0: µ ³ 200 H1: µ < 200 Points Left Values that Reject H0 Fail to reject H0 Values that differ significantly from 200 200

Right-tailed Test H0: µ £ 200 H1: µ > 200 Points Right Values that Fail to reject H0 Reject H0 Values that differ significantly from 200 200

a is divided equally between the two tails of the critical Two-tailed Test H0: µ = 200 H1: µ ¹ 200 a is divided equally between the two tails of the critical region Means less than or greater than Reject H0 Fail to reject H0 Reject H0 200 Values that differ significantly from 200

Lower Tail Test Upper Tail Test Two-Tail Test Hypothesis Ho : μ ≥ μo H1 : μ < μo Ho : μ ≤ μo H1 : μ > μo Ho : μ = μo H1 : μ ≠ μo Test Z Rejection Rule p-value approach Reject Ho if p-value ≤ α Rejection rule critical value approach Reject Ho if Z ≤ -Zα Reject Ho if Z ≥ Z α Reject Ho if Z ≤ -Zα or Z ≥ Z α

Student t Distribution If the distribution of a population is essentially normal, then the distribution of t = x - µ s n Student t distribution is usually referred to as the t distribution. is a Student t Distribution for all samples of size n. It is often referred to as a t distribution and is used to find critical values denoted by t/2.

t = x – µx n t-distribution (DF = 24) Assume the conjecture is true! Test Statistic: S / n Critical value = 1.71 * 8000/5 + 30000 = 32736 Fail to reject H0 Reject H0 30 K ( t = 0) 32.7 k ( t = 1.71 )

Margin of Error E for Estimate of  (With σ Not Known) Formula 7-6 where t/2 has n – 1 degrees of freedom. n s E = t/ 2 Table lists values for tα/2

Notation  = population mean = sample mean s = sample standard deviation n = number of sample values E = margin of error t/2 = critical t value separating an area of /2 in the right tail of the t distribution Smaller samples will have means that are likely to vary more. The greater variation is accounted for by the t distribution. Students usually like hearing about the history of how the Student t distribution got its name. page 350 of text See Note to Instructor in margin for other ideas.

Important Properties of the Student t Distribution 1. The Student t distribution is different for different sample sizes (see the following slide, for the cases n = 3 and n = 12). 2. The Student t distribution has the same general symmetric bell shape as the standard normal distribution but it reflects the greater variability (with wider distributions) that is expected with small samples. The Student t distribution has a mean of t = 0 (just as the standard normal distribution has a mean of z = 0). As the sample size n gets larger, the Student t distribution gets closer to the normal distribution.

Application of t-distribution To test the significance of the mean of random sample To test the significance of the difference between the mean of two independent samples To test the significance of the difference between the mean of two dependent samples of the paired observation To test the significance of an observed correlation coefficient.

Choosing the Appropriate Distribution Use the normal (z) distribution  known and normally distributed population or  known and n > 30 Use t distribution  not known and normally distributed population or  not known and n < 30 Chi Square Method Use a nonparametric method or bootstrapping Population is not normally distributed

Where O= observed frequencies ,E= Expected frequencies 𝛘2 Defined : It is pronounced as Chi-Square test is one of the simplest and most widely used non-parametric tests in statistical work. It describes the magnitude of the discrepancy between theory and observation. It is defined as 𝛘2 = Where O= observed frequencies ,E= Expected frequencies Rejection Rule : p- value approach : reject Ho if p-value ≤ α critical value approach :reject Ho if 𝛘2 ≥ 𝛘α2 where α is the level of significance with n rows and n columns provide (n-1)(m-1) degrees of freedom

Properties of the Distribution of the Chi-Square Statistic The chi-square distribution is not symmetric, unlike the normal and Student t distributions. As the number of degrees of freedom increases, the distribution becomes more symmetric. Different degrees of freedom produce different shaped chi-square distributions. Chi-Square Distribution Chi-Square Distribution for df = 10 and df = 20

Properties of the Distribution of the Chi-Square Statistic 1. The values of chi-square can be zero or positive, but they cannot be negative. 2. The chi-square distribution is different for each number of degrees of freedom, which is d.f = n – 1. As the number of degrees of freedom increases, the chi-square distribution approaches a normal distribution. In Table, each critical value of 2 corresponds to an area given in the top row of the table, and that area represents the cumulative area located to the right of the critical value.

Condition when applying chi-square test In the first place N must be reasonably large to ensure the similarity between theoretically correct distribution and our sampling distribution of 𝛘2 it is difficult to say what constitutes largeness, but as a general rule 𝛘2 test should not be used when N is less than 50, however few the cells No theoretical cell frequency should be small when the expected frequency are too small., the value of 𝛘2 will be overestimated and will result in too many restrictions of the null hypothesis The constraints on the cell frequencies if any should be linear