Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Decision Errors and Power
Random variable Distribution. 200 trials where I flipped the coin 50 times and counted heads no_of_heads in a trial.
Probability & Statistical Inference Lecture 6
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Hypothesis Testing I.
Chapter Sampling Distributions and Hypothesis Testing.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Experimental Evaluation
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Today Concepts underlying inferential statistics
Chapter 9: Introduction to the t statistic
Nonparametric or Distribution-free Tests
Inferential Statistics
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Choosing Statistical Procedures
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Hypothesis Testing.
Statistical inference: confidence intervals and hypothesis testing.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Fundamentals of Data Analysis Lecture 7 ANOVA. Program for today F Analysis of variance; F One factor design; F Many factors design; F Latin square scheme.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Chi-square Test of Independence Hypotheses Neha Jain Lecturer School of Biotechnology Devi Ahilya University, Indore.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Chapter 7 Inferences Based on a Single Sample: Tests of Hypotheses.
Elementary statistics for foresters Lecture 5 Socrates/Erasmus WAU Spring semester 2005/2006.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Academic Research Academic Research Dr Kishor Bhanushali M
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Chapter 10 The t Test for Two Independent Samples
© Copyright McGraw-Hill 2004
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Descriptive and Inferential Statistics
Chapter Nine Hypothesis Testing.
Logic of Hypothesis Testing
Unit 3 Hypothesis.
Sampling and Sampling Distributions
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Presentation transcript:

Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1

Program for today F Statistical hypothesis; F Parametric (standard) tests; F Nonparametric (distribution free) tests.

Statistical hypothesis n While attempting to make decisions some necessary assumptions or guesses about the populations or statements about the probability distribution of the populations made are called statistical hypothesis. n These assumptions are to be proved or disproved n A predictive statement usually put in the form of a null hypothesis and alternate hypothesis n Capable of being tested by scientific methods, that relates an independent variable to some dependent variable

Testing of statistical hypotheses n Researcher bets in advance of his experiment that the results will agree with his theory and cannot be accounted for by the chance variation involved in sampling n Procedures which enable researcher to decide whether to accept or reject hypothesis or whether observed samples differ significantly from expected results.

Testing of statistical hypotheses n Plan & conduct experiment so that if the results are not explained by the chance variation, theory is confirmed n Collect data n Set null hypotheses i.e. assume that results are due to chance alone n Use a theoretical sampling distribution n Obtain probability of sample data as if it is chance variation n If probability at 5 is less than some predetermined small percentage (say 1% or 5%) reject the null hypothesis and accept the alternate hypothesis

Procedure for hypothesis testing n Plan & conduct experiment so that if the results are not explained by the chance variation, theory is confirmed n Collect data n Set null hypotheses i.e. assume that results are due to chance alone n Use a theoretical sampling distribution n Obtain probability of sample data as if it is chance variation n If probability at 5 is less than some predetermined small percentage (say 0.1%, 1% or 5%) reject the null hypothesis and accept the alternate hypothesis

Type I and type II errors  The first kind of error that is possible involves the rejection of a null hypothesis that is actually true. This kind of error is called a type I error, and is sometimes called an error of the first kind.  The other kind of error that is possible occurs when we do not reject a null hypothesis that is false. This sort of error is called a type II error, and is also referred to as an error of the second kind.

Type I and type II errors  Error is determined in advance as level of significance for a given sample size  If we try to reduce type I error, the probability of committing type II error increases  Both type errors cannot be reduced simultaneously  Decision maker has to strike a balance / trade off examining the costs & penalties of both type errors

Null (H 0 ) & Alternative (H a ) hypotheses H 0 - while computing two methods assuming that both are equally good H a - a set of alternative to H 0 or rejecting the H 0 (what one wishes to prove)

The level of significance  Some percentage (usually 5%) chosen with great care, thought & reason so that how will be rejected when the sampling result (observed evidence) has a probability of < 0.05 of occurring if H 0 is true  Researcher is willing to take as much as a 5% risk of rejecting H 0  Significance level is the maximum value of the probability of rejecting H 0 when it is true  It is usually determined in advance, I.e., the probability of type I error (α) is assigned in advance and hence nothing can be done about it

Parametric tests Parametric tests allow you to make a number of requests for various statistical parameters. Examination of phenomena by calculating the parameters is a very effective way to learn, this is due to a concise and accurate form of the description. Parametric tests, despite its diversity, do not give answers to all the important questions, mainly because these tests can be applied if the tested quantity (the population) has normal distribution or very close to it. In addition, parametric tests, as the name suggests, describe a property of the phenomenon under study (test results), without giving sufficient grounds to formulate general conclusions.

Student's t test  It is based on t-distribution and only incase of small samples  Used for testing difference between means of two samples, coefficient of simple & partial correlations, etc  Using this test, we can test the null hypothesis as: H 0 :  =  0 while the alternative hypothesis is as follows : H 1 :    0

Student's t test In fact very few know the mean value and standard deviation of the general population, so we must be satisfied with estimate value using most frequently applied estimators - the average of the sample : and standard deviation inside the sample calculate with the aid of equation:

Student's t test We must calculate the statistics: which has the Student’s t distribution with n - 1 degrees of freedom (n – number of samples), provided that the population is normal or very close to it.

Student's t test So, if you want to check the null hypothesis of equality of the mean value for the sample with the average for the population, we use the Student's t- distribution tables and for ithe assumed level of confidence and read the critical value t , such that : Now compare this value t with the critical value t  and if :  |t|  t  then reject the null hypothesis ;  |t| < t  then there is no reason to reject the null hypothesis.

Student’s t test Example We know that the average light time of the bulb is   = 1059 hours. After making changes in the technology decided to see if these changes have not shortened the light time. The null hypothesis is therefore of the form H  :   =  , ie: the average burn time has not changed bulbs. For testing random sample of 10 light bulbs was taken, the results of these studies are presented in Table.

Student’s t test Example Lighting time

Student’s t test Example Read from the tables for a confidence level 0.95 critical value t a = 1.833, therefore there is no reason to reject the null hypothesis.

Student’s t test Exercise 12 farms was independently drawn in a village and the following values ​​ of crops of oats was obtained for them: 23.3, 22.1, 21.8, 19.9, 23.7, 22.3, 22.6, 21.5, 21.9, 22.8, 23.0, 22.2 At the level of significance of 5% test the hypothesis that the value of the average yield of oats in the whole village is 22,6 q/ha, alternative hypothesis is that the value of the average yield of oats is higher.

Chi-square test To perform the test one should : 1. Calculate the mean value and the standard deviationa according equation : 2. Calculate value of chi-square statistics: Test of variation of the general population

Chi-square test 3. For level of confidence a and degree of freedom k = n - 1 we must find in tables of chi-square distribution the critical value that satisfies the equation: Inequality       defines right-hand critical area. When during comparison of calculated value with the critical value we obtained that       the null hypothesis should be rejected. Otherwise, there is no reason to reject the null hypothesis.

Chi-square test 11 independent measurements were made of cast pipe diameter and following results were obtained: 50.2, 50.4, 50.6, 50.5, 49.9, 50.0, 50.3, 50.1, 50.0, 49.6, 50.6 mm At confidence level α = 95% we should test the hypothesis that the variance of the obtained diameter of the pipes is equal to 0.04 mm Example

Chi-square test Mean value: Standard deviation: The hypothetical value of the variance Example

Chi-square test Chi-square statistics is equal: Critical value for degree of freedom equal to k=n-1=11-1=10: so the critical value read from the tables is less than calculated one, therefore, the null hypothesis should be rejected. Example

Chi-square test In order to verify the accuracy of the measuring instrument 6 measurements were made of the same quantity and the following values ​​ were obtained: 1.017, 1,021, 1.015, 1.019, 1.022, The measurements are normally ditributed. At the confidence level α = 99% verify the hypothesis that the variance of the measurements is equal to Exercise

To be continued … !