The future is a vain hope, the past is a distracting thought

Slides:



Advertisements
Similar presentations
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Advertisements

Happiness comes not from material wealth but less desire. 1.
Chapter 16 Introduction to Nonparametric Statistics
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Confidence Interval and Hypothesis Testing for:
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Topic 2: Statistical Concepts and Market Returns
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Inference about a Mean Part II
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 2 Simple Comparative Experiments
Inferences About Process Quality
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
5-3 Inference on the Means of Two Populations, Variances Unknown
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
AM Recitation 2/10/11.
Hypothesis Testing.
1 The future is a vain hope, the past is a distracting thought. Uphold our loving kindness at this instant, and be committed to our duties and responsibilities.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Education 793 Class Notes T-tests 29 October 2003.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 21/09/2015 7:46 PM 1 Two-sample comparisons Underlying principles.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
B AD 6243: Applied Univariate Statistics Hypothesis Testing and the T-test Professor Laku Chidambaram Price College of Business University of Oklahoma.
1 Treat everyone with sincerity, they will certainly appear loveable and friendly.
Ordinally Scale Variables
Confidence intervals and hypothesis testing Petter Mostad
Nonparametric Statistics
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
1 Analysis of Variance & One Factor Designs Y= DEPENDENT VARIABLE (“yield”) (“response variable”) (“quality indicator”) X = INDEPENDENT VARIABLE (A possibly.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 1 Introduction to Statistics. Section 1.1 Fundamental Statistical Concepts.
1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 7 l Hypothesis Tests 7.1 Developing Null and Alternative Hypotheses 7.2 Type I & Type.
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Lecture Nine - Twelve Tests of Significance.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
This Week Review of estimation and hypothesis testing
Chapter 2 Simple Comparative Experiments
Data Analysis and Interpretation
When we free ourselves of desire,
Inferences on Two Samples Summary
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Statistical inference
Happiness comes not from material wealth but less desire.
CHAPTER 10 Comparing Two Populations or Groups
What are their purposes? What kinds?
CHAPTER 10 Comparing Two Populations or Groups
Treat everyone with sincerity,
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Comparing Two Proportions
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 9 Estimation: Additional Topics
CHAPTER 10 Comparing Two Populations or Groups
Statistical inference
Presentation transcript:

The future is a vain hope, the past is a distracting thought The future is a vain hope, the past is a distracting thought. Uphold our loving kindness at this instant, and be committed to our duties and responsibilities right now.

Applied Statistics Using SAS and SPSS Topic: Hypothesis Testing By Prof Kelly Fan, Cal State Univ, East Bay

Hypothesis Testing A statistical hypothesis is an assertion or conjecture concerning one or more populations. Agenda: Types of tests Types of errors P-value Summary of tests Assumption checking

Types of Tests

Types of Tests

Types of Tests

Type II Error, or “ Error” Type I Error, or “ Error” Types of Errors H0 true H0 false Type II Error, or “ Error” Good! (Correct!) we accept H0 Type I Error, or “ Error” Good! (Correct) we reject H0

= Probability of Type I error = P(rej. H0|H0 true)  = Probability of Type II error = P(acc. H0|H0 false) We often preset , called significance level. The value of  depends on the specifics of the H1 (and most often in the real world, we don’t know these specifics).

Suppose the Critical Value = 141: EXAMPLE: H0 :  < 100 H1 :  >100 Suppose the Critical Value = 141: X  =100 C=141

These are values corresp.to a value of 25 for the Std. Dev. of X  = P (X < 141/= 150) = .3594  = 150 What is ? 141  = 150 These are values corresp.to a value of 25 for the Std. Dev. of X  = P (X < 141/= 160) = .2236  = 160 141  = 160  = 170  = P (X < 141/= 170) = .1230  = 170 141  = P (X < 141/= 180)  = 180 = .0594  = P (X < 141|H0 false) 141  = 180

Note:. Had  been preset at. 025 (instead of Note: Had  been preset at .025 (instead of .05), C would have been 149 (and  would be larger); had  been preset at .10, C would have been 132 and  would be smaller.  and  “trade off”.

P Value Definition: the probability that we reject Ho when Ho is true based on the observed data Idea: the largest “risk” we pay to reject H0 Alternate name: the observed type I error rate / the observed significance level When will we reject Ho ? What is the formula to calculate the largest risk?

Steps of Hypothesis Tests Set up Ho and Ha properly Preset a level (the significant level) Select an appropriate test Calculate its p-value Reject Ho if p-value < or = the significant level; otherwise fail to reject Ho

Set Up Hypothesis Properly Conjecture: The fraction of defective product in a certain process is at most 10%. Which error is more seriously? Incorrectly claim this conjecture is true? false? The “=“ sign must be in Ho

One Population

Two Populations

Assumption Checking Tests/graphs for normality Tests for equal variances

Example: Mortar Strength The tension bond strength of cement mortar is an important characteristic of the product. An engineer is interested in comparing the strength of a modified formulation in which polymer latex emulsions have been added during mixing to the strength of the unmodified mortar. The experimenter has collected 10 observations on strength for the modified formulation and another 10 observations for the unmodified formulation.

Example: Mortar Strength Modified Unmodified 16.85 17.50 16.40 17.63 17.21 18.25 16.35 18.00 16.52 17.86 17.04 17.75 16.96 18.22 17.15 17.90 16.59 17.96 16.57 18.15

SAS/SPSS Data Input SPSS: One variable one column in the work sheet SAS: One variable one name

Normality Tests/Plots SAS: PROC UNIVARIATE DATA=** NORMAL PLOT; Tests for Normality Test --Statistic--- -----p Value------ Shapiro-Wilk W 0.918255 Pr < W 0.0917 Kolmogorov-Smirnov D 0.134926 Pr > D >0.1500 Cramer-von Mises W-Sq 0.081542 Pr > W-Sq 0.1936 Anderson-Darling A-Sq 0.537514 Pr > A-Sq 0.1503

Normality Tests/Plots SPSS: Analyze >> Descriptive Statistics >> Explore >> Plots , Normality plots with tests

One-sample t tests SAS: PROC UNIVARIATE DATA=** NORMAL PLOT MU0=*; Tests for Location: Mu0=18 Test Statistic p Value Student's t t -4.48366 Pr > |t| 0.0003 Sign M -6.5 Pr >= |M| 0.0044 Signed Rank S -79.5 Pr >= |S| 0.0005 SPSS: Analyze >> Compare means >> One sample t test , choose Test variable, Test value

Two-sample t Tests and Equal-variance Tests SAS: PROC TTEST DATA=** ;

Two-sample Normality Tests/Plots SAS: PROC UNIVARIATE DATA=** NORMAL PLOT; BY group-var; VAR testing-var; SPSS: Analyze >> Descriptive Statistics >> Explore , choose Dependent , Factor lists >> Plots , Normality plots with tests SPSS output:

Two-sample t Tests and Equal-variance Tests SPSS: see below; choose test and grouping variables

Research Question A researcher claims that a new series of math courses for elementary school is more effective than the current one. Two (1st grade) classes of students are selected to perform an experiment to verify this claim. How would you conduct the experiment to avoid confounding variables as much as possible?

Paired Samples If the same set of sources are used to obtain data representing two populations, the two samples are called paired. The data might be paired: As a result of the data from certain “before” and “after” studies From matching two subjects to form “matched pairs”

Tests for Paired Samples Calculate the pair differences Proceed as in one sample case Notes: SAS: all variables must be included in data SPSS: create/calculate all variables we need

Inferences about mean when “beyond the scope” When population is nonnormal and n is small, how to do inferences about m: 1). Non-parametric tests 2). (Optional) Use Bootstrap methods to simulate the sampling distribution of t test statistic and then the simulated distribution to find an (approximate) C.I. and p-value

Non-parametric Tests Independent samples: Wilcoxon Rank Sum Test (also called Manny-Whitney test) Assumption: two distributions of the same shape Paired samples/One sample: Wilcoxon Signed-Rank Test Assumption: a symmetric distribution (of the differences for paired samples)

Introduction to Bootstrap Methods How to simulate the sampling distribution of a given statistic, say t, based on a given sample of size n: Pretend the original sample is the entire population Select a random sample of size n from the original sample (now the population) with replacement ; this is called a bootstrap sample Calculate the t value of the bootstrap sample, t* Repeat steps 2, 3 many times, 1000 or more, say B times. Use the obtained t* values to obtain an approximation to the sampling distribution

Review: Confidence Interval

One Population

Two Populations