Economics 5550/6550 Statistical Tools 2014. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's.

Slides:



Advertisements
Similar presentations
Overview of Inferential Statistics
Advertisements

Partly based on material by Sherry O’Sullivan
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Effect Size and Meta-Analysis
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
GG 313 Lecture 7 Chapter 2: Hypothesis Testing Sept. 13, 2005.
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Statistics Are Fun! Analysis of Variance
Topic 2: Statistical Concepts and Market Returns
Hypothesis testing & Inferential Statistics
Economics 5550 Statistical Tools. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's consider health.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Class notes for ISE 201 San Jose State University
Data and Analysis Let’s Recall Some Numbers Early on, we wanted to look at the difference between men and women … w.r.t. cholesterol.
Experimental Evaluation
Inferences About Process Quality
“There are three types of lies: Lies, Damn Lies and Statistics” - Mark Twain.
Inferential Statistics
The t-test Inferences about Population Means when population SD is unknown.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
9. Statistical Inference: Confidence Intervals and T-Tests
Hypothesis Testing:.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Hypothesis testing Draw inferences about a population based on a sample.
II.Simple Regression B. Hypothesis Testing Calculate t-ratios and confidence intervals for b 1 and b 2. Test the significance of b 1 and b 2 with: T-ratios.
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
Data Processing, Fundamental Data
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Testing Theories: Three Reasons Why Data Might not Match the Theory Psych 437.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Statistics and Quantitative Analysis Chemistry 321, Summer 2014.
Why statisticians were created Measure of dispersion FETP India.
© Nuffield Foundation 2011 Nuffield Free-Standing Mathematics Activity Rain or shine? Spreadsheet activity.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Sample-Based Epidemiology Concepts Infant Mortality in the USA (1991) Infant Mortality in the USA (1991) UnmarriedMarriedTotal Deaths16,71218,78435,496.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
1 Univariate Inferences about a Mean Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Basic Measurement and Statistics in Testing. Outline Central Tendency and Dispersion Standardized Scores Error and Standard Error of Measurement (Sm)
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
1 …continued… Part III. Performing the Research 3 Initial Research 4Research Approaches 5 Hypotheses 6 Data Collection 7 Data Analysis.
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.
HL Psychology Internal Assessment
1 Research Methods in Psychology AS Descriptive Statistics.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
SECTION 1 TEST OF A SINGLE PROPORTION
PSY 626: Bayesian Statistics for Psychological Science
Chapter 5: Introduction to Statistical Inference
Unit 7 Today we will look at: Normal distributions
Inferential statistics,
PSY 626: Bayesian Statistics for Psychological Science
Inference for Who? Young adults. What? Heart rate (beats per minute).
Skills 5. Skills 5 Standard deviation What is it used for? This statistical test is used for measuring the degree of dispersion. It is another way.
Presentation transcript:

Economics 5550/6550 Statistical Tools 2014

Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's consider health care costs. Most simply: How fast are they rising? Are ours higher than Canada's, UK, or others; if so how much? Does managed care reduce costs; if so, how much? Are patients in managed care settings sicker, healthier; if so how much?

Most of the time, such assertions are made with certainty. Yet, they are almost certainly not certain. These involve testing hypotheses. Many of you have had some passing experience with experimental laboratory work. Most often the work is a demonstration, rather than an experiment (i.e. mix up these 2 chemicals and you'll blow up the lab). Let's see what we have in mind with empirical work. Statistics and Hypotheses

Toggle to Country Data Pct GDP Spent Over Time (EXCEL SS)Pct GDP Spent Over Time Look at the Data

What does this look like to you?

How do we formulate hypotheses? The style is to use the "null" hypothesis; then contrast it to the "alternative" hypothesis. Null means “nothing is happening.” Let's look at several countries. Are the health expenditure shares equal? H 0 : s US = s UK. H 1 : s US  s UK.

H 0 : s US = s UK. H 1 : s US  s UK. So, if s US seems to be about the same as s UK, we'll accept the null hypothesis. If it seems to be different, we'll reject the null hypothesis. This type of hypothesis is called a "simple hypothesis." Simple Hypotheses

Composite Hypotheses Often we want to consider more than simply equality. Suppose we are examining the health expenditures of rich people, E r, as opposed to poor people, E p. We could certainly compare equalities. We might also wish to test the received wisdom that the rich spend more, or: H 0 : E r = E p H 1 : E r > E p

Composite Hypotheses H 0 : E r = E p H 1 : E r > E p As opposed to a "simple hypothesis" above, this called a "composite hypothesis." Note that this is not the same as above. If somehow, we discovered that E p > E r, it would not validate this hypothesis. So what? Well, let's go back to the health care system, for example. What costs more? What is more effective? Ultimately, statistical analysis is how the linkages between smoking and cancer were discovered. Tobacco industry fought this for years!

So, how do we test hypotheses? Let’s look at the US v. the UK Is the share different? It was in That’s only one data point. Let’s look at some others. It looks like most of the time US > UK.

Measure of central tendency There is an enormous body of statistical literature that suggests that if we take the average, or arithmetic mean, of a group of people, we will have a better measure of "central tendency." Think GPA. The intuition is straightforward. While any single observation may be suspect, the premise is that a measure taken from a larger number of observations may be less suspect. What other measures are sometimes used? Median, mode. Why?

Hypothesis testing Here (Spreadsheet):Spreadsheet

Hypothesis testing Well, they look different. Hard to imagine that they would be identical. Are they different? d = 10.6 – 6.0 = 4.6. Is that little or big, compared to 0?

Testing Differences Let's see if we can do things a bit more rigorously, looking at the "spread" of the data. So:

Testing Differences We'd like to determine whether the difference d could have been generated randomly; that is, what is the possibility that in truth s US = s UK, and somehow we just managed to find 2 samples that generated these separate means? We want to compare the difference to the spread. If the difference is large (relative to the spread), it would seem to be unlikely that it was caused at random. If it is small, it might have been caused at random.

Testing Differences A useful way, and by far the most frequent way, that we calculate the "spread" is to subtract each value from the mean, square the term, add the squared terms, and divide by the number of observations.

Testing Differences If the spread is large, then this measure is large. If the spread is small, then this measure will be small. We often take the square root of this difference to get the standard error for the US. We can do the same for the UK, as well. Our EXCEL worksheet helps us out.EXCEL

Testing Differences It turns out that we can calculate a standard error of the difference which equals: sd d = (sd US 2 + sd UK 2 ) 1/2 sd d = ( ) 1/2 = Let's compare the spread, or standard error, of the difference to the difference. The difference of 4.6 is about 8.3 times as big as the standard error. Clearly, if the difference was 0, we'd accept the fact that s US = s UK.

t statistic The ratio of the difference to the standard error is called a Student’s t statistic. You used to have to look these up with tables, but now we can actually use a spreadsheet. Let’s take a look.

What I learned on this summer on vacation “Student” was William Sealy Gossett, a statistician for Guiness. Gosset applied his statistical knowledge – both in the brewery and on the farm – to the selection of the best yielding varieties of barley.

Another researcher at Guinness had previously published a paper containing trade secrets of the Guinness brewery. To prevent further disclosure of confidential information, Guinness prohibited its employees from publishing any papers regardless of the contained information. However, after pleading with the brewery and explaining that his mathematical and philosophical conclusions were of no possible practical use to competing brewers, he was allowed to publish them, but under a pseudonym ("Student"), to avoid difficulties with the rest of the staff. Thus his most noteworthy achievement is now called Student's, rather than Gosset's, t-distribution.

Key points for hypothesis testing: State hypotheses clearly Choose suitable sample Calculate appropriate measures of central tendency and dispersion Draw appropriate inferences