Download presentation
Presentation is loading. Please wait.
1
Introduction to Econometrics The Statistical Analysis of Economic (and related) Data
2
2 What do economists study?
3
3 How do we answer these questions?
4
4 Review of Probability and Statistics (SW Chapters 2, 3)
5
5 The California Test Score Data Set
6
6 Initial look at the data: (You should already know how to interpret this table) This table doesn’t tell us anything about the relationship between test scores and the STR.
7
7 Do districts with smaller classes have higher test scores? Scatterplot of test score v. student-teacher ratio What does this figure show?
8
8 How do we answer this question with data?
9
9 Compare districts with “small” (STR < 20) and “large” (STR ≥ 20) class sizes 1.Estimation of = difference between group means 2.Test the hypothesis that = 0 3.Construct a confidence interval for Class SizeAverage score ( ) Standard deviation (s Y ) n Small657.419.4238 Large650.017.9182
10
10 1. Estimation
11
11 2. Hypothesis testing
12
12 Compute the difference-of-means t-statistic:
13
13 3. Confidence interval
14
14 Review of Statistical Theory
15
15 (a) Population, random variable, and distribution
16
16 Population distribution of Y
17
(b) Characteristics (a.k.a. moments) of a population distribution 17
18
Flip coin to see how many heads result from 2 flips E(Y) = 0*(0.25) + 1*(0.50) + 2*(0.25) = 0 + 0.50 + 0.50 = 1 var(Y) = (0.25)*(0 - 1)² + (0.50)*(1 – 1)² + (0.25)*(2 – 1)² = 0.25 + 0 + 0.25 =.50 stdev(Y) = √.50 = 0.7071 18
19
19
20
20
21
21 2 random variables: joint distributions and covariance
22
Joint Probability Example: The relationship between commute time and rain Pr(X=x, Y=y) is the joint probability, where X = 0 if raining = 1 otherwise Y = 1 if commute time is short (<20 minutes) = 0 if commute time is long (>= 20 minutes) Positive or negative relationship? 22
23
Conditional Probability Conditional probability is used to determine the probability of one event given the occurrence of another related event. Conditional probabilities are written as P(X | Y). They are read as “the probability of X given Y” and are calculated as: 23
24
Joint Independence Two random variables, X and Y, are independently distributed if for all X and Y Pr(X = x,Y = y) = Pr(X = x)*Pr(Y = y) or Pr(Y = y | X = x) = Pr(Y = y) 1. Do these hold in the rain and commute example? 2. Pr (X = 1, Y=1) = ? 3. E (X | Y=1) = ? 4. Pr (X=0 | Y=0) = ? 24
25
25 The correlation coefficient is defined in terms of the covariance:
26
26 The correlation coefficient measures linear association
27
27 (c) Conditional distributions and conditional means
28
28 Conditional mean, ctd.
29
29 (d) Distribution of a sample of data drawn randomly from a population: Y 1,…, Y n
30
30 Distribution of Y 1,…, Y n under simple random sampling
31
31
32
32 (a) The sampling distribution of
33
33 The sampling distribution of, ctd.
34
34 The sampling distribution of when Y is Bernoulli (p =.78):
35
35 Things we want to know about the sampling distribution:
36
36 The mean and variance of the sampling distribution of
37
37
38
38 Mean and variance of sampling distribution of, ctd.
39
39 The sampling distribution of when n is large
40
40 The Law of Large Numbers:
41
41 The Central Limit Theorem (CLT):
42
42 Sampling distribution of when Y is Bernoulli, p = 0.78:
43
43 Same example: sampling distribution of :
44
44 Summary: The Sampling Distribution of
45
45 (b) Why Use To Estimate Y ?
46
46
47
Test statistic = t-statistic: Significance level: Specified probability of Type I error Significance level = α Critical Value: Value of test statistic for which the test just rejects the null at a given significance level Language of Hypothesis Testing 47
48
Language of Hypothesis Testing, ctd. p-value Probability of drawing a statistic (e.g. Y) at least as adverse to the null hypothesis as the value computed with your data, assuming the null hypothesis is true The smallest significance level at which you can reject the null hypothesis |Test statistic| > |critical value| → reject null hypothesis |Test statistic| < |critical value| → fail to reject null hypothesis 48
49
49 Calculating the p-value with Y known:
50
50 Estimator of the variance of Y :
51
51 What is the link between the p-value and the significance level?
52
Common Critical Values One-Tail TestTwo-Tail Test 1-ααCritical Value 1-αα/2Critical Value 0.900.101.2820.900.051.645 0.950.051.6450.950.0251.960 0.990.012.3260.990.0052.576 52
53
53
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.