Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.

Slides:



Advertisements
Similar presentations
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 1-3 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Advertisements

Percentiles and the Normal Curve
Math 3680 Lecture #19 Correlation and Regression.
Correlations and scatterplots -- Optical illusion ? -- Finding the marginal distributions in the scatterplots (shoe size vs. hours of TV) Regressions --
MA-250 Probability and Statistics
Linear Regression.
Relationships Between Quantitative Variables
2-5 : Normal Distribution
Optical illusion ? Correlation ( r or R or  ) -- One-number summary of the strength of a relationship -- How to recognize -- How to compute Regressions.
5-1 bivar. Unit 5 Correlation and Regression: Examining and Modeling Relationships Between Variables Chapters Outline: Two variables Scatter Diagrams.
Statistics 303 Chapter 10 Least Squares Regression Analysis.
Introduction to Probability and Statistics Linear Regression and Correlation.
Multiple Choice Review
Data Analysis Statistics. Inferential statistics.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
LINEAR REGRESSIONS: About lines Line as a model: Understanding the slope Predicted values Residuals How to pick a line? Least squares criterion “Point.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Today Concepts underlying inferential statistics
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Chapter 11: Random Sampling and Sampling Distributions
Chapters 10 and 11: Using Regression to Predict Math 1680.
Chapter 9: Interpretative aspects of correlation and regression.
Understanding Research Results
EC339: Lecture 6 Chapter 5: Interpreting OLS Regression.
Chapters 8 and 9: Correlations Between Data Sets Math 1680.
The Role of Confidence Intervals in Research 1. A study compared the serum HDL cholesterol levels in people with low-fat diets to people with diets high.
Data Analysis 17 Data are obtained from a random sample of adult women with regard to their ages and their monthly expenditures on health products. The.
Body Parts – Lab Write-up A hiker was walking in the woods and found a 54 cm fibula bone buried in the ground Forensic investigators were called to the.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Confidence Intervals for the Regression Slope 12.1b Target Goal: I can perform a significance test about the slope β of a population (true) regression.
The Mean of a Discrete Probability Distribution
Descriptive Statistics Descriptive Statistics describe a set of data.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
ESSENTIAL QUESTION: HOW CAN WE USE MATH TO PREDICT THE FUTURE? 7.1 Fitting Data to a Line.
Investigating the Relationship between Scores
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Part IV Significantly Different: Using Inferential Statistics
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Find out where you can find rand and randInt in your calculator. Write down the keystrokes.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
2 Kinds of Statistics: 1.Descriptive: listing and summarizing data in a practical and efficient way 2.Inferential: methods used to determine whether data.
Introduction to Regression (Dr. Monticino). Assignment Sheet Math 1680  Read Chapter 9 and 10  Assignment # 7 (Due March 2)  Chapter 9 Exercise Set.
Data Analysis.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
Is their a correlation between GPA and number of hours worked? By: Excellent Student #1 Excellent Student #2 Excellent Student #3.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
STATISTICS STATISTICS Numerical data. How Do We Make Sense of the Data? descriptively Researchers use statistics for two major purposes: (1) descriptively.
Learn to create and interpret scatter plots.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
The Normal Distribution Chapter 2 Continuous Random Variable A continuous random variable: –Represented by a function/graph. –Area under the curve represents.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
 Understand how to determine a data point is influential  Understand the difference between Extrapolation and Interpolation  Understand that lurking.
Introductory Psychology: Statistical Analysis
Correlation & Forecasting
CHAPTER 7 LINEAR RELATIONSHIPS
Chapter 15 Linear Regression
Statistical significance & the Normal Curve
Lecture Slides Elementary Statistics Thirteenth Edition
CHAPTER 2 Modeling Distributions of Data
Descriptive Statistics Univariate Data
Presentation transcript:

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics for Economist Chap 6. Regression 1.The Relation of Two Variables 2.The Graph of Averages 3.The Regression Method 4.The Regression Effect 5.The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 2/23 INDEX 1 The Relation of Two Variables 2 The Graph of Averages 3 The Regression Method 4 The Regression Effect 5 The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 3/23 1. Relation of Two Variables Regression and SD line go through the point of averages Height (cm) Weight (kg) SD line regression line Associated with an increase of 1 SD in height there is an increase of only 0.67 SD in weight, on the average. Scatter Plot

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 4/23 Regression Associated with each increase of 1SD in x there is an increase of only r SDs in y, on the average.  The regression line for y on x estimates the average value for y corresponding to each value of x. r: correlation coefficient SD x r  SD y estimation average y x 1. Relation of Two Variables

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 5/23 INDEX 1 The Relation of Two Variables 2 The Graph of Averages 3 The Regression Method 4 The Regression Effect 5 The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 6/23 2. Graph of Averages Graph of Averages 각 점들은 각각의 키에 대하여 그 키에 해당하는 집단의 평균 몸무게를 보여준다. 점 위에 표시된 숫자는 해당집단의 크기를 나타낸다. Many points on the graph of averages are near the line. This line is regression line. regression Graph of averages

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 7/23 Improper case of regression line – nonlinear association When the graph of averages is nonlinear,the regression line does not reflect theexact association. Regression line 2. Graph of Averages

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 8/23 INDEX 1 The Relation of Two Variables 2 The Graph of Averages 3 The Regression Method 4 The Regression Effect 5 The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 9/23 3. Regression Method Guess the weight Without being told anything else about him, the best guess on the weight of one of these men is the overall average weight 63.5kg. If you are told the man ’ s height: 179.5cm then your best guess for his weight is the average for all the 179.5cm men in the study, 74.7kg. <Health and Nutrition Examination Survey> height(cm) weight (kg)

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 10/23 Example 1  Followings are the results of 100 students majoring in economics in a certain college. Economics 1 average = 3.00 SD X = 0.87 Statistics average = 2.80 SD Y = 0.86 r = 0.36 EX) Predict the statistics GPA of a student whose GPA of Economics I is Regression Method

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 11/23 Regression Analysis 1.0.8SDx above average on the Economics = 0.7 = 0.87   0.8 = SDy above average on the Statistics. 4. The predicted GPA of statistics is = 3.05 Example1 3. That is, 0.29  0.86 = 0.25(GPA) above average on the Statistics. Corr. coeff SD X SD y 3. Regression Method

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 12/23 notice ☞ when estimating on new subjects, Foregoing data can represent the new subjects ; reasonable!! Foregoing data and the new subjects are different in groups or in the range ; you have to think about the issue carefully before applying the regression method. 3. Regression Method

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 13/23 In the case of [example 1] … Would be the data from the students majoring in Economics applied directly to Jane whose major is Philosophy?  What if Jane majors in Philosophy not in Economics?  What if Jane ’ s GPA of Economics I is 4.3 but 3.7?  If most GPAs of the Economics 1 (x-value) are between 2.0 and 4.0, Jane ’ s x-value(4.3) is out of the previous x-range. Would the estimated regression line based on the x- value range 2.0~4.0 be applied to Jane ’ s GPA which is over the range?  [The extrapolation is not a simple problem.] 3. Regression Method

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 14/23 Regression Analysis example2- predict percentile ranks  Suppose the percentile rank of one student on the Economics 1 GPA is 90%. Predict her percentile rank on statistics. 1.This student scored 1.3 SDx above average on the Economics  1.3 = 0.47 SDy above average on the statistics. =68% z = 1.3 z = 0.47 =90% 3. The percentile rank on statistics is predicted as 68%. 3. Regression Method

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 15/23 INDEX 1 The Relation of Two Variables 2 The Graph of Averages 3 The Regression Method 4 The Regression Effect 5 The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 16/23 containing too many points with small x-value. Most of these points ’ y-values are above the SD line. Y-values of points are spread symmetrically along the SD line overall. Containing too many points with large x-value. Most of these points ’ y-values are below the SD line. 4. Regression Effect regression to mediocrity Regression line SD line

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 17/23 Regression effect Final average scores corresponding to each midterm scores SD line Regression line – summarizes each average points well 4. Regression Effect

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 18/23 regression fallacy  The regression effect occurs when each point representing data is spread out around the SD line.  The fallacy that the regression effect must be due to something important, not just the spread around the line ☞ Regression Fallacy 4. Regression Effect

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 19/23 Observed score= Suppose that the probability error is  5 with probability 0.5. The people scored 140 can be divided into two groups with true score 135 or 145. The model for regression effect If someone scores above average on the first test, the true score is probably a bit lower than the observed score. (observed score) = (true score) + (chance error) 4. Regression Effect

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 20/23 INDEX 1 The Relation of Two Variables 2 The Graph of Averages 3 The Regression Method 4 The Regression Effect 5 The Regression Line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 21/23 5. Regression line for x on y and y on x Two different regression lines weight height Regression of weight on height The regression line is solid, and the SD line is dashed

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 22/23 exmaple3  The men whose IQ was 140 had wives whose IQ averaged 120. Look at the wives whose IQ was 120; should the average IQ of their husbands be 140? (IQ scores are scaled to have an average of about 100, and an SD of about 15, both for men and women. The correlation is about 0.50) Wife ’ s IQ Husbnad ’ s IQ Husband ’ s IQ ? 5. Regression line for x on y and y on x

Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics STATISTICS 23/23 example 3 Wife ’ s IQ Husband ’ s IQ Regression line for husband ’ s IQ on wife ’ s IQ Regression line for wife ’ s IQ on husband ’ s IQ There are two different regression lines; one for predicting the wife ’ s IQ from her husband ’ s IQ and the other for vice versa. 5. Regression line for x on y and y on x