Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)

Slides:



Advertisements
Similar presentations
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Advertisements

Regresi Linear Sederhana Pertemuan 01 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Chapter 4 Part I - Introduction to Simple Linear Regression Applied Management Science for Decision Making, 2e © 2014 Pearson Learning Solutions Philip.
Chapter 12 Simple Regression
Simple Linear Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Stat 217 – Day 26 Regression, cont.. Last Time – Two quantitative variables Graphical summary  Scatterplot: direction, form (linear?), strength Numerical.
Linear Regression and Correlation Analysis
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Class 2: Tues., Sept. 14th Correlation (2.2) Introduction to Measurement Theory: –Reliability of measurements and correlation –Example that demonstrates.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Forecasting Outside the Range of the Explanatory Variable: Chapter
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Chapter 6 (cont.) Regression Estimation. Simple Linear Regression: review of least squares procedure 2.
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Correlation and Linear Regression
Correlation and Linear Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Linear Regression and Correlation
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.2 Extending the Correlation and R-Squared for Multiple.
Tests and Measurements Intersession 2006.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Section 4.2: Least-Squares Regression Goal: Fit a straight line to a set of points as a way to describe the relationship between the X and Y variables.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Lecture 10: Correlation and Regression Model.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Introduction. We want to see if there is any relationship between the results on exams and the amount of hours used for studies. Person ABCDEFGHIJ Hours/
Chapter 12 Simple Regression Statistika.  Analisis regresi adalah analisis hubungan linear antar 2 variabel random yang mempunyai hub linear,  Variabel.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Chapter 13 Simple Linear Regression
The simple linear regression model and parameter estimation
Chapter 14 Introduction to Multiple Regression
Linear Regression.
Chapter 13 Multiple Regression
Chapter 11: Simple Linear Regression
Chapter 5 STATISTICS (PART 4).
The Least-Squares Regression Line
Simple Linear Regression
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Least-Squares Regression
Least-Squares Regression
Presentation transcript:

Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)

Correlation and Reliability of Measurements Measurement theory: Branch of applied statistics that attempts to describe, categorize, evaluate and improve the quality of measurements. Measurement theory for psychological attributes such as intellectual ability or personality is called psychometrics. Reliability of a measurement: The degree of consistency with which a trait or attribute is measured. A perfectly reliable measurement will produce the same value each time assuming the trait or attribute remains constant. Validity of a measurement: The degree to which a measurement measures what it purports to measure. The reliability of a measurement is often determined by the correlation between repeated measurements of the same trait/attribute.

Reliability of Pulse Measurement Correlation =

Types of Reliability External reliability (stability): Does the measurement vary from one use to another? Internal reliability (Internal consistency): Is the measurement consistent within itself? Checking external reliability: Test-retest method. Correlation between the scores of the same people on two different administrations of test (e.g., pulse measurement). Checking internal reliability of a test: Split-half method. Randomly split a test into two halves. Compute correlation of scores on two halves.

Validity Validity of a measurement: The degree to which a measurement measures what it purports to measure. To what extent is one’s pulse a valid measure of one’s general state of health. What time did you go to bed last night? –Is it reliable as a measure of when you went to bed last night? –Is it reliable as a measure of your average bed time? –Is it valid as a measure of what time you actually went to bed? –Is it valid as a measure of time spent studying?

Evidence for Validity of a Test 1.Face validity: –Definition: Items on a test should look like they are covering the proper topics. –Example: Math test should not have history items. 2.Content validity: –Definition: Test covers range of material it is intended to cover. –Example: SAT should cover different areas of math and verbal ability.

Evidence for Validity of a Test 3.Predictive validity (criterion validity) –Definition: Test scores should be positively associated with real world outcomes that the test is supposed to predict. –Example: Correlation between SAT verbal scores and freshman grades is 0.47, correlation between SAT math scores and freshman grades is Construct validity –Definition: Test scores should be positively associated with other similar measures and should not be associated with irrelevant measures. –Example: SAT scores should be positively associated with high school GPA and other academic ability tests. SAT scores should not be associated with political attitudes.

Regression Analysis Setup: Response variable Y. Explanatory (predictor) variables Goal: (1) Understand how changes in are associated with changes in Y; (2) Predict Y based on Examples of applications: – Building an spam filter. Predict whether an e- mail is spam based on the frequency of certain words and punctuation. –The product manager in charge of a brand of children’s cereal would like to predict demand during the next year. She has available the following “predictor” variables: price of the product, number of children in target market, price of competitors’ products, effectiveness of advertising, annual sales this year and previous year

Simple Regression Analysis There is only one explanatory variable X. Example: A real estate agent would like to understand the relationship between Y=selling price of a house (dollars) and X=house size (square feet) and to predict selling price based on house size. To build a regression model, the real estate agent (who lives in Gainesville, FL) randomly samples 93 recently sold homes in Gainesville and finds out their selling price and their size (Data from Agresti and Finlay, Statistical Methods for the Social Sciences)

Regression Model There is a population of units, each of which has a (Y,X) value. Regression model: A model for the mean (expected value) of Y for the subpopulation of units with, denoted by. The model has a deterministic and a random component: The Y-value for unit i equals the mean value of Y given plus a random error that has mean zero. Application of regression model to prediction: A good prediction of Y for a unit with is. Example: A house has a size of 1000 feet. A good prediction of the house’s selling price is the mean selling price of all houses of size 1000 feet.

Simple Linear Regression Model The simple linear regression model is that the mean of Y given X is a straight line: Example:. For each additional one unit increase in X, the amount by which the mean of Y given X increases (or decreases), e.g., for each additional additional square foot of house size, mean selling price increases by 75. intercept. The mean of Y for X=0. Mean selling price of an empty lot is $25,000.

House size House Cost Most lots sell for $25,000 Building a house costs about $75 per square foot. House cost = (Size) Simple Linear Regression Model The model has a deterministic and a probabilistic components

House cost = (Size) House size House Cost Most lots sell for $25,000   However, house cost vary even among same size houses! Simple Linear Regression Model Since cost behave unpredictably, we add a random component.

Estimating the Slope and Intercept The estimates are determined by –drawing a sample from the population of interest, –calculating sample statistics. –producing a straight line that cuts into the data.           Question: What should be considered a good line? x y

The Least Squares Principle A good line is one that minimizes the sum of squared differences between the points and the line.

The Least Squares (Regression) Line 3 3     (1,2) 2 2 (2,4) (3,1.5) Sum of squared differences =(2 - 1) 2 +(4 - 2) 2 +( ) 2 + (4,3.2) ( ) 2 = 6.89 Sum of squared differences =(2 -2.5) 2 +( ) 2 +( ) 2 +( ) 2 = Let us compare two lines The second line is horizontal The smaller the sum of squared differences the better the fit of the line to the data.

Regression Analysis in JMP Use Analyze, Fit Y by X. Put response variable in Y and explanatory variable in X (make sure X is continuous by clicking on the X column, clicking Cols and Column Info and checking that the Modeling Type is Continuous). Click on fit line under red triangle next to Bivariate Fit of Y by X.

Prediction Least squares estimate of regression model: = Predicted selling price for a house that is 1000 square feet = $50,398 Predicted selling price for a house that is 0 square feet = -$25,222 !

Extrapolation Extrapolation: Use of the regression model to predict far outside the range of values of the explanatory variable X that you used to obtain the line. Such predictions are often not accurate.

Olympic Long Jump: Length of gold medal jump (Y) vs. Year (X)

Predictions from Long Jump Simple Linear Regression Model Predicted olympic gold medal winning long jumps: – 2008 (Beijing): *2008 = feet –2028: *2028 = feet –3000: *3000 = feet

Summary A good test is both reliable and valid. Correlation is useful in measuring reliability and predictive validity. Regression Analysis: Model for the mean of Y given X. Useful for predicting Y from X. Simple Linear Regression Model: Interpretation of slope and intercept, estimation of slope and intercept by least squares. Be careful about extrapolation when using a regression model to predict Y from X. Next class: Finish Chapter 2.3, start Chapter 2.4.