Describing Bivariate Relationships 17.871 Spring 2006.

Slides:



Advertisements
Similar presentations
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Advertisements

Things to do in Lecture 1 Outline basic concepts of causality
Hypothesis Testing Steps in Hypothesis Testing:
Regression Greg C Elvers.
Regression, Correlation. Research Theoretical empirical Usually combination of the two.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Linear regression models
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Regression Regression: Mathematical method for determining the best equation that reproduces a data set Linear Regression: Regression method applied with.
Chapter 4 Describing the Relation Between Two Variables
Statistics for the Social Sciences
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Regression Forced March Spring Regression quantifies how one variable can be described in terms of another.
Lecture 11 Chapter 6. Correlation and Linear Regression.
Correlation and Regression Analysis
Correlation and Regression 1. Bivariate data When measurements on two characteristics are to be studied simultaneously because of their interdependence,
Chapter 7: Correlation Bivariate distribution: a distribution that shows the relation between two variables
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Simple Linear Regression
Linear Regression Analysis
Chapter 4 Two-Variables Analysis 09/19-20/2013. Outline  Issue: How to identify the linear relationship between two variables?  Relationship: Scatter.
Calibration & Curve Fitting
Correlation and Regression
Bivariate Relationships Testing associations (not causation!) Continuous data  Scatter plot (always use first!)  (Pearson) correlation coefficient.
Regression and Correlation Methods Judy Zhong Ph.D.
Equations in Simple Regression Analysis. The Variance.
Relationship of two variables
Correlation Scatter Plots Correlation Coefficients Significance Test.
Chapter 11 Simple Regression
MAT 254 – Probability and Statistics Sections 1,2 & Spring.
Introduction to Regression Analysis. Two Purposes Explanation –Explain (or account for) the variance in a variable (e.g., explain why children’s test.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Is there a relationship between the lengths of body parts ?
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Fundamental Statistics in Applied Linguistics Research Spring 2010 Weekend MA Program on Applied English Dr. Da-Fu Huang.
Linear Regression James H. Steiger. Regression – The General Setup You have a set of data on two variables, X and Y, represented in a scatter plot. You.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
BIOL 582 Lecture Set 11 Bivariate Data Correlation Regression.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.
Regression Regression relationship = trend + scatter
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Abortion Politics in the US The relationship between public opinion and governmental policy.
Curve Fitting Pertemuan 10 Matakuliah: S0262-Analisis Numerik Tahun: 2010.
Chapter Thirteen Bivariate Correlation and Regression Chapter Thirteen.
Psychology 202a Advanced Psychological Statistics October 22, 2015.
Chapter 11: Linear Regression and Correlation Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Correlation Assumptions: You can plot a scatter graph You know what positive, negative and no correlation look like on a scatter graph.
11.1 Heteroskedasticity: Nature and Detection Aims and Learning Objectives By the end of this session students should be able to: Explain the nature.
Simple linear regression and correlation Regression analysis is the process of constructing a mathematical model or function that can be used to predict.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
The simple linear regression model and parameter estimation
Correlation – Regression
Describing Bivariate Relationships
Scatterplots A way of displaying numeric data
CHAPTER fourteen Correlation and Regression Analysis
BIVARIATE REGRESSION AND CORRELATION
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
2. Find the equation of line of regression
Discrete Least Squares Approximation
11C Line of Best Fit By Eye, 11D Linear Regression
Ch 4.1 & 4.2 Two dimensions concept
Descriptive Statistics Univariate Data
Presentation transcript:

Describing Bivariate Relationships Spring 2006

Testing associations Continuous data  Scatter plot (always use first!)  (Pearson) correlation coefficient (should be rare)  (Spearman) rank-order correlation coefficient (rare)  Regression coefficient (common) Discrete data  Cross tabulations  Differences in means, box plots  χ 2  Gamma, Beta, etc.

Continuous DV, continuous EV Example: What is the relationship between Bush’s vote (by county) in 2000 and in 2004?

2004 Prez. Vote vs Pres. Vote

Subtract each observation from its mean x’=x y’=y-0.609

Covariance formula Cov(BushPct 00,BushPct 04 ) =

Correlation formula (compare with Tufte p. 102) Corr(BushPct 00,BushPct 04 ) =0.96 =

Warning: Don’t correlate often! Correlation only measures linear relationship Correlation is sensitive to variance Correlation usually doesn’t measure a theoretically interesting quantity

Regression quantifies how one variable can be described in terms of another

The Linear Relationship between Two Variables

The Linear Relationship between African American Population & Black Legislators ^ ^

How did we get that line? 1. Pick a value of Y i YiYi

How did we get that line? 2. Decompose Y i into two parts

How did we get that line? 3. Label the points YiYi YiYi ^ εiεi Y i -Y i ^ “residual”

What is ε i ? Vagueness of theory Poor proxies (i.e., measurement error) Wrong functional form

The Method of Least Squares YiYi YiYi ^ εiεi Y i -Y i ^

Solve for (Tufte, p. 68) ^

Discrete DV, discrete EV

Example What is the relationship between abortion sentiments and vote choice? The abortion scale: 1. BY LAW, ABORTION SHOULD NEVER BE PERMITTED. 2. THE LAW SHOULD PERMIT ABORTION ONLY IN CASE OF RAPE, INCEST, OR WHEN THE WOMAN'S LIFE IS IN DANGER. 3. THE LAW SHOULD PERMIT ABORTION FOR REASONS OTHER THAN RAPE, INCEST, OR DANGER TO THE WOMAN'S LIFE, BUT ONLY AFTER THE NEED FOR THE ABORTION HAS BEEN CLEARLY ESTABLISHED. 4. BY LAW, A WOMAN SHOULD ALWAYS BE ABLE TO OBTAIN AN ABORTION AS A MATTER OF PERSONAL CHOICE.

Abortion and vote choice in 2006

Use the appropriate graph Continuous DV, continuous EV  E.g., vote share by income growth  Use scatter plot Continuous DV, discrete and unordered EV  E.g., vote share by religion or by union membership  Box plot, dot plot, Discrete DV, discrete EV