Statistics Correlation

Slides:



Advertisements
Similar presentations
Warm Up… Solve each equation for y.
Advertisements

WARM UP  Using the following table and y = mx + b;  1) What is the y?  2) What is the b? xy
1 Functions and Applications
1 Chapter 7 My interest is in the future because I am going to spend the rest of my life there.— Charles F. Kettering Forecasting.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
5-7: Scatter Plots & Lines of Best Fit. What is a scatter plot?  A graph in which two sets of data are plotted as ordered pairs  When looking at the.
Graphing in Excel X-Y Scatter Plot SCI 110 CCC Skills Training.
Regression Basics For Business Analysis If you've ever wondered how two or more things relate to each other, or if you've ever had your boss ask you to.
2.4 Using Linear Models. The Trick: Converting Word Problems into Equations Warm Up: –How many ways can a $50 bill be changed into $5 and $20 bills. Work.
The Line of Best Fit Linear Regression. Definition - A Line of Best or a trend line is a straight line on a Scatter plot that comes closest to all of.
How do scientists show the results of investigations?
Trend lines and Lines of best fit.  Statisticians gather data to determine correlations (relationships) between events.  Scatter plots will often show.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Ekstrom Math 115b Mathematics for Business Decisions, part II Trend Lines Math 115b.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Graphical Analysis in Excel EGN 1006 – Introduction to Engineering.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Correlation The apparent relation between two variables.
Correlation Coefficient -used as a measure of correlation between 2 variables -the closer observed values are to the most probable values, the more definite.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
EXCEL DECISION MAKING TOOLS BASIC FORMULAE - REGRESSION - GOAL SEEK - SOLVER.
EXCEL GRAPHING *Basic Graphing Steps* by A.B. -NNHS.
Section 1.6 Fitting Linear Functions to Data. Consider the set of points {(3,1), (4,3), (6,6), (8,12)} Plot these points on a graph –This is called a.
Discovering Mathematics Week 9 – Unit 6 Graphs MU123 Dr. Hassan Sharafuddin.
6.7 Scatter Plots. 6.7 – Scatter Plots Goals / “I can…”  Write an equation for a trend line and use it to make predictions  Write the equation for a.
Graphing in Excel X-Y Scatter Plot SCI 110 CCC Skills Training.
EXCEL DECISION MAKING TOOLS AND CHARTS BASIC FORMULAE - REGRESSION - GOAL SEEK - SOLVER.
Introduction to regression 3C. Least-squares regression.
PreCalculus 1-7 Linear Models. Our goal is to create a scatter plot to look for a mathematical correlation to this data.
Welcome to Week 05 College Statistics
HA1-439: Functions Intro Remember, a relation is ANY set of ordered pairs like (3,2), (-2, 4), (4.5, 6) …It is any set of x’s and y’s. A FUNCTION is a.
Chapter 2 Linear regression.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Section 12.2 Linear Regression
Correlation & Forecasting
Linear Regression.
1 Functions and Applications
Have you got your workbook with you
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Copyright © Cengage Learning. All rights reserved.
Welcome to . Week 12 Thurs . MAT135 Statistics.
Mixed Costs Chapter 2: Managerial Accounting and Cost Concepts. In this chapter we explain how managers need to rely on different cost classifications.
“Scatter Plots” and “Lines of Fit”
SIMPLE LINEAR REGRESSION MODEL
Using Excel to Graph Data
Creating Scatterplots
The Least Squares Line Lesson 1.3.
2. Find the equation of line of regression
Creating Scatterplots
Statistics Time Series – Moving Average
Equations of Lines and Modeling
Section 3.3 Linear Regression
Unit 4 Mathematics Created by Educational Technology Network
Residuals and Residual Plots
Merve denizci nazlıgül, M.s.
M248: Analyzing data Block D UNIT D2 Regression.
Statistics Time Series
Correlation and Regression
3.1 Reading Graphs; Linear Equations in Two Variables
Using Excel to Graph Data
Y x Linear vs. Non-linear.
Graphing Linear Equations
Graphing Linear Equations
Sleeping and Happiness
7.1 Draw Scatter Plots and Best Fitting Lines
Distance – Time Graphs Time is usually the independent variable (plotted on the x-axis) Distance is usually the dependent variable (plotted on the y-axis)
9/27/ A Least-Squares Regression.
Data Frame and Hubble's Plot
Finding Correlation Coefficient & Line of Best Fit
Presentation transcript:

Statistics Correlation https://www.123rf.com/photo_6622261_statistics-and-analysis-of-data-as-background.html

Inputs and Outputs In formulas, some variables are “input” variables The thing you are calculating with the formula is the “output”

Inputs and Outputs In a factory, the inputs are the components

Inputs and Outputs The output is the final product

Inputs and Outputs The input variable is ALWAYS put on the x-axis (horizontal axis) No reason – it’s a TRADITION!

Inputs and Outputs It can sometimes be hard to figure out which is the input variable and which the output variable

Inputs and Outputs Input variables sometimes CAUSE the result

Inputs and Outputs Sometimes input variables occur before the out variables

Inputs and Outputs Frequently the input variables cannot be controlled by us (time, for example) For this reason, input variables are frequently called “independent” variables

Inputs and Outputs And, the output variables (which depend on the values of the independent input variables) are called “dependent” variables

Questions?

Correlation A relationship can be seen by graphing the independent and dependent variables in a scatter graph

Correlation A linear relationship is very common

Correlation When we calculated a correlation coefficient, we said it was a measure of the closeness to a linear relationship between the two variables

Line of Best Fit That means, we could find the formula for a line that would be the best fit for the two variables

Line of Best Fit We “fit” a line to the data

Line of Best Fit Real-world data rarely lands exactly on a straight line

Line of Best Fit But we fit the “best” line to the data

Line of Best Fit When you graph two variables on an x-y plot, you can fit a line through the data called a “trend line”

Line of Best Fit This trend line is a “line of best fit” to the data

Regression The “line of best fit” is created by minimizing the total distance of all the points to the line (deviations)

Regression The line of best fit is called the “regression” line

Regression Because it is a line, it has an equation: y = b + mx m = slope b = y-intercept

Regression The slope “m” and the correlation coefficient “r” will both have the same sign

Regression R2 tells how closely the regression line “fits” the data – “goodness of fit”

Regression As you can imagine, the calculations for correlation and the regression line are scary

Regression Hooray for Excel!

Regression Francis Galton

Questions?

Regression in Excel Does coming to class affect my grade?

What Does It Say?

Regression in Excel Edwin Hubble gathered and analyzed data from astronomical objects He used regression to show that the universe is expanding http://www.thefamouspeople.com/profiles/images/edwin-powell-hubble-1.jpg

Regression in Excel Let’s take a look!

Regression in Excel What do we do first?

Regression in Excel What do we do first? GRAPH THE DATA!

Regression in Excel

Regression in Excel Does it look like a straight line would fit the data well?

Regression in Excel Now we’re going to go to: Data Data Analysis Regression

Regression in Excel They want “y” first (I HATE this…)

Regression in Excel Let’s use “distance” for “x” and “velocity” for “y”

Regression in Excel Eeek! What’s all this????

Regression in Excel Here’s the RSQ. What is the %?

Regression in Excel For the trend line, you need:

Regression in Excel This (believe it or not) is the equation of the line of best fit!

Regression in Excel Line of best fit: y = mx + b

Regression in Excel Our equation is: Vel = 505.8409 x Dist + -48.3429

Regression in Excel Highlight and copy:

Regression in Excel Paste on the “Hubble” page

Regression in Excel Add a new column heading: Trend

We’re going to calculate our line:

Copy it down…

Oops! That doesn’t look right!

The reference cells are changing for each row We need to make those constant

Go back to the first entry Add a $ before the row numbers you want to keep constant

Now, copy it down!

Much better!

Regression in Excel Create a new graph:

Regression in Excel Make it purty! To make the trend line a line: change Marker Options to “none” change line color to “solid line”

TAH-DAH!

What Does It Say?

Regression Note: some people use the symbol “ŷ” for the trend data corresponding to the “y” observation data

Regression in Excel Summary of Regression Analysis: Data/Data Analysis/Regression Enter “y” first Copy the first two coefficients in the bottom table Trend line is: =Coeff*Data+Intercept Make a graph Change the trend dots into a line

Questions?