11. Multivariate Analysis CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science, IUPUI.

Slides:



Advertisements
Similar presentations
SCATTERPLOT AND PERSONS PRODUCT- MOMENT CORRELATION COEFFICIENT. EXERCISE 2E AND 2F.
Advertisements

Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Covariance and Correlation
Chapter 16: Correlation.
When Simultaneous observations on hydrological variables are available then one may be interested in the linear association between the variables. This.
Linear regression and correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
9a. Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
Correlation CJ 526 Statistical Analysis in Criminal Justice.
9b. Column/Bar Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
Correlation Chapter 9.
CJ 526 Statistical Analysis in Criminal Justice
Correlation and Regression Analysis
Linear Regression and Correlation
S519: Evaluation of Information Systems Social Statistics Ch5: Correlation.
Perfect Negative Correlation Perfect Positive Correlation Non-Existent Correlation Imperfect Negative Correlation Imperfect Positive Correlation.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
Calculating and Interpreting the Correlation Coefficient ~adapted from walch education.
Correlation & Regression Math 137 Fresno State Burger.
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Chapter 4 Two-Variables Analysis 09/19-20/2013. Outline  Issue: How to identify the linear relationship between two variables?  Relationship: Scatter.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
MEASURES OF RELATIONSHIP Correlations. Key Concepts Pearson Correlation  interpretation  limits  computation  graphing Factors that affect the Pearson.
1 MBF 2263 Portfolio Management & Security Analysis Lecture 2 Risk and Return.
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Correlation1.  The variance of a variable X provides information on the variability of X.  The covariance of two variables X and Y provides information.
12b. Regression Analysis, Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Multivariate Analysis Trying to establish a mathematical relationship between multiple data sets. (e.g. smoking/cancer, salary/productivity, pressure/volume,
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Scatterplots October 14, Warm-Up Given the following domain and range in set notation, write the equivalent domain and range in algebraic notation.
Chapter 6 & 7 Linear Regression & Correlation
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
12a. Regression Analysis, Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
10a. Univariate Analysis Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
WELCOME TO THETOPPERSWAY.COM.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
Example 1: page 161 #5 Example 2: page 160 #1 Explanatory Variable - Response Variable - independent variable dependent variable.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
10b. Univariate Analysis Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
9c. Line Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
Regression MBA/510 Week 5. Objectives Describe the use of correlation in making business decisions Apply linear regression and correlation analysis. Interpret.
Numerical Measures of Variability
Correlation iNZight gives r. POSITIVE linear correlation r=1 "Perfect" 0.9
5.4 Line of Best Fit Given the following scatter plots, draw in your line of best fit and classify the type of relationship: Strong Positive Linear Strong.
Correlation. Correlation is a measure of the strength of the relation between two or more variables. Any correlation coefficient has two parts – Valence:
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Measures of Association: Pairwise Correlation
Correlations AP Psychology. Correlations  Co-relation  It describes the relationship b/w two variables.  Example #1  How is studying related to grades?
9e. Scatter Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
CORRELATION ANALYSIS.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
9d. Pie Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
Chapter 2 Bivariate Data Scatterplots.   A scatterplot, which gives a visual display of the relationship between two variables.   In analysing the.
1 Press Ctrl-A ©G Dear 2009 – Not to be sold/Free to use CorrelationCoefficient Stage 6 - Year 11 General Mathematics HSC.
Simple Linear Correlation
Pearson’s Correlation Coefficient
Correlation & Regression
Covariance and Correlation
CORRELATION & LINEAR REGRESSION
Investigation 4 Students will be able to identify correlations in data and calculate and interpret standard deviation.
CORRELATION ANALYSIS.
Chapter 3D Chapter 3, part D Fall 2000.
CORRELATION AND MULTIPLE REGRESSION ANALYSIS
Correlation & Trend Lines
Correlation & Regression
Presentation transcript:

11. Multivariate Analysis CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science, IUPUI

Multivariate Data Analysis Univariate data analysis concerned itself with describing an entity using a single variable. Multivariate data analysis tries to establish a mathematical relationship between multiple data sets. –smoking/cancer –salary/productivity –temperature/chirps in 15 seconds MeasureTemperature (X)Chirps (Y) ………

Correlation Multivariate data analysis depends largely on correlation. Correlation is a mathematical tool used to establish a dependency between two variables. Researchers use Pearson's Correlation Coefficient to represent correlation, signified by R:

Review Variance: One measure of dispersion (deviation from the mean) of a data set. The larger the variance, the greater is the average deviation of each datum from the average value. Standard deviation: Square root of the variance. The magnitude of the number is more in line with the values in the data set. Variance = Average value of the data set Standard Deviation =

Calculating Covariance

Measuring Correlation R: Values range between -1 (perfect negative or inverse correlation) and +1 (perfect positive correlation). A positive correlation (+) reflects a situation where an increase in value of one variable accompanies an increase in the value of the second variable. An R value of +1 is called "perfect positive." A negative correlation (-) reflects a situation where an increase in value of one variable accompanies an decrease in the value of the second variable (inverse correlation). An R value of -1 is called "perfect negative."

Measuring Correlation This measurement applies only to linear systems. Excel Covariance Function: =COVAR(Range1, Range2) Excel Correlation Function: =CORREL(Range1, Range2)

Magnitude of Association Although interpretation is discipline specific, we can generally draw the following strengths for |R|, where -1 <= R <= 1 : Correlation Strength Value of |R| Weak0.0 – 0.3 Moderate0.3 – 0.6 Strong0.6 – 1.0

R is close to +1

R is close to.5

R is close to zero