Slide 1 The introductory statement in the question indicates: The data set to use (2001WorldFactBook) The task to accomplish (association between variables)

Slides:



Advertisements
Similar presentations
Descriptive Statistics-II
Advertisements

4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
4/12/2015Slide 1 We have seen that skewness affects the way we describe the central tendency and variability of a quantitative variable: if a distribution.
Correlation and Linear Regression.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Assumption of normality
Correlation CJ 526 Statistical Analysis in Criminal Justice.
Correlation Chapter 9.
CJ 526 Statistical Analysis in Criminal Justice
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Project #3 by Daiva Kuncaite Problem 31 (p. 190)
Principal component analysis
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Multiple Regression – Basic Relationships
SW388R7 Data Analysis & Computers II Slide 1 Computing Transformations Transforming variables Transformations for normality Transformations for linearity.
8/2/2015Slide 1 SPSS does not calculate confidence intervals for proportions. The Excel spreadsheet that I used to calculate the proportions can be downloaded.
Correlation Analysis 5th - 9th December 2011, Rome.
Regression Analysis We have previously studied the Pearson’s r correlation coefficient and the r2 coefficient of determination as measures of association.
8/7/2015Slide 1 Simple linear regression is an appropriate model of the relationship between two quantitative variables provided: the data satisfies the.
Problem 1: Relationship between Two Variables-1 (1)
8/9/2015Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of normality Transformations Assumption of normality script Practice problems.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
Review Regression and Pearson’s R SPSS Demo
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
Assumption of linearity
8/10/2015Slide 1 The relationship between two quantitative variables is pictured with a scatterplot. The dependent variable is plotted on the vertical.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Simple Linear Regression
Slide 1 SOLVING THE HOMEWORK PROBLEMS Simple linear regression is an appropriate model of the relationship between two quantitative variables provided.
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
SW388R7 Data Analysis & Computers II Slide 1 Logistic Regression – Hierarchical Entry of Variables Sample Problem Steps in Solving Problems.
8/23/2015Slide 1 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a one-sample test of a.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
9/18/2015Slide 1 The homework problems on comparing central tendency and variability extend the focus central tendency and variability to a comparison.
Hierarchical Binary Logistic Regression
Stepwise Multiple Regression
110/10/2015Slide 1 The homework problems on comparing central tendency and variability extend our focus on central tendency and variability to a comparison.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Pearson's r correlation coefficient measures the strength of the linear relationship between the distributions of.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Variation This presentation should be read by students at home to be able to solve problems.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
11/4/2015Slide 1 SOLVING THE PROBLEM Simple linear regression is an appropriate model of the relationship between two quantitative variables provided the.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
SW388R7 Data Analysis & Computers II Slide 1 Hierarchical Multiple Regression Differences between hierarchical and standard multiple regression Sample.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
12/14/2015Slide 1 The dependent variable, poverty, is plotted on the vertical axis. The independent variable, enrolPop, is plotted on the horizontal axis.
Practice Problem: Lambda (1)
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
1/11/2016Slide 1 Extending the relationships found in linear regression to a population is procedurally similar to what we have done for t-tests and chi-square.
1/23/2016Slide 1 We have seen that skewness affects the way we describe the central tendency and variability of a quantitative variable: if a distribution.
2/24/2016Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
SW388R7 Data Analysis & Computers II Slide 1 Principal component analysis Strategy for solving problems Sample problem Steps in principal component analysis.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Correlation & Linear Regression Using a TI-Nspire.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of linearity Strategy for solving problems Producing outputs for evaluating linearity Assumption.
Bivariate Relationships
Assumption of normality
Multiple Regression.
Theme 7 Correlation.
Multiple Regression – Split Sample Validation
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Performing the Spearman Rank-Order Correlation Using SPSS
Presentation transcript:

Slide 1 The introductory statement in the question indicates: The data set to use (2001WorldFactBook) The task to accomplish (association between variables) The variables to use in the analysis: percent of the total population who was literate [literacy] HIV-AIDS adult prevalence rate [hivaids]

Slide 2 These problem also contains a second paragraph of instructions that provide the formulas to use if our examination of the association between quantitative variables requires us to re-express or transform one or both of the variables.

Slide 3 The first statement concerns the number of valid cases. To answer this question, we produce the statistics using the SPSS Correlate procedure.

Slide 4 To compute correlations, select Correlate > Bivariate from the Correlate menu.

Slide 5 First, move the variables hivLive and literacy to the Variables list box. Second, mark the check box for Spearman and leave the check box for Pearson marked. Third, click on the OK button to produce the output.

Slide 6 The Correlations table shows us the number of cases available for computing the correlation – 131.

Slide 7 The number of cases with valid data to analyze the relationship between "percent of the total population who was literate" and "number of people living with HIV- AIDS" was 131, out of the total of 218 cases in the data set. Click on the check box to mark the statement as correct.

Slide 8 The next statement asks us to extract the Pearson correlation coefficient from the SPSS output.

Slide 9 The Correlations table shows us that the Pearson r correlation is

Slide 10 The Pearson's r correlation coefficient between "percent of the total population who was literate" [literacy] and "number of people living with HIV-AIDS" [hivlive] was Click on the check box to mark the statement as correct.

Slide 11 The next statement asks us to interpret the Pearson correlation coefficient using the guidelines attributed to Tukey.

Slide 12 Using the rule of thumb attributed to Tukey, a correlation between 0.0 and ±0.20 is very weak; ±0.20 to ±0.40 is weak; ±0.40 to ±0.60 is moderate; ±0.60 to ±0.80 is strong; and greater than ±0.80 is very strong, the relationship was correctly characterized as a weak (Pearson's r = -.203). Click on the check box to mark the statement as correct.

Slide 13 The next statement asks us to interpret the Pearson correlation coefficient using the guidelines attributed to Cohen.

Slide 14 Applying Cohen's criteria for effect size (less than ±0.10 = trivial; ±0.10 up to ±0.30 = weak or small; ±0.30 up to ±0.50 = moderate; ±0.50 or greater = strong or large), the relationship was correctly characterized as a weak (Pearson's r = -.203). Click on the check box to mark the statement as correct.

Slide 15 The next statement asks us to interpret the direction of the relationship between the variables. A direct or positive relationship means that the values of the variables change in the same direction, i.e. when one goes up or down, the other goes up or down. An inverse or negative relationship means that the values of the variables move in different directions.

Slide 16 Since the sign of the correlation coefficient was negative (Pearson's r = -.203), the relationship is inverse and the values for the variables move in opposite directions. The statement that "Higher scores on the variable "percent of the total population who was literate" were associated with lower scores on "number of people living with HIV-AIDS"" is correct. Click on the check box to mark the statement as correct.

Slide 17 The final question asks about the impact that re-expressing the data for badly skewed variables has on the correlation coefficient. Operationally, we define badly skewed as a skew value less than or equal to -1.0 or greater than or equal to 1.0

11/19/2015Slide 18 Use Descriptives to compute skew for both variables. If skew is negative, do square transformation. If skew is positive, do log transformation. Compute correlation for transformed variables. If Pearson’s r is higher, the linear fit improves (and the question is correct). If Pearson’s r is lower, the linear fit is not improved (and the question is not correct). The purpose of this question is to show that we can get an idea of what will happen if we compare pearson’s r to spearman’s rho. If rho is higher, transformations will improve fit; if rho is smaller, transformations reduce fit. (I only picked problems with substantial differences between r and rho; if the values are close, it is less predictable)

Slide 19 Pearson's r for the the square transformation of "percent of the total population who was literate" [SQ_literacy] and the log transformation of "number of people living with HIV-AIDS" [LG_hivlive] (-0.487) improved the linear fit of the relationship between "percent of the total population who was literate" [literacy] and "number of people living with HIV-AIDS" [hivlive] above what was obtained with the original variables (-0.203). The fact that Spearman's rho was larger than Pearson's r supported the presence of a stronger relationship if one or both variables are re-expressed to reduce skewness. Click on the check box to mark the statement as correct.