Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Relationships Among Variables
Lecture 5 Correlation and Regression
Correlation and Linear Regression
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation
Correlation and Linear Regression
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Chapter 15 Correlation and Regression
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Chapter 6 & 7 Linear Regression & Correlation
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Stage Screen Row B Gallagher Theater Row R Lecturer’s desk Row A Row B Row C
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Modern Languages Row A Row B Row C Row D Row E Row F Row G Row H Row J Row K Row L Row M
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Just one quick favor… Please use your phone or laptop Please take just a minute to complete Course Evaluations online….. Check your for a link or.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Modern Languages Projection Booth Screen Stage Lecturer’s desk broken
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
BNAD 276: Statistical Inference in Management Spring 2016
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Chapter Thirteen McGraw-Hill/Irwin
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2019 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Presentation transcript:

Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill Building 8:00 - 8:50 Mondays, Wednesdays & Fridays.

No more lab Sessions this semester

Schedule of readings Before next exam (Monday May 4 th ) Please read chapters 10 – 14 Please read Chapters 17, and 18 in Plous Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

Exam 4 – Monday, May 4 th Study guide is online Shorter and longer versions Bring 2 calculators Bring 2 pencils (with good erasers) Bring ID Review Homework Review for Exam 4

Yes

± (1.96) (289.06) ± ± (2.58) (289.06) ±

Number of doors 22-doors vs 4-doors Quasi Price of car (dollars) Ratio Between Two-tailed $23, $20, alpha = p = The average price for 2-door cars was $23,807.14, while the average price for 4-door cars was $20, A t-test was conducted and found this to be a significant difference t(802) = ; p < 0.01

Size of the engine 3 4- versus 6- versus 8 cylinder engines Quasi Price of car (dollars) Ratio Between $17,862 $20,081 alpha = p =.(17 zeroes, then)69755 The average prices were $17,862, $20,081 and $ for the 4-, 6-, and 8-cylinder engines (respectively). An ANOVA was conducted and found this to be a significant difference F(2,801) = 345; p < $38,968

The relationship between mileage and car price was This is a weak and not significant correlation r(802) = ;n.s

df = 802 b = r = a = lower higher x For each additional mile driven (as x goes up by 1), cost of the car goes down by cents. The base cost for the car (before taking into account the mileage) is $24,765 (30,000)( ) = $19, = or 2.04% The proportion of total variance for price of car that was accounted for by miles was 2.04%

Y’ = cost of car (dollars) mileagesize of car Y’ = ( )(mileage) + ( )( car size) yes cents $4,027.67

Today we will be reviewing for the test using clicker questions.

What if we were looking to see if our stop-smoking program affects peoples‘ desire to smoke. What would null hypothesis be? a. Can’t know without knowing the dependent variable b. The program does not work c. The programs works d. Comparing the null and alternative hypothesis Correct

Which of the following is a Type I error: a. We conclude that the program works when it fact it doesn’t b. We conclude that the program works when in fact it does c. We conclude that the program doesn’t work when in fact it does d. We conclude that the program doesn’t work when in fact it doesn’t Correct

What is the null hypothesis of a correlation coefficient? a. It is zero (nothing going on) b. It is less than zero c. It is more than zero d. It equals the computed sample correlation Correct

Let’s try one Winnie found an observed correlation coefficient of 0, what should she conclude? a. Reject the null hypothesis b. Do not reject the null hypothesis c. Not enough info is given Correct

In the regression equation, what does the letter "a" represent? a. Y intercept b. Slope of the line c. Any value of the independent variable that is selected d. None of these Y’ = a + bx 1 + bx 2 + bx 3 + bx 4 Correct

Assume the least squares equation is Y’ = X. What does the value of 10 in the equation indicate? a. Y intercept b. For each unit increased in Y, X increases by 10 c. For each unit increased in X, Y increases by 10 d. None of these. Correct

In the least squares equation, Y’ = X the value of 20 indicates a. the Y intercept. b slope (so for each unit increase in X, Y’ increases by 20). c. slope (so for each unit increase in Y’, X increases by 20). d. none of these. Correct

In the equation Y’ = a + bX, what is Y’ ? a. Slope of the line b. Y intercept C. Predicted value of Y, given a specific X value d. Value of Y when X = 0 Correct

If there are four independent variables in a multiple regression equation, there are also four a. Y-intercepts (a). b. regression coefficients (slopes or bs). c. dependent variables. d. constant terms (k). Y’ = a + bx 1 + bx 2 + bx 3 + bx 4 Correct

As n ↑ According to the Central Limit Theorem, which is false? As n ↑ x will approach µ As n ↑ curve will approach normal shape As n ↑ curve variability gets larger a. b. c. d. Correct

If the coefficient of determination is 0.80, what percent of variation is explained? a. 20% b. 90% c. 64% d. 80% What percent of variation is not explained? a. 20% b. 90% c. 64% d. 80% Correct coefficient of determination = r 2 Correct

Which of the following represents a significant finding: a. p < 0.05 b. t(3) = 0.23; n.s. c. the observed t statistic is nearly zero d. we do not reject the null hypothesis Correct

In multiple regression what is the range of values for a coefficient of regression? a. 0 to +1.0 b. 0 to -1.0 c to +1.0 d. Any number at all Y’ = a + b 1 X 1 + b 2 X 2 + b 3 X 3 Correct

If r = 1.00, which inference cannot be made? a. The dependent variable can be perfectly predicted by the independent variable b. This provides evidence that the dependent variable is caused by the independent variable c. All of the variation in the dependent variable can be accounted for by the independent variable d. Coefficient of determination is 100%. Correct

In a regression analysis what do we call the variable used to predict the value of another variable? a. Independent b. Dependent c. Correlation d. Determination Let’s try one Correct

What can we conclude if the coefficient of determination is 0.94? a. r 2 = 0.94 b. direction of relationship is positive c. 94% of total variation of one variable is explained by variation in the other variable. d. Both A and C Correct

Which of the following statements regarding the coefficient of correlation is true? a. It ranges from -1.0 to +1.0 b. It measures the strength of the relationship between two variables c. A value of 0.00 indicates two variables are not related d. All of these Correct

What does a coefficient of correlation of 0.70 infer? (r = +0.70) a. Almost no correlation because 0.70 is close to 1.0 b. 70% of the variation in one variable is explained by the other c. Coefficient of determination is 0.49 d. Coefficient of nondetermination is 0.30 Correct coefficient of correlation = r coefficient of determination = r 2

The Pearson product-moment correlation coefficient, r, requires that variables are measured with: a. an interval scale b. a ratio scale c. a nominal scale d. either A or B. Correct

If r = 0.65, what does the coefficient of determination equal? a b c d Correct

If the coefficient of correlation is 0.60, what percent of variation is not explained? a. 20% b. 90% c. 64% d. 80% Correct

If the coefficient of determination is 0.20, what percent of variation is not explained? a. 20% b. 90% c. 64% d. 80% Correct

What is the measure that indicates how precise a prediction of Y is based on X or, conversely, how inaccurate the prediction might be? a. Regression equation b. Slope of the line c. Standard error of estimate d. Least squares principle Correct

Let’s try one Agnes compared the heights of the women’s gymnastics team and the women’s basketball team. If she doubled the number of players measured (but ended up with the same means) what effect would that have on the results? a. the means are the same, so the t-test would yield the same results. b. the means are the same, but the variability would increase so it would be harder to reject the null hypothesis. c. the means are the same, but the variability would decrease so it would be easier to reject the null hypothesis. Correct

Let’s try one Agnes compared the heights of the women’s gymnastics team and the scores they got. If she doubled the number of players measured, but ended up with the same correlation (r) what effect would that have on the results? a. the r is the same, so the conclusion would be the same b. the r is the same, but with more people, degrees of freedom (df) would go up and it would be harder to reject the null hypothesis. c. the r is the same, but with more people, degrees of freedom (df) would go up and it would be easier to reject the null hypothesis. Correct

Which of the following is true about the standard error of estimate? a. It is a measure of the accuracy of the prediction b. It is based on squared vertical deviations between Y and Y’ c. It cannot be negative d. All of these Standard error of the estimate: a measure of the average amount of predictive error the average amount that Y’ scores differ from Y scores a mean of the lengths of the green lines Standard error of the estimate (line) Correct

If all the plots on a scatter diagram lie on a straight line, (perfect correlation) what is the standard error of estimate? a. - 1 b. +1 c. 0 d. Infinity Standard error of the estimate: a measure of the average amount of predictive error the average amount that Y’ scores differ from Y scores a mean of the lengths of the green lines Standard error of the estimate (line) Correct

Let’s try one Which of these correlations would be most likely to have the highest positive value for r? a. Scatterplot A b. Scatterplot B c. Scatterplot C d. Can not be determined from the information given Scatterplot AScatterplot BScatterplot C Correct

Let’s try one Which of the these scatterplots will have the smallest “y intercept”? a. Scatterplot A b. Scatterplot B c. Scatterplot C d. Can not be determined from the information given Scatterplot AScatterplot BScatterplot C Correct

Let’s try one Which of the these correlations would be most likely to represent the correlation between salary and expenses? a. Scatterplot A b. Scatterplot B c. Scatterplot C d. Can not be determined from the information given Scatterplot AScatterplot BScatterplot C Correct

Let’s try one Which of the following correlations would allow you the most accurate predictions? a. r = b. r = c. r = d. r = Correct

After duplicate correlations have been discarded and trivial correlations have been ignored, there remain a. two correlations b. three correlations c. six correlations d. nine correlations Let’s try one Correct

Which of the following conclusions can not be made from the data in the matrix? a. There is a significant correlation between Science and Reading b. There is a significant correlation between Math and Reading c. There is a significant correlation between Math and Science Let’s try one Correct

Thank you for a wonderful semester! and good luck with your studies See you at the final exam.