Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.

Similar presentations


Presentation on theme: "Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill."— Presentation transcript:

1

2 Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays & Fridays

3

4 On class website: Please complete homework worksheet #24 Please complete the homework modules on the D2L website Hypothesis Testing, Correlations Due: Friday, April 15 th Homework Please note: In the graded quiz portion of the online D2L Homework 24 you will be asked to access Zeanna's data. The link within the homework may not work properly. Please download the Excel file on the class website or within D2L to complete the homework assignment

5 By the end of lecture today 4/13/16 Project 4 Simple Regression Using correlation for predictions

6 Before our fourth and final exam (May 2 nd ) OpenStax Chapters 1 – 13 (Chapter 12 is emphasized) Plous Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions Schedule of readings

7

8

9 Labs will meet this week Project 4

10 +0.9199 3 0.878

11

12 +0.9199 3 0.878 Yes The relationship between the hours worked and weekly pay is a strong positive correlation. This correlation is significant, r(3) = 0.92; p < 0.05

13 -0.73 3 0.878 No The relationship between wait time and number of operators working is negative and strong, but not reliable enough to reach significance. This correlation is not significant, r(3) = -0.73; n.s. 3

14 We are measuring 9 students

15

16 4.0 3.0 2.0 1.0 0 1 2 3 4 High School GPA GPA r(7) = 0.50 r(7) = + 0.911444123 0 200 300 400 500 600 SAT (Verbal) GPA r(7) = + 0.80 r(7) = + 0.616334867 SAT (Mathematical) GPA r(7) = + 0.80 r(7) = + 0.487295007 4.0 3.0 2.0 1.0 4.0 3.0 2.0 1.0 0 200 300 400 500 600 Critical r = 0.666 Reject Null r is significant Do not reject null r is not significant Do not reject null r is not significant

17 4.0 3.0 2.0 1.0 0 1 2 3 4 High School GPA GPA r(7) = 0.50 r(7) = + 0.911444123 0 200 300 400 500 600 SAT (Verbal) GPA r(7) = + 0.80 r(7) = + 0.616334867 SAT (Mathematical) GPA r(7) = + 0.80 r(7) = + 0.487295007 4.0 3.0 2.0 1.0 4.0 3.0 2.0 1.0 0 200 300 400 500 600

18 4.0 3.0 2.0 1.0 0 1 2 3 4 High School GPA GPA r(7) = 0.50 r(7) = + 0.911444123 0 200 300 400 500 600 SAT (Verbal) GPA r(7) = + 0.80 r(7) = + 0.616334867 SAT (Mathematical) GPA r(7) = + 0.80 r(7) = + 0.487295007 4.0 3.0 2.0 1.0 4.0 3.0 2.0 1.0 0 200 300 400 500 600

19 4.0 3.0 2.0 1.0 0 1 2 3 4 High School GPA GPA r(7) = 0.50 r(7) = + 0.911444123 0 200 300 400 500 600 SAT (Verbal) GPA r(7) = + 0.80 r(7) = + 0.616334867 SAT (Mathematical) GPA r(7) = + 0.80 r(7) = + 0.487295007 4.0 3.0 2.0 1.0 4.0 3.0 2.0 1.0 0 200 300 400 500 600

20 Correlation: Independent and dependent variables When used for prediction we refer to the predicted variable as the dependent variable and the predictor variable as the independent variable Dependent Variable Dependent Variable Independent Variable Independent Variable What are we predicting?

21 Correlation - What do we need to define a line Expenses per year Yearly Income Y-intercept = “a” ( also “b 0 ”) Where the line crosses the Y axis Slope = “b” ( also “b 1 ”) How steep the line is If you spend this much If you probably make this much The predicted variable goes on the “Y” axis and is called the dependent variable The predictor variable goes on the “X” axis and is called the independent variable

22 Angelina Jolie Buys Brad Pitt a $24 million Heart-Shaped Island for his 50th Birthday Expenses per year Yearly Income Angelina spent this much Angelina probably makes this much Dustin spends $12 for his Birthday Dustin spent this much Dustin probably makes this much Revisit this slide

23 Assumptions Underlying Linear Regression These Y values are normally distributed. The means of these normal distributions of Y values all lie on the straight line of regression. For each value of X, there is a group of Y values The standard deviations of these normal distributions are equal. Revisit this slide

24 Correlation - the prediction line Prediction line makes the relationship easier to see (even if specific observations - dots - are removed) identifies the center of the cluster of (paired) observations identifies the central tendency of the relationship (kind of like a mean) can be used for prediction should be drawn to provide a “best fit” for the data should be drawn to provide maximum predictive power for the data should be drawn to provide minimum predictive error - what is it good for?

25 Predicting Restaurant Bill The expected cost for dinner for two couples (4 people) would be $95.06 Cost = 15.22 + 19.96 Persons If “Persons” = 4, what is the prediction for “Cost”? Cost = 15.22 + 19.96 Persons Cost = 15.22 + 19.96 (4) Cost = 15.22 + 79.84 = 95.06 Prediction line Y’ = a + b 1 X 1 Y-intercept Slope If “Persons” = 1, what is the prediction for “Cost”? Cost = 15.22 + 19.96 Persons Cost = 15.22 + 19.96 (1) Cost = 15.22 + 19.96 = 35.18 People Cost If People = 4 Cost will be about 95.06

26 Predicting Rent The expected cost for rent on an 800 square foot apartment is $990 Rent = 150 + 1.05 SqFt If “SqFt” = 800, what is the prediction for “Rent”? Rent = 150 + 1.05 SqFt Rent = 150 + 1.05 (800) Rent = 150 + 840 = 990 Prediction line Y’ = a + b 1 X 1 Y-intercept Slope Square Feet Cost If SqFt = 800 Rent will be about 990 If “SqFt” = 2500, what is the prediction for “Rent”? Rent = 150 + 1.05 SqFt Rent = 150 + 1.05 (2500) Rent = 150 + 2625 = 2,775

27


Download ppt "Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill."

Similar presentations


Ads by Google