Statistics Bivariate Analysis By: Student 1, 2, 3 Minutes Exercised Per Day vs. Weighted GPA.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Chapter 12 Inference for Linear Regression
Inference for Regression
Regression Inferential Methods
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Correlation and Regression
Chapter 4 The Relation between Two Variables
Chapter 3 Bivariate Data
Scatter Diagrams and Linear Correlation
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Chapter 9: Correlation and Regression
Regression, Residuals, and Coefficient of Determination Section 3.2.
Linear Regression.
Measures of Regression and Prediction Intervals
Introduction to Linear Regression and Correlation Analysis
Correlation and Regression
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Regression. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for each. Using the slope from each.
Phone Contacts Vs GPA Is there a Correlation between the number of Contacts in someone's phone and their G.P.A?
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Wednesday, May 13, 2015 Report at 11:30 to Prairieview.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Warsaw Summer School 2015, OSU Study Abroad Program Regression.
Regression with Inference Notes: Page 231. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Essential Statistics Chapter 51 Least Squares Regression Line u Regression line equation: y = a + bx ^ –x is the value of the explanatory variable –“y-hat”
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Is their a correlation between GPA and number of hours worked? By: Excellent Student #1 Excellent Student #2 Excellent Student #3.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
We will use the 2012 AP Grade Conversion Chart for Saturday’s Mock Exam.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Regression Inference. Height Weight How much would an adult male weigh if he were 5 feet tall? He could weigh varying amounts (in other words, there is.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Regression and Correlation
Unit 4 LSRL.
Regression.
Statistics 101 Chapter 3 Section 3.
Chapter 5 LSRL.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Correlation and Regression
Chapter 12 Regression.
Regression Inference.
Least-Squares Regression
No notecard for this quiz!!
EQ: How well does the line fit the data?
Regression.
Regression.
Regression.
Regression Chapter 8.
Regression.
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Correlation and Regression
Regression.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Presentation transcript:

Statistics Bivariate Analysis By: Student 1, 2, 3 Minutes Exercised Per Day vs. Weighted GPA

Why did we choose this study? Exercise is a vital part of staying healthy and living an active and accomplished lifestyle. We believe that physical activity improves a student’s will to learn and may increase study habits. Previous studies have concluded that children who live a more active lifestyle are more compelled to succeed in school. We want to see if this is true at our school. We like to exercise, and we were curious to see if there is a correlation between these two variables.

Collected Data N=30

Vital Stats For X -X bar: Sx: # Summary: MinX: 0 Q1: 30 Med: 60 Q3: 120 MaxX: 240 For Y -Y bar: Sy: # Summary: MinY: 2.6 Q1: 3.33 Med: 3.5 Q3: 3.7 MaxY: 4.3

Outliers? In order to find outlier, we used the two formulas: #<Q1-1.5(IQR) #>Q3+1.5(IQR) 0<30-1.5(90) 240> (90) 0< >255 NO OUTLIERS 2.6< (.37) 4.3>3.7=1.5(.37) 2.6< > is an OUTLIER

Histogram of X (exercise in min) The shape of the data is slightly right skewed.

Histogram of Y (Weighted GPA) The graph has a bell-shaped distribution. Outlier=4.5

Empirical Rule Test Exercise (X) Mean=81.1 Standard Deviation= / = & / (2)= & / (3)= & % of the data falls between & % of the data falls between & % of the data falls between &

Empirical Rule Test GPA (Y) Mean= 3.494, Standard Deviation= / = & /-.3297(2)= & /-.3297(3)= & % of the data falls between & % of the data falls between & % of the data falls between &

Explanatory & Response Variable The explanatory variable (X) in our data is the number of minuets exercised per day, it is used to predict changes in the response variable (Y) or GPA. GPA is the response variable, and is dependent on the other data. This allows us to find a relationship between the two values.

Scatterplot

Analysis The Scatterplot shows that there is no linear correlation between exercise and weighted GPA due to the graph. In order to receive that conclusion, we know that when a correlation graph has a pattern it is linear. When the correlation graph does not have a pattern it is not linear. The coefficient of correlation is r = This also gives another reason why the scatter plot is not linear. If the r value is closer to 1 then it is linear. If the r value rounds close to zero it is not linear. If the r value was close to one, it would be very strong but in this case the r value is not strong at all because it is closer to zero. The outlier in this scatter plot is 4.3 which slightly altered our data.

Regression Line on Scatterplot Equation: y= x

The y-intercept of the regression line gives the predicted value of y for any given value of x. The slope shows the relationship between x and y as the steepness of the regression line is analyzed. Our data does not prove a correlation between weighted GPA and average minutes exercise performed in a day, so this equation should not be used to predict the response variable.

R & R Squared The r-squared value is explained variation over total variation and will give the accuracy (in a percentage) for a given value. R2= è.14% of the variation in Y is explained by the variation in x. R measures the strenght and direction of a linear relationshop between two variables R= negative, with no correlation.

Total Variation: is the sum of the y values minus the mean of y values, squared Explained Variation: is the sum of the y-hat values minus the mean of y values, squared Unexplained Variation: is the sum of the y values minus the y-hat values =

Standard Error of Estimate The standard error of estimate is a measure of how sample points deviate from the regression line. Se measures the difference between the observed y-values and the predicted y-values. One would take the unexplained variable, divide that by the degree of freedom and square the result. s e =  y 2 – b 0  y – b 1  xy n – 2 Se=.3353

95% Prediction Interval For X we choose: 70 With wanting to find the possible GPA of a person with an average 70 minute workout, there will be a.3353 standard of error. The GPA would fall between and

Residual Plot

Interpretation The Residual plot shows that it is not a good model for the LSRL. This is because the plot contains a pattern and is in the negative range. In other words, this graph is not linear. On the residual plot, the X-values equals GPA weighted and the Y-values is exercise in minutes.

Conclusion In conclusion, we have found that there is no correlation between how many minuets a high school student exercises, and their GPA. Our graphs and data values are not strong enough to draw conclusions based on our sample. Despite the amount of time that a student does or does not spend working out, their grades will neither increase or decrease.

Possible Problems If the sample had been larger, the results may have been more accurate. It is possible that subjects may have lied either about the amount they exercise or their true GPA, thus hindering our results. It is sometimes difficult to estimate how much you exercise each day because it varies depending on your changing daily activities.

The End.