Bi-Variate Data( AS 3.9) Dru Rose (Westlake Girls High School) Workshop PD Aiming at Excellence.

Slides:



Advertisements
Similar presentations
Bivariate Data Report Structure.
Advertisements

Bi-Variate Data PPDAC. Types of data We are looking for a set of data that is affected by the other data sets in our spreadsheet. This variable is called.
LSRLs: Interpreting r vs. r2
Slide 1 Investigating Bivariate Measurement Data using iNZight Statistics Teachers’ Day 22 November 2012 Ross Parsonage.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Chapter 4 The Relation between Two Variables
Chapter 3 Bivariate Data
AS Achievement Standard.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
CORRELATON & REGRESSION
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Describing the Relation Between Two Variables
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Business Statistics - QBM117 Least squares regression.
Correlation and Regression Analysis
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
Scatter Diagrams and Correlation
Correlation & Regression Math 137 Fresno State Burger.
Bivariate Data Notes for Merit Comparing two Bi-variate plots.
LECTURE 2 Understanding Relationships Between 2 Numerical Variables
Warm-up with Multiple Choice Practice on 3.1 to 3.3
Correlation and Regression David Young Department of Statistics and Modelling Science, University of Strathclyde Royal Hospital for Sick Children, Yorkhill.
MacDonald’s Investigation. Pose a question Can the amount of fat per 100g of food can be used to predict the amount of energy contained in McDonalds.
Examining Relationships
Moderation & Mediation
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
Product moment correlation
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Correlation & Regression
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Chapters 8 & 9 Linear Regression & Regression Wisdom.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
1.1 example these are prices for Internet service packages find the mean, median and mode determine what type of data this is create a suitable frequency.
Carrying out a statistics investigation. A process.
Bivariate Data AS (3 credits) Complete a statistical investigation involving bi-variate data.
Lecture 5 Chapter 4. Relationships: Regression Student version.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Relationships Scatterplots and Correlation.  Explanatory and response variables  Displaying relationships: scatterplots  Interpreting scatterplots.
LSRLs: Interpreting r vs. r 2 r – “the correlation coefficient” tells you the strength and direction between two variables (x and y, for example, height.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
6.7 Scatter Plots. 6.7 – Scatter Plots Goals / “I can…”  Write an equation for a trend line and use it to make predictions  Write the equation for a.
Correlation Definition: Correlation - a mutual relationship or connection between two or more things. (google.com) When two set of data appear to be connected.
Investigating Bivariate Measurement Data using iNZight.
3.9 BIVARIATE DATA. Vocab ARMSPAN V’S HEIGHT Give data to me when finished.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Quantitative Data Essential Statistics.
Correlation & Regression
CHAPTER 7 LINEAR RELATIONSHIPS
Georgetown Middle School Math
Correlation and Regression
Understanding Standards Event Higher Statistics Award
Scatter plots & Association
EQ: How well does the line fit the data?
11A Correlation, 11B Measuring Correlation
Topic 8 Correlation and Regression Analysis
Algebra Review The equation of a straight line y = mx + b
Bivariate Data credits.
Correlation & Regression
Honors Statistics Review Chapters 7 & 8
Review of Chapter 3 Examining Relationships
Creating and interpreting scatter plots
Presentation transcript:

Bi-Variate Data( AS 3.9) Dru Rose (Westlake Girls High School) Workshop PD Aiming at Excellence

The Aims of this workshop Discuss some concepts in this standard that students appear to find difficult and share some possible teaching approaches Share some borderline A/M and M/E student work with our grading decisions and reasoning

2 Big ideas

How do we train students to use their eyes? Intially NO Technology(until they can identify key features of data and describe them in context.) Provide plenty of contrasting scatter plots with straightforward contexts

How do we train students to describe what they see in context? 1.Teacher modelling Student voice: “It’s a decreasing trend”

Activity: Where should the trend line go?

What do students appear to find difficult? 1.Scatter i.e. Variation in the vertical direction ( In a box-plot variation is about spread in the x- direction )

What does non-constant scatter really mean? X √

What do students appear to find difficult?

Start with data sets where it is obvious Tar content, nicotine content and weight are measured before the cigarette is smoked. The CO is emitted when the cigarette is smoked – hence must be the “outcome” variable.

Systollic BP (when heart muscle heart contracts) and Diastollic BP (when heart muscle relaxes between beats) Consider situations where it might not matter

Move on to a rich multi-variate set What might be a possible “outcome” variable? What might be possible predictor variables for that outcome? What are possible investigative questions we could pose? Gapminder 2008

What do students appear to find difficult? 3. What the trend equation tells us Linear Trend : Weight = * Height For every 1 cm increase in height, on average the weight of an American adult increases by about 1.08kg What does the tell us? Wt of a baby of zero height -Is it a useful measure in this plot? -NO

4. The correlation coefficient: inappropriate use of r When does it tell us something useful and when should it not be used? Technology does what the user tells it to. Students need to continually ask questions: Is it sensible to put a line on a non-linear graph? Does the line I have added actually describe the trend in the majority of the data? What do students appear to find difficult?

X caution √ X

What do students appear to find difficult? 5. Outliers and Groups They see outliers and groups where there are none They remove points or groups where they should not Key questions we want students to ask: Will this point (group) affect the position of the trend line(curve)? Will this point (group) affect the strength of the relationship? Do I need to do further analysis or further research or both?

iceland Brunei Luxembourg Why so different?

Outlier is clearly pulling trend line towards it- remove it to get an appropriate trend line for making a prediction

(tonnes/yr) Tonnes of oil equivalent per yr (toe) Not affecting trend line. No need to remove

Making a forecast: What do we expect at Merit and Excellence level ?

Research needed only Chad

What do students appear to find difficult? 6. Confounding (lurking) variables The association we are observing may be an indirect relationship, where both the predictor and outcome variables are correlated with another related variable (called a confounder) With the big multi-variate data sets now available in INZIGHT students can now test out their thinking regarding potential confounding variables

What other variable might be connected with low life expectancy and a high number of children per woman? (or the reverse ) Using INZIGHT we can quickly identify Chad as having the highest fertility rate. Why? Google question: Why does Chad have the highest number of children per wom an?

What do we expect for MERIT? Some research to help explain the reason for the posed question A demonstration of understanding of the context and any statistical jargon used Some discussion on the reliability and usefulness of the forecast : e.g. limited range of x values, wide prediction interval? An overall conclusion which does come to a final decision in answer to the question.

What extra do we expect for Excellence? Sound research, referenced and integerated into the report Deeper thinking, beyond a formulaic approach Ability to cope with the unexpected No inappropriate use of statistical techniques and /or serious misunderstandings Discussion of the limitations of the analysis