Statistics and Modelling 2011. Topic 1: Introduction to statistical analysis Purpose – To revise and advance our understanding of descriptive statistics.

Slides:



Advertisements
Similar presentations
Brought to you by Tutorial Support Services The Math Center.
Advertisements

Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Frequency Distribution and Variation Prepared by E.G. Gascon.
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Measures of Central Tendency. Central Tendency “Values that describe the middle, or central, characteristics of a set of data” Terms used to describe.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Calculating & Reporting Healthcare Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Data observation and Descriptive Statistics
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Today: Central Tendency & Dispersion
Measures of Central Tendency
Topic 5: Probability Distributions Achievement Standard Solve Probability Distribution Models to solve straightforward problems 4 Credits Externally.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Formula Compute a standard deviation with the Raw-Score Method Previously learned the deviation formula Good to see “what's going on” Raw score formula.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Chapter 3 Basic Statistics Section 2.2: Measures of Variability.
Objectives Vocabulary
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Table of Contents 1. Standard Deviation
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Lecture 3 Describing Data Using Numerical Measures.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
Numerical Measures of Variability
Introduction to Statistics Santosh Kumar Director (iCISA)
Statistics Describing, Exploring and Comparing Data
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Introduction to statistics I Sophia King Rm. P24 HWB
Measures of Central Tendency and Dispersion from Grouped Data.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Lesson 25 Finding measures of central tendency and dispersion.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Averages and Variability
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
Measures of Dispersion Measures of Variability
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
An Introduction to Statistics
Descriptive Statistics
One-Variable Statistics
Copyright © Cengage Learning. All rights reserved.
Chapter 3 Describing Data Using Numerical Measures
Measures of Central Tendency
Measures of Central Tendency
Science of Psychology AP Psychology
Summary descriptive statistics: means and standard deviations:
Chapter 3 Describing Data Using Numerical Measures
Unit 8: Statistics Lesson #5: One-Variable Statistical Analysis
Theme 4 Describing Variables Numerically
Central Tendency.
Chapter 3.
3-2 Measures of Variance.
Descriptive Statistics
Summary descriptive statistics: means and standard deviations:
Descriptive Statistics
The data sets {19, 20, 21} and {0, 20, 40} have the same mean and median, but the sets are very different. The way that data are spread out from the mean.
Lecture 4 Psyc 300A.
Descriptive statistics for groups:
Presentation transcript:

Statistics and Modelling 2011

Topic 1: Introduction to statistical analysis Purpose – To revise and advance our understanding of descriptive statistics and how to use them.

LESSON 1 – Central tendency and Variability Activity (as students arrive): Measure each other’s heights. –Students then calculate the mean height. NOTES: What are statistics? Look at measures of Central Tendency. Measures of Variability – what do standard deviation and variance actually tell us? HW: Worksheet + Old Sigma (2 nd edition) – Pg. 148: Ex. 10.1

Statistics Statistics are ___________________________________________ ____________________. Population Mean  =Sum of all scores Population size  = N x 

Statistics Statistics are numerical values that describe the characteristics of a set of numbers (data-set). 2 types of statistics: Measures of ______ ________: Mean and median & mode. Measures of ______: Standard deviation, variance, inter-quartile range Population Mean  =Sum of all scores Population size  = N x 

Statistics Statistics are numerical values that describe the characteristics of a set of numbers (data-set). 2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of ______: Standard deviation, variance, inter-quartile range Population Mean  =Sum of all scores Population size  = N x 

Statistics Statistics are numerical values that describe the characteristics of a set of numbers (data-set). 2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of Variability: Standard deviation, variance, inter-quartile range Measures of Central Tendency:  Mean – easy to calculate but _________ ___ ________ _______. Population Mean  =Sum of all scores Population size  = N x 

Statistics Statistics are numerical values that describe the characteristics of a set of numbers (data-set). 2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of Variability: Standard deviation, variance, inter-quartile range Measures of Central Tendency:  Mean – easy to calculate but sensitive to extreme values. Population Mean  =Sum of all scores Population size  = N x 

Statistics Statistics are numerical values that describe the characteristics of a set of numbers (data-set). 2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of Variability: Standard deviation, variance, inter-quartile range Measures of Central Tendency:  Mean – easy to calculate but sensitive to extreme values. Population Mean  =Sum of all scores Population size  = N x 

2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of Variability: Standard deviation, variance, inter-quartile range. Measures of Central Tendency:  Mean – easy to calculate but sensitive to extreme values.  Median – rank the data and find the middle entry. Advantage: More _____ than the mean because it doesn’t get dragged up or down by extreme values. Population Mean  =of all scores Population size Sum

2 types of statistics: Measures of Central Tendency: Mean and median & mode. Measures of Variability: Standard deviation, variance, inter-quartile range. Measures of Central Tendency:  Mean – easy to calculate but sensitive to extreme values.  Median – rank the data and find the middle entry. Advantage: More robust than the mean because it doesn’t get dragged up or down by extreme values. Disadvantages: Population Mean  =of all scores Population size Sum

 Mean – easy to calculate but sensitive to extreme values.  Median – rank the data and find the middle entry. Advantage: More robust than the mean because it doesn’t get dragged up or down by extreme values. Disadvantages: 1. _______________________________________________. _____________________________. 2. _____________________________. Population Mean  =of all scores Population size  N  Sum = x

 Mean – easy to calculate but sensitive to extreme values.  Median – rank the data and find the middle entry. Advantage: More robust than the mean because it doesn’t get dragged up or down by extreme values. Disadvantages: 1. Takes longer to calculate because you must rank the numbers first. 2. _____________________________. Population Mean  =of all scores Population size  N  Sum = x

 Mean – easy to calculate but sensitive to extreme values.  Median – rank the data and find the middle entry. Advantage: More robust than the mean because it doesn’t get dragged up or down by extreme values. Disadvantages: 1. Takes longer to calculate because you must rank the numbers first. 2. Not a function of all of the numbers.  The Mode - The most frequently occurring value. This is usually somewhere near the middle, but not always. Thus the mode is not always a good measure of central tendency. Population Mean  =of all scores Population size  N  Sum = x

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the _____________________ _____.

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The _______ is also a measure of variability. It is ________________ _______. ___________________________________________ ____________________.

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The variance is also a measure of variability. It is ________________ _______. ___________________________________________ ____________________.

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The variance is also a measure of variability. It is the standard deviation squared. ___________________________________________ ____________________.

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The variance is also a measure of variability. It is the standard deviation squared. We actually work out the variance first, then square root it to get the standard deviation. Variance Var(X) Var(X) = The Mean of the Squared Deviations from the Mean (i.e. the average squared distance from the mean) Var(X)=

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The variance is also a measure of variability. It is the standard deviation squared. We actually work out the variance first, then square root it to get the standard deviation. Variance Var(X) Var(X) = The Mean of the Squared Deviations from the Mean (i.e. the average squared distance from the mean) Var(X)=Sum of Squared Deviations from the Sample Mean Sample size

Measures of Variability Standard deviation, variance, inter-quartile range. The standard deviation is a measure of the average distance from the mean. The variance is also a measure of variability. It is the standard deviation squared. We actually work out the variance first, then square root it to get the standard deviation. Variance Var(X) Var(X) = The Mean of the Squared Deviations from the Mean (i.e. the average squared distance from the mean) Var(X)=Sum of Squared Deviations from the Sample Mean Sample size Formula: Var(X)=

Variance Var(X) Var(X) = The Mean of the Squared Deviations from the Mean (i.e. the average squared distance from the mean) Standard Deviation s or  Standard Deviation Var(X)=Sum of Squared Deviations from the Sample Mean Sample size Formula: Var(X)=

Variance Var(X) Var(X) = The Mean of the Squared Deviations from the Mean (i.e. the average squared distance from the mean) Standard Deviation s or  Standard Deviation Formula: A measure of the average distance from the mean. Var(X)=Sum of Squared Deviations from the Sample Mean Sample size Formula: Var(X)=  Now look at the class height data, and calculate the variance & SD. HW: Do Mean & SD worksheet + Sigma (old) Ex

LESSON 2– Standard Deviation applications: Points of today:  Develop an understanding of what variance and standard deviation tell us. STARTER: Mark HW worksheet. 1.Alternative formula for variance & standard deviation. 2.Sigma (old – 2 nd edition): Pg. 151 – Ex Types of data – discrete & continuous. 4.Sigma (old – 2 nd edition): Pg. 157 – Ex (complete for HW).

Variance & standard deviation The formula: Var(X)= can be re-arranged to get: Var(X) = So Standard Deviation can be given by: s = This version of the formula is quicker to use because you don’t have to calculate the distance from the mean for each individual value. Caution: When calculating the SD from a freq. table, remember to multiply each x value by the number of times it occurs (its frequency)! Copy, then do Sigma (old) - pg. 151: Ex

Variance formula proof: Link between the 2 variance formulas: Prove that = Left hand side = = = = = = = = Right hand side

LESSON 3– Grouped data & frequency: Points of today:  Interpret displays of grouped data based on relative frequency. 1.Difference between discrete and continuous data. 2.How to calculate (estimate) the mean and standard deviation of grouped data displayed on a frequency table. Sigma (old – 2 nd edition): Ex. 10.4

Discrete and Continous data DiscreteContinuous

Discrete and Continous data DiscreteContinuous Where does its value come from?

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take?

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values What question is being asked?

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values What question is being asked? ‘How many ?’

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values What question is being asked? ‘How many ?’ Examples:

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values What question is being asked? ‘How many ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents.

Discrete and Continous data DiscreteContinuous Where does its value come from? Counting What values can it take? Whole numbers or rounded values What question is being asked? ‘How many ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents.

Discrete and Continous data DiscreteContinuous Where does its value come from? CountingMeasurement What values can it take? Whole numbers or rounded values What question is being asked? ‘How many ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents.

Discrete and Continous data DiscreteContinuous Where does its value come from? CountingMeasurement What values can it take? Whole numbers or rounded values All real numbers (anywhere along the number line – infinite precision) What question is being asked? ‘How many ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents.

Discrete and Continous data DiscreteContinuous Where does its value come from? CountingMeasurement What values can it take? Whole numbers or rounded values All real numbers (anywhere along the number line – infinite precision) What question is being asked? ‘How many ?’‘How long ?’, ‘ How heavy ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents.

Discrete and Continous data DiscreteContinuous Where does its value come from? CountingMeasurement What values can it take? Whole numbers or rounded values All real numbers (anywhere along the number line – infinite precision) What question is being asked? ‘How many ?’‘How long ?’, ‘ How heavy ?’ Examples: Number of students who gain Excellence in the first test. Money – since we count it in dollars and cents. Height Distance Weight Volume Time Copy then do Sigma (old) - pg. 157: Ex Q1 and 2.

Discrete and Continous data DiscreteContinuous Where does its value come from? What values can it take? What question is being asked? Examples:

Calculating the mean & standard deviation from a frequency table

Calculating the mean & standard deviation when data is grouped into intervals

Mean & std. deviation on your calc. On a Graphics Calculator (GC): Type: MENU, STAT, CALC (F2), SET (F6): 1VarXList:List 1 (F1) 1VarFreq:1 (F1) EXIT (goes back) Now enter the data into List 1. Press 1Var  : mean  x : sum of all values  x 2 : sum of the squares of all values x  n : standard deviation. On a Scientific Calculator: MODE 2 (puts it into STAT mode) Now enter the first data-value and press M+ (means add to memory) Repeat, pressing M+ after each: E.g.: 4 M+ 7 M+ 8 M+ etc. Every time you’ve entered in a new value it will tell you the number of values now saved. (e.g. n=3). Once all are entered, type “=“, then: SHIFT 2: 1: for the mean; 2: ( x  n ) for the SD. SHIFT Mode: Clears the memory