Statistics Math 416. Game Plan Introduction Introduction Census / Poll / Survey Census / Poll / Survey Population – Sample – Bias Population – Sample.

Slides:



Advertisements
Similar presentations
Brought to you by Tutorial Support Services The Math Center.
Advertisements

Grade 10 Mathematics Data handling.
15-Apr-15Created by Mr. Lafferty1 Statistics Mode, Mean, Median and Range Semi-Interquartile Range ( SIQR ) Nat 5 Quartiles Boxplots.
Statistics Unit 6.
Statistics It is the science of planning studies and experiments, obtaining sample data, and then organizing, summarizing, analyzing, interpreting data,
Unit 6 Data and Statistics Review Game. Please select a Team Nemo 2.Dory 3.Bruce 4.Squirt 5.Jacques.
Dual Tragedies in the B-ham Paper. Module 2 Simple Descriptive Statistics and Univariate Displays of Data A Tale of Three Cities George Howard, DrPH.
“Teach A Level Maths” Statistics 1
Intro to Statistics         ST-L1 Objectives: To review measures of central tendency and dispersion. Learning Outcome B-4.
1.2: Describing Distributions
Lecture 6: Descriptive Statistics: Probability, Distribution, Univariate Data.
Unit 4 – Probability and Statistics
Statistics and Probability Grade 6
Introduction to Data Analysis. Defining Terms  Population- the entire group of people or objects that you want information about  Sample- specific part.
Statistical analyses. SPSS  Statistical analysis program  It is an analytical software recognized by the scientific world (e.g.: the Microsoft Excel.
Statistics Used In Special Education
Department of Quantitative Methods & Information Systems
Measures of Central Tendencies Week # 2 1. Definitions Mean: or average The sum of a set of data divided by the number of data. (Do not round your answer.
UNIT 8:Statistical Measures Measures of Central Tendency: numbers that represent the middle of the data Mean ( x ): Arithmetic average Median: Middle of.
Statistics Math 314 Game Plan Introduction Introduction Presentation Presentation Line graph Line graph Pie graph Pie graph Pictograph Pictograph Bar.
Review Measures of central tendency
Unit 1 Chapter 7 Section 5 Day One: Box and Whisker Plots pg. 394.
Unit 6 Data and Statistics Review Game. Please select a Team Nemo 2.Dory 3.Bruce 4.Squirt 5.Jacques.
Box – and – Whisker Plots. -a method of displaying and interpreting a data set -data is first arranged into numeric order ( small to large )
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Confidential2 Warm Up Find the mean, mode (s), and median for each set of data 1.90, 92, 94, 91, 90, 94, 95,98 93, 90 and 94, , 9.1, 8.9, 9.0,9.3,
Statistics 2. Variables Discrete Continuous Quantitative (Numerical) (measurements and counts) Qualitative (categorical) (define groups) Ordinal (fall.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
1)Construct a box and whisker plot for the data below that represents the goals in a soccer game. (USE APPROPRIATE SCALE) 7, 0, 2, 5, 4, 9, 5, 0 2)Calculate.
Do Now Find the mean, median, mode, and range of each data set and then state which measure of central tendency best represents the data. 1)2, 3, 3, 3,
Grade 8 Math Project Kate D. & Dannielle C.. Information needed to create the graph: The extremes The median Lower quartile Upper quartile Any outliers.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Median, Quartiles, Inter-Quartile Range and Box Plots. Measures of Spread Remember: The range is the measure of spread that goes with the mean. Mean =
Measures of Variability: “The crowd was scattered all across the park, but a fairly large group was huddled together around the statue in the middle.”
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Welcome to MDM4U (Mathematics of Data Management, University Preparation)
Single middle value The Median The median is the middle value of a set of data once the data has been ordered. Example 1. Robert hit 11 balls at Grimsby.
1 Research Methods in Psychology AS Descriptive Statistics.
Welcome to MDM4U (Mathematics of Data Management, University Preparation)
Collecting  Describing  Summarizing.   Statistics is a set of data (information) that has been collected. The data can be categorical or numerical.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
MM2D1: Using sample data, students will make informal inferences about population means and standard deviations b. Understand and calculate the means and.
Mean, Median, Mode & Range Outlier An outlier is a data item that is much higher or much lower than items in a data set. 1, 2, 5, 27, 3, 4.
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
An Introduction to Statistics
Chapter 1: Exploring Data
One-Variable Statistics
Statistics Unit Test Review
Range, Mean, Median, Mode Essential Question: How do we take a random sample, and what statistics can we find with the data? Standard: MM1D3.a.
Single Variable Data Analysis
Statistics Collecting and analyzing large amounts of numerical data
DS5 CEC Interpreting Sets of Data
Measures of Central Tendencies
Theme 4 Describing Variables Numerically
Good research questions
Measures of Central Tendency
Welcome!.
Prepared by: C.Cichanowicz, March 2011
Box-and-Whisker Plots
Ticket in the Door GA Milestone Practice Test
Ticket in the Door GA Milestone Practice Test
Two Way Frequency Table
Ch. 12 Vocabulary 9.) measure of central tendency 10.) outlier
Skills test Numeracy support
Review of 6th grade material to help with new Statistics unit
USING STATISTICS TO DESCRIBE GEOGRAPHICAL DATA
Presentation transcript:

Statistics Math 416

Game Plan Introduction Introduction Census / Poll / Survey Census / Poll / Survey Population – Sample – Bias Population – Sample – Bias Sample Proportion Sample Proportion Mean Median Mode Mean Median Mode Box and Whisker Plot Box and Whisker Plot Box and Whisker Interpretation Box and Whisker Interpretation

Stats Intro There are lies, there are damn lies and then there are statistics There are lies, there are damn lies and then there are statistics - Mark Twain - Mark Twain The goal is by the use of number describe a characteristic of a population. The goal is by the use of number describe a characteristic of a population. The idea is to win your argument by providing facts and too many people consider statistics to be absolute facts. The idea is to win your argument by providing facts and too many people consider statistics to be absolute facts.

Stats Intro In general, most people do not understand statistics. In general, most people do not understand statistics. Hypothesis: Student A has a school average of 10% Hypothesis: Student A has a school average of 10% Conclusion: Student A is a bad person. Conclusion: Student A is a bad person. The statistic does not measure the person’s goodness or badness. The statistic does not measure the person’s goodness or badness. What does that statistic mean? What does that statistic mean? If all there marks were the same for all courses, it would be 10% If all there marks were the same for all courses, it would be 10%

Statistics Life is a continual battle to get your ideas across and have other people trying to get their ideas across to you. Life is a continual battle to get your ideas across and have other people trying to get their ideas across to you. You are constantly being bombarded by arguments and statistics. You are constantly being bombarded by arguments and statistics. 1. Commercials 2. Teachers To understand the world around you, need to be aware of statistics meaning and reliability. To understand the world around you, need to be aware of statistics meaning and reliability. Where do statistics come from? Where do statistics come from?

Population First we establish the population. First we establish the population. Population: the complete group that we are investigating Population: the complete group that we are investigating Characteristic: A particular identifying object exhibited by the population Characteristic: A particular identifying object exhibited by the population i.e. hair colour favorite colour math knowledge, political opinion etc.

Population The next problem is interpreting how to measure a characteristic and obtain the data. The next problem is interpreting how to measure a characteristic and obtain the data. Obtaining the Data: Three methods Obtaining the Data: Three methods Method #1: Ask the whole population - Called a census - Problems – hard to do – depending on population

Census Method #1: Ask the whole population Method #1: Ask the whole population - Called a census - Problems – hard to do – depending on population

Poll Method #2: - Ask a representative “sample” of the population Method #2: - Ask a representative “sample” of the population - Called a poll Problems: representative may be tricky

Survey Method #3: Ask only experts of the population Method #3: Ask only experts of the population - called a survey Problems: who is an expert Representative sample?

Bias Bias If data is obtained or presented in an unfair manner than all conclusions are not correct. The results are said to be biased (or unfair). If data is obtained or presented in an unfair manner than all conclusions are not correct. The results are said to be biased (or unfair). In collection How and who you ask is the main source of bias How and who you ask is the main source of bias There are 4 types of bias (bad sampling, non pertinence, wording of question & attitude of pollster). There are 4 types of bias (bad sampling, non pertinence, wording of question & attitude of pollster).

Bias Eg asking 5 yr olds their favorite beer Eg asking 5 yr olds their favorite beer Bad sampling Bad sampling Eg Do you like to play an instrument? (to find favorite color) Non-pertinence Non-pertinence Eg man I hate Bush, are you in favor of war? Wording of questions Wording of questions Eg a policeman asking were you speeding? Attitude of pollster Attitude of pollster

Presentation Bias In presentation, imagine you disregard a grade level and claim that they do not matter in a school’s decision. In presentation, imagine you disregard a grade level and claim that they do not matter in a school’s decision. I need to prove my product is the best, how can I get these numbers to show that? I need to prove my product is the best, how can I get these numbers to show that?

Buy This Stock! $400 $300 $0 $100 $200 $500 JanAprilMarchFebJan Not! A statistical presentation is always biased Stencil #1-3

Representative Sample Creating a representative sample can be an art form in itself. The sample should be in all the same proportions, an impossibility. Creating a representative sample can be an art form in itself. The sample should be in all the same proportions, an impossibility. You must focus on the characteristics (the poll or survey is focusing on!) You must focus on the characteristics (the poll or survey is focusing on!)

Representative Sample Consider a school has 50 boys and 25 girls and a representative sample of 10 needs to be created. Consider a school has 50 boys and 25 girls and a representative sample of 10 needs to be created. We note the population is described in terms of boys and girls hence we will need to create our sample on that basis We note the population is described in terms of boys and girls hence we will need to create our sample on that basis Three steps Three steps

Representative Sample 1) Relative (by percent) 50/75 = 67% 25/75 = 33% n = 75 2) Theory - sample = 10 10x.67=6.7 10x.33 = 3.3 Difficult to get.7 or.3 of a person! 3) Reality 7 3 Total of 10 & has added bias

Some Rules If it starts at zero it stays at zero If it starts at zero it stays at zero If it appears to be zero be careful! If it appears to be zero be careful! Make decisions on a category not overall Make decisions on a category not overall

Creating a Sample 1) Given the following, create a sample of 10 Hudson Non-Hudson Hudson Non-Hudson Young n = 109 Relative Middle Aged Old MA 18% 22% 0 28% 30% Y0 0Is it really 0 people?

Creating a Sample Y open here Reality MA O Y 0 0 Theory MA O Stencil 4,5, 6 Do relative, theory and reality for #4; in #5 & 6 put theory & relative together

Statistics Central Tendency - Mean Mean means Mean meansthe average Symbol x Found by dividing the sum ∑x i by the number of elements n. i.e. x = ∑ x i Means which value would all values be equal to if they were the same i.e. (5,9,3,6) x = ∑ x i n n = ( )/4 = 5.75

Mode Symbol M Symbol M It is the number that appears the most It is the number that appears the most It is possible, not to have any or to have more than one mode It is possible, not to have any or to have more than one mode Eg (1,2,5) Eg (1,2,5) Eg (1,6,6,8) Eg (1,6,6,8) Eg (1,3,3,4,4,8) Eg (1,3,3,4,4,8) M = (nothing repeats) M = 3 & 4 M = 6

Median Symbol M Symbol M Median is found as the middle value Median is found as the middle value Note the sample must be in order! Note the sample must be in order! There are two possibilities (odd & even) There are two possibilities (odd & even) Consider (1,5,7) n = 3 Consider (1,5,7) n = 3 Odd only 1 middle; M = 5 Odd only 1 middle; M = 5 (1,5,7,8) n = 4 (1,5,7,8) n = 4 You must find the mean of both middles (5 + 7)/2 = 6 You must find the mean of both middles (5 + 7)/2 = 6 Do #7

Box & Whisker Plot (2,5,1,6,9,8,) (2,5,1,6,9,8,) The Construction The Construction 1) Make sure your sample is in order 1) Make sure your sample is in order (1, 2,5,6,8,9) (1, 2,5,6,8,9) 2) Find the min, max & median 2) Find the min, max & median Min = 1 ; max = 9 median = 5.5 = Q 2 Min = 1 ; max = 9 median = 5.5 = Q 2 These three points will serve you as part of the box and whisker diagram. Draw it on board… These three points will serve you as part of the box and whisker diagram. Draw it on board…

Box & Whisker Plot ) Create a number line with vertical line at the 3) Create a number line with vertical line at the three points hinges three points hinges 4) Find median between min and Q 2 called Q1 4) Find median between min and Q 2 called Q1 5) Find median between Q2 and max called Q3 It is 2 and make another hinge It is 2 and make another hinge It is 8 & make another hinge. (1,2,5,6,8,9) Complete it!

Words & Facts We have broken the data into four parts called quartiles. We have broken the data into four parts called quartiles. Whiskers Min Box Whiskers Q1Q1 Q3Q3 Q2Q2 Interquartile range = Q3-Q1 Max

Words & Facts Each quartile should hold about ¼ of the data BUT you cannot be sure Each quartile should hold about ¼ of the data BUT you cannot be sure You cannot tell the mean or the mode You cannot tell the mean or the mode Do not jump to conclusions! Do not jump to conclusions! A box and whisker gives you an idea about the spread or concentration or dispersion of data A box and whisker gives you an idea about the spread or concentration or dispersion of data

Example # A general View A general View This data is very close together below 4. There is more of a spread between 4 and 11 and once again between 11 and 12. This data is very close together below 4. There is more of a spread between 4 and 11 and once again between 11 and 12. Some Questions… What is the Mean? 12 No idea

Questions What is the mode? What is the mode? What is the median? What is the median? What is the interquartile range? What is the interquartile range? How many are below 11? How many are below 11? No idea Q 2 = 4 Q 3 -Q 1 = = 8 75% but no idea of the number The lowest concentration of numbers lie where? Lowest concentration vs. highest concentration Between

Example # n = 20 Class A n = 40 Class B a) Which class did better?Hard to tell but class A b) What are the meansNo idea ¾ x /4 x 40 = 35 c) All together approximately how many were over 60%

Example #2 – More Questions Which class and which mark was the highest? Which class and which mark was the highest? Class B at approximately 97% Class B at approximately 97% Which class has lowest range? Which class has lowest range? Class A Class A = = 32 Class B Class B = = 57 Answer: Class A Answer: Class A Finish Stencil Finish Stencil