Communicating Quantitative Information Everybody to take the PSAT Homework: Look up reports on school test scores, especially trends. Assess reports. Quiz.

Slides:



Advertisements
Similar presentations
Discrete Data Distributions and Summary Statistics Terms: histogram, mode, mean, range, standard deviation, outlier.
Advertisements

Introduction to Summary Statistics
Standard Deviation and Standard Error Tutorial
Frequency Distribution and Variation Prepared by E.G. Gascon.
Calculating & Reporting Healthcare Statistics
PSY 307 – Statistics for the Behavioral Sciences
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Measures of Variability
Distributions When comparing two groups of people or things, we can almost never rely on a single comparison Example: Are men taller than women?
Introduction to Educational Statistics
Data observation and Descriptive Statistics
Quantitative Genetics
Central Tendency and Variability
Measures of Central Tendency
Understanding Research Results
Statistics Used In Special Education
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Objective To understand measures of central tendency and use them to analyze data.
Dr. Serhat Eren DESCRIPTIVE STATISTICS FOR GROUPED DATA If there were 30 observations of weekly sales then you had all 30 numbers available to you.
Part II Sigma Freud & Descriptive Statistics
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Chapter 3 Descriptive Measures
Descriptive Statistics Descriptive Statistics describe a set of data.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data I can: calculate and interpret measures of spread MSIP/Home Learning: p. 168 #2b, 3b, 4, 6, 7,
Statistics: For what, for who? Basics: Mean, Median, Mode.
Conducting Descriptive Statistics Dr. K. A. Korb University of Jos.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Chapter 8 Quantitative Data Analysis. Meaningful Information Quantitative Analysis Quantitative analysis Quantitative analysis is a scientific approach.
Summary Statistics: Mean, Median, Standard Deviation, and More “Seek simplicity and then distrust it.” (Dr. Monticino)
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Skewness & Kurtosis: Reference
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
REVIEW OF UNIT 1 1) The table displays the number of videos rented. Number of Videos Rented Number of Families a. How many families.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Central Tendency & Dispersion
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Chapter 6: Analyzing and Interpreting Quantitative Data
RESEARCH & DATA ANALYSIS
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Averages and Variability
STATISICAL ANALYSIS HLIB BIOLOGY TOPIC 1:. Why statistics? __________________ “Statistics refers to methods and rules for organizing and interpreting.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Minds on! Two students are being considered for a bursary. Sal’s marks are Val’s marks are Which student would you award the bursary.
3.3 Measures of Spread Chapter 3 - Tools for Analyzing Data Learning goal: calculate and interpret measures of spread Due now: p. 159 #4, 5, 6, 8,
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
INTRODUCTION TO STATISTICS
Descriptive Statistics
Univariate Statistics
Central Tendency and Variability
Introduction to Summary Statistics
Numerical Measures: Centrality and Variability
Summary descriptive statistics: means and standard deviations:
Introduction to Summary Statistics
DESCRIBING A POPULATION
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
Module 8 Statistical Reasoning in Everyday Life
Introduction to Summary Statistics
Introduction to Summary Statistics
Measures of Central Tendency “Where is the Middle?”
Introduction to Summary Statistics
Summary descriptive statistics: means and standard deviations:
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Summary Statistics
Presentation transcript:

Communicating Quantitative Information Everybody to take the PSAT Homework: Look up reports on school test scores, especially trends. Assess reports. Quiz Wednesday – Short Answer on Definitions

Quick Review Statistics must always be analyzed Qualitatively as well as Quantitatively – what, if anything can they tell us. Batting Statistics (38 Separate Ones)-- tics#Batting_statistics tics#Batting_statistics

Quick Review II Purchase College Potential Growth Ann. Increase 0.05 Year Population

Quick Review III Purchase College Potential Growth Ann. (show formulae -- Increase 0.05 Year Population =A6+1 =B6*1.05 =A7+1 =B7*1.05 =A8+1 =B8*1.05 =A9+1 =B9*1.05 =A10+1 =B10*1.05 =A11+1 =B11*1.05 =A12+1 =B12*1.05 =A13+1 =B13*1.05 =A14+1 =B14*1.05 =A15+1 =B15*1.05 =A16+1 =B16*1.05 =A17+1 =B17*1.05 =A18+1 =B18*1.05 =A19+1 =B19*1.05

Real story Fox Lane High School (Bedford Central School District) meeting Principal announces: all juniors will take the PSAT –fee paid by school –done during school hours [ A parent] says, "nice and everyone will accept 'scores' going down. –What did Dr. Meyer mean? Why did she assume this to be true?

Background: Measures of Centrality How to talk about a set of numbers? How to compare sets of numbers? Mean Median Mode Standard Deviation Other ways, including charts

Mode value that occurs the most times 2, 4, 4, 4, 6, 7, 8, 9 The mode is 4 can have multiple values 2, 2, 4, 4, 6, 7, 8, 9 modes 2 and 4 Our little examples may not have a unique mode—no instance repeated means each value is a mode.

Mean (average) … of N numbers is the sum / N sum = t1 + t2 + …. tN mean = sum/n n * mean = sum As if you had N occurrences of the mean

Examples What is the mean of: 30, 66, 78, 90? Same as the mean of: 60, 66, 78, 60 66, 66, 66, 66 Do same for 48, 55, 75, 92

Mean, continued 30, 66, 78, 90 situation (mean is 66) If these are class grades (assuming equal weighting) and you make 70 on the next project, will your average go up, down or stay the same? By how much?

Median Put the numbers in order If odd number of numbers, the median is the middle number If even, the median is the mean of the two middle numbers. The median is the number such that half the numbers are >= and half the numbers are <=. It is the number in the middle Think of the median line strip in a road.

Median calculations 30, 66, 78, 90? The median is –mean (average) of 66 and 78 is 72 (72 is 6 more than 66 and 6 less than 78) Median of 30, 66, 70, 78, 90 is 70 Median of –66000, , , , –800000, , , ,

Median vs Mean vs Mode No fixed relationship In so-called normal distribution, median, mean and mode are the same –The value that occurs the most (mode) is the average value and is the value in the middle when the values are sorted. –Normal distribution also is a certain shape

Housing prices Median is preferred measurement. Why?

Housing, continued Typical situation is Westchester –many months, there is one house sale of a very expensive house. If sales other than this sale are –300000, , , , , , Mean is ? Median is ?

Measures of centrality for example Mean is Median is Now, say one house sold for , , , , , , , What is new mean and new median? – median is average of and –mean is…. ( )/

Housing, continued Mean is Median is What is a better indicator of sale prices of houses?

[Young] Women earning more than men in NYC Study was on MEDIANs –Similar point can be made with mean, but not as simple Underlying issue is that there are 2 times 2 populations (at least) –Female college graduates, male college graduates, other females, other males Posting opportunity: find original article by Andrew A. Beveridge, Gotham Gazette, summarize, explain, comment.

Standard deviation Measure of spread of data The range is the highest – lowest. The range also is a measure of the spread. –doesn't distinguish between one 'outlier' and many SD is roughly, the average distance from the mean Take the difference between each item and the mean. Square it. Add. Divide by the number of items. Variance = (Σ(x i -m) 2 )/n) –Squaring the difference makes entries less than mean contribute the same as entries greater than mean. Standard Deviation is the square root of the variance Variance and Standard Deviation are each single numbers telling us something about the data.

Standard Deviation Example Two Sets of Data 1,2,3,4,51,3,3,3,5 Range (highest – lowest) 5 – 1 = 45 – 1 = 4 Median 15 / 5 = 315 / 5 = 3 Yet the data is quite different

Standard Deviation Example II 1,2,3,4,51,3,3,3,5 Subtract the data points from the Median = = -2 2 – 3 = = 0 3 – 3 = = 0 4 – 3 = = 0 5 – 3 = 2 The sum of the answers is 0

Standard Deviation Example III So we need another way – we square the answers = -2 = = -2 = 4 2 – 3 = -1 = = 0 = 0 3 – 3 = 0 = = 0 = 0 4 – 3 = 1 = = 0 = 0 5 – 3 = 2 = 45 – 3 = 2 = 4

Standard Deviation Example IV The “Sum of the Squares” / n (the number of items) is the Variance 10 / 5 = 28 / 5 = 1.6 The Standard Deviation is the Square Root of the Variance (Excel function: =SQRT (cell reference) Square Root of 2 = ; of 1.6 =

Standard Deviation Example V The Variance formula used (The “Sum of the Squares” / n (the number of items)) is when the entire population is being analyzed; if it is a sample (more on that later), the formula is The “Sum of the Squares” / (n – 1) (one less than the number of items)

Standard Deviation Calculation Calculating Standard Deviation -- L2NVo L2NVo With Excel -- qCYBk&feature=related qCYBk&feature=related

Standard Distributions In a “Normal Distribution” of a lot of data, 68% of the data will fall between 1 Standard Deviation (+ / -) of the mean; 95% of the data will fall between 2 Standard Deviations (+ / -) of the mean; 99.7% of the data will fall between 3 Standard Deviations (+ / -) of the mean;

Distributions go back to looking at all the data A distribution keeps track of how many occurrences of each number (or each of a set of ranges).

Class exercise? Height Hours watching TV since last class –Reporting error? Number of CDs Change ??

Class exercise, continued Determine unit or range Chart graph

Normal distribution Many things but not everything! are distributed normally Median is mean is mode Dip (inflection point)

Normal distributions Can be fat or thin …. Smaller variance/std deviation Larger variance/std deviation These are continuous curves as if there were quantities at every X point

Distributions What does the set of numbers … look like? Normal Uniform = every value occurs the same number of times Bi-modal = 2 normals next to each other Bath tub = upside down normal Or something else or nothing in particular

Two distributions , , , , , –What is mean? What is median? , , , , , –What is mean? What is median?

….two distributions Same 2 measures but very different distributions

Back to the PSAT story Why did I say…scores would go down? It is/was not definite, but pretty likely… Population that chose to take the PSAT when it was harder to take was more likely to be better prepared. New procedure added….more at low end, more 'low outliers'….

General principle Sample versus (whole) population The juniors who took the test when it was an individual choice were a sample of the population. The new policy was to include the whole population. How can you characterize the (old) sample? –more partial to going to college….

Questions to ask What is the denominator: Is this a whole population or a sample? If a sample, what are factors controlling the sample? More on this later….

Real story: Library bond In the run-up to election day (Nov 8, 2005) for a bond resolution –Village of Mt. Kisco to borrow money to build new library Issue: what will the cost be to the taxpayer to re- pay the bond? Answer: depends on [your home] assessment –For example, home assessed at $33,000 (which is the median assessment) would pay $124. –If your home is assessed at more, you would pay proportionately more, if less, you would pay less

Problem Definition and context Mt. Kisco has two systems of assessments: Village and Town. Village is less than Town and both are much, much less than 'market value' The $33,000 figure caused real confusion! Attempted to get reporter to either omit the number OR say more in the article. Not sucessful. Did succeed in getting costs of the status quo (not building a new library) into news, publicity.

Puzzles 1) A bat and a ball cost $1.10 in total. The bat costs $1 more than the ball. How much does the ball cost? 2) If it takes five machines five minutes to make five widgets, how long would it take 100 machines to make 100 widgets? 3) In a lake, there is a patch of lily pads. Every day, the patch doubles in size. If it takes 48 days for the patch to cover the entire lake, how long would it take for the patch to cover half the lake?

Puzzles from study on risk New York Times article by Virginia Postrel ene.html?_r=1 about study by Shane Frederick mit.edu/people/shanefremit.edu/people/shanefre/publications.htm Getting answers right (in one study among college students) correlated with willingness to take risk Not clear if distinction was made regarding level of risk.

Puzzle Sock drawer holds: 10 white socks, 10 black socks and 1 gray sock. What is the maximum number of socks that can be removed until getting a matching pair? (a sample of size X guarantees a pair)

Puzzle 32 cards are dealt from a well-shuffled deck of 52 cards. The deck contains 26 red and 26 black cards. What is the difference between the number of black cards among the 32 dealt and the red cards remaining in the deck?

Homework Study Definitions for Quiz Keep up with postings. –Find multiple sources on same topic Look up SAT or PSAT or other educational tests and comment.