Jan. 29 “Statistics” for one quantitative variable… Mean and standard deviation (last week!) “Robust” measures of location (median and its friends) Quartiles,

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Describing Distributions with Numbers
CHAPTER 1 Exploring Data
Numerically Summarizing Data
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Looking at data: distributions - Describing distributions with numbers
1.2: Describing Distributions
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Basic Practice of Statistics - 3rd Edition
Describing Distributions Numerically
CHAPTER 2: Describing Distributions with Numbers
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Objectives 1.2 Describing distributions with numbers
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
Table of Contents 1. Standard Deviation
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Describing distributions with numbers
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Skewness & Kurtosis: Reference
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measures of Dispersion How far the data is spread out.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
Measures of Center vs Measures of Spread
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Chapter 3 Looking at Data: Distributions Chapter Three
Essential Statistics Chapter 21 Describing Distributions with Numbers.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Chapter 5 Describing Distributions Numerically.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chapter 1 Lesson 4 Quartiles, Percentiles, and Box Plots.
Describing Distributions with Numbers
Box and Whisker Plots Algebra 2.
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Organizing Data AP Stats Chapter 1.
1.3 Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Chapter 1 Warm Up .
Summary (Week 1) Categorical vs. Quantitative Variables
Basic Practice of Statistics - 3rd Edition
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Describing Distributions with Numbers
Presentation transcript:

Jan. 29 “Statistics” for one quantitative variable… Mean and standard deviation (last week!) “Robust” measures of location (median and its friends) Quartiles, IQR, five-number summary, Box plots Percentiles Transforming data… Rescale: Y = c times X Recenter: Y = X plus a other transformations adding variables to each other Standardizing data…

Population vs. Sample

NH polls, 1/26/04 - errors

A statistic is anything that can be computed from data.

STATISTICS of a single quantitative variable MEAN MEDIAN QUARTILES ( Q1, Q3 ) Five-number summary Boxplots Interquartile range PERCENTILES / QUANTILES / FRACTILES STANDARD DEVIATION VARIANCE

Statistics of one variable… Median --- middle value (when values are ranked, smallest to largest) (or, average of two middle values) “Robust” Trimmed mean Midmean Geometric mean “RMS mean”

Number of Colleges

Number of Colleges

Number of Colleges

Mean vs. Median Large tails affect the mean more than the median. So: Right-skewed distribution  Mean right of median Left-skewed distribution  Mean left of median

Colleges – Datadesk histogram median — 5 mean — 5.36

salaries median — 60,000 mean — 106,875

So, which measure of “center” is best? All the measures agree (roughly) when the distribution is symmetrical Mean has attractive mathematical properties Also, the mean is related to the total, if that’s what you care about Median may be more “typical” when the distribution is non- symmetrical A measure is “robust” if it works reasonably well under a wide variety of circumstances Medians are robust

Computing percentiles To calculate 20-th percentile: Rank the values from smallest to largest Compute 20% of n… 20% of 72 = 14.4 Count off that many values (from lowest)… The value at which you stop is the 20-th percentile. What if you stop between values ?

Number of Colleges

QUARTILES Lower quartile (Q1) = 25-th percentile Upper quartile (Q3) = 75-th percentile ( What’s Q2 ? ) INTERQUARTILE RANGE ( IQR ) = Q3 minus Q1

Five-number summary — maximum (or, say, 95 %ile) — Q3 — median — Q1 — minimum (or, say, 5 %ile)

Linear Transformations If you MULTIPLY or DIVIDE a variable by a constant… Y = c times X Y = X / c then… measures of center are multiplied or divided by c measures of spread are multiplied or divided by |c| If you ADD or SUBTRACT a constant from a variable… Y = X + aY = X – a then… measures of center are increased (decreased) by a measures of spread are UNCHANGED.

More transformations ADDING VARIABLES: W = X + Y Mean(W) = Mean(X) + Mean(Y) Standard Deviation of (W) — anything can happen OTHER TRANSFORMATIONS: Y = X squared ? Y = log(X) ? …NO RELIABLE RULES for mean or std. dev.

Standardized Variables Write and S for mean, standard deviation of X Then form transformed variable: Z = (X - ) / S Then… mean (Z) = 0 std dev (Z) = 1 Z answers the question: How many standard deviations is this value above (or below) the mean?