1 2.4 Describing Distributions Numerically – cont. Describing Symmetric Data.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Class Session #2 Numerically Summarizing Data
Measures of Dispersion
Numerically Summarizing Data
Chapter 4 Displaying and Summarizing Quantitative Data
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Statistics for the Social Sciences
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Biostatistics Unit 2 Descriptive Biostatistics 1.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
1.2: Describing Distributions
1 Numerical Summary Measures Lecture 03: Measures of Variation and Interpretation, and Measures of Relative Position.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Chapter 4 Displaying and Summarizing Quantitative Data CHAPTER OBJECTIVES At the conclusion of this chapter you should be able to: n 1)Construct graphs.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Objectives 1.2 Describing distributions with numbers
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Variability. Statistics means never having to say you're certain. Statistics - Chapter 42.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
1.1 - Populations, Samples and Processes Pictorial and Tabular Methods in Descriptive Statistics Measures of Location Measures of Variability.
Chapter 12, Part 2 STA 291 Summer I Mean and Standard Deviation The five-number summary is not the most common way to describe a distribution numerically.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Statistics Numerical Representation of Data Part 2 – Measure of Variation.
Describing distributions with numbers
Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
© Copyright McGraw-Hill 2000
Introduction to Statistics Santosh Kumar Director (iCISA)
Means & Medians Chapter 4. Parameter - Fixed value about a population Typical unknown.
Central Tendency & Dispersion
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
CHAPTER Basic Definitions and Properties  P opulation Characteristics = “Parameters”  S ample Characteristics = “Statistics”  R andom Variables.
Measures of Variation 1 Section 2.4. Section 2.4 Objectives 2 Determine the range of a data set Determine the variance and standard deviation of a population.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Section 2.4 Measures of Variation. Section 2.4 Objectives Determine the range of a data set Determine the variance and standard deviation of a population.
Numerical descriptions of distributions
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Section 2.4 Measures of Variation 1 of 149 © 2012 Pearson Education, Inc. All rights reserved.
Numerical Summaries of Quantitative Data. Means, Standard Deviations, z-scores.
Variance and Standard Deviation
MATH-138 Elementary Statistics
Numerical descriptions of distributions
Numerical Descriptive Measures
Reasoning in Psychology Using Statistics
Numerical Descriptive Measures
Chapter 5: Describing Distributions Numerically
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Chapter 1 Warm Up .
Numerical Descriptive Measures
Presentation transcript:

1 2.4 Describing Distributions Numerically – cont. Describing Symmetric Data

2 Symmetric Data Body temp. of 93 adults

3 Recall: 2 characteristics of a data set to measure n center measures where the “middle” of the data is located n variability measures how “spread out” the data is

4 Measure of Center When Data Approx. Symmetric n mean (arithmetic mean) n notation

5

6 Connection Between Mean and Histogram n A histogram balances when supported at the mean. Mean x = 140.6

7 Mean: balance point Median: 50% area each half right histo: mean yrs, median 57.7yrs

8 Properties of Mean, Median 1.The mean and median are unique; that is, a data set has only 1 mean and 1 median (the mean and median are not necessarily equal). 2.The mean uses the value of every number in the data set; the median does not.

9 Think about mean and median n Six people in a room have a median age of 45 years and mean age of 45 years. n One person who is 40 years old leaves the room. n Questions: 1.What is the median age of the 5 people remaining in the room? 2.What is the mean age of the 5 people remaining in the room? Can’t answer  6=270; =230; 230/5=46

10 Example: class pulse rates n

, 2014 baseball salaries n 2010 n = 845  = $3,297,828 median = $1,330,000 max = $33,000,000 n 2014 n = 848  = $3,932,912 median = $1,456,250 max = $28,000,000

12 Disadvantage of the mean n Can be greatly influenced by just a few observations that are much greater or much smaller than the rest of the data

13 Mean, Median, Maximum Baseball Salaries

14 Skewness: comparing the mean, and median n Skewed to the right (positively skewed) n mean>median

15 Skewed to the left; negatively skewed n Mean < median n mean=78; median=87;

16 Symmetric data n mean, median approx. equal

18 Describing Symmetric Data (cont.) n Measure of center for symmetric data: n Measure of variability for symmetric data?

19 Example n 2 data sets: x 1 =49, x 2 =51 x=50 y 1 =0, y 2 =100 y=50

20 On average, they’re both comfortable

21 Ways to measure variability 1. range=largest-smallest ok sometimes; in general, too crude; sensitive to one large or small obs.

22 Previous Example

23 The Sample Standard Deviation, a measure of spread around the mean n Square the deviation of each observation from the mean; find the square root of the “average” of these squared deviations

24 Calculations … Mean = 63.4 Sum of squared deviations from mean = 85.2 (n − 1) = 13; (n − 1) is called degrees freedom (df) s 2 = variance = 85.2/13 = 6.55 inches squared s = standard deviation = √6.55 = 2.56 inches Women height (inches)

25 1. First calculate the variance s Then take the square root to get the standard deviation s. Mean ± 1 s.d. We’ll never calculate these by hand, so make sure to know how to get the standard deviation using your calculator, Excel, or other software.

26 Population Standard Deviation

27 Remarks 1. The standard deviation of a set of measurements is an estimate of the likely size of the chance error in a single measurement

28 Remarks (cont.) 2. Note that s and  are always greater than or equal to zero. 3. The larger the value of s (or  ), the greater the spread of the data. When does s=0? When does  =0?

29 Remarks (cont.) 4. The standard deviation is the most commonly used measure of risk in finance and business –Stocks, Mutual Funds, etc. 5. Variance  s 2 sample variance   2 population variance  Units are squared units of the original data  square $, square gallons ??

30 Remarks 6):Why divide by n-1 instead of n? n degrees of freedom n each observation has 1 degree of freedom however, when estimate unknown population parameter like , you lose 1 degree of freedom

31 Remarks 6) (cont.):Why divide by n-1 instead of n? Example n Suppose we have 3 numbers whose average is 9 nx1=x2=nx1=x2= n then x 3 must be n once we selected x 1 and x 2, x 3 was determined since the average was 9 n 3 numbers but only 2 “degrees of freedom”

32 Computational Example

33 class pulse rates

34 Example #1#2#3# n x n s n m

35 Boxplots: same mean, standard deviation

36 More Boxplots of the 4 data sets

37 Review: Properties of s and  s and  are always greater than or equal to 0 when does s = 0?  = 0? The larger the value of s (or  ), the greater the spread of the data n the standard deviation of a set of measurements is an estimate of the likely size of the chance error in a single measurement

38 Summary of Notation