Stat 2411 Statistical Methods

Slides:



Advertisements
Similar presentations
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Advertisements

CHAPTER 1 Exploring Data
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Numerically Summarizing Data
Measures of Central Tendency 3.1. ● Analyzing populations versus analyzing samples ● For populations  We know all of the data  Descriptive measures.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Chapter 1 Exploring Data
Objectives 1.2 Describing distributions with numbers
STA Lecture 111 STA 291 Lecture 11 Describing Quantitative Data – Measures of Central Location Examples of mean and median –Review of Chapter 5.
Numerical Descriptive Techniques
1.3: Describing Quantitative Data with Numbers
Descriptive statistics Describing data with numbers: measures of location.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describing distributions with numbers
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Chapter 5: Exploring Data: Distributions Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions:
Describing Quantitative Data with Numbers Section 1.3.
Chapter 5 Describing Distributions Numerically.
Statistics 1: Introduction to Probability and Statistics Section 3-2.
Using Measures of Position (rather than value) to Describe Spread? 1.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
CHAPTER 1 Exploring Data
Lecture #3 Tuesday, August 30, 2016 Textbook: Sections 2.4 through 2.6
CHAPTER 1 Exploring Data
MAT 446 Supplementary Note for Ch 1
Describing Distributions Numerically
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Box and Whisker Plots Algebra 2.
Please take out Sec HW It is worth 20 points (2 pts
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1 Warm Up .
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Measures of Center.
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Stat 2411 Statistical Methods Chapter 3 Measures of Location

3.1 Populations and Samples Population: All conceivably possible or hypothetically possible observation Sample: The particular observations actually taken

Population Example: Temperatures of patients with meningitis. There are unlimited or infinite potential observations 100.2 101.5 100.3 Population of potential measurements

Sample Notation: value Sample n=10 104.0 100.2 100.8 108.0 104.8 102.4 104.0 100.2 100.8 108.0 104.8 102.4 104.2 103.8 101.6 101.4 Notation: value

3.2 The mean Sample mean= =Average =Center of gravity

Summation notation

Population Descriptions The Population mean is the average of all values in the population of potential values. Population mean = Population descriptions are denoted by Greek letters like Meningitis example: = average of all potential measurement of temperature of all meningitis cases.

Parameter and Statistic Population descriptions – parameters Sample descriptions – statistics Sample statistics are usually used to estimate the corresponding population parameters.

3.3 Weighted mean Weight X Homework 20 90 Exam 1 8 82 Exam 2 11 87 Final 35 83

Geometric Mean (problem 3.15) Sometimes data are analyzed in the log scale (for reasons discussed later). Geometric mean = back-transformed mean of log’s x y log10x 10y

Geometric mean Example: x 1 10 100 y 0 1 2 Algebraically equivalent formula

Harmonic Mean Back-Transformed mean of 1/x Example: x = time Y = rate X Y 1 1 10 0.1 100 0.01 Example: x = time Y = rate Current: 1 mph 15 miles 3 mph upstream 5 mph downstream Harmonic mean 30miles/5 hours up +3 hours down

3.4 The Median The median M is the midpoint of a data set. When observations are ordered from smallest to largest, M is in the middle, with half the observations smaller, half larger: 3 5 7 9 38 M = 7 If there are an even number of observations (no single midpoint), the location of the median is really between the two middle points, so the average of the two is used as the estimate of the median value. 3 5 7 9 M = 5 + 7 = 6 2

Means vs. Medians 3 5 7 9 38 M = 7 3 + 5 + 7 + 9 + 38 5 62 12.4 The two values can behave VERY differently, because the Median (M) is resistant to the magnitude of possible outliers, but the Mean ( ) is not, so it can be drawn toward them. 3 5 7 9 38 M = 7 The median remains unaffected, but the mean is drawn toward the outlier at 38. X 3 + 5 + 7 + 9 + 38 5 = 62 12.4

Mode The value that occurs most frequently Mode=108 9 10 11 12 13 06 02688888 222244666 02448 04

Fractiles Quartiles : divide data into 4 parts. Deciles : divide data into 10 parts. Percentiles: divide data into a hundred parts Among the many fractiles, quartiles are used very often in describing data. Quartiles are the values at which 25% (Q1), 50% (Q2=Median) and 75% (Q3) of the observations fall at or below them, and can be used to describe the internal variability. Like the term Center, we now need a definition of the SPREAD of the distribution. Variability - That’s where quartiles come in.

Defining the Quartiles To calculate the quartiles: 1. Arrange the observations in increasing order and locate the median M in the ordered list of observations. 2. The first quartile Q1 is the median of the observations whose position in the ordered list is to the left of the location of the overall median. 3. the third quartile Q3 is the median of the observations whose position in the ordered list is to the right of the location of the overall median. Quartiles are defined following the same rules as we followed for identifying the Median (M), which makes perfect sense since the Median (M) is the same thing as the second quartile (50th percentile).

Calculating (Identifying) the Quartiles 26 systolic blood pressure 90 96 100 102 106 108 108 108 108 108 112 112 112 112 114 114 116 116 116 120 122 124 124 128 130 134 Q1=108 Q3=120 M = (112+112)/2 M = 112 With an even number of measures, there is no single value associated with the midpoint of the distribution The median is really between the two 112’s with a value estimated by their average age, 112. Then, from the Median there are 13 points to the left, 90, 96, 100, … 112. The median of those values (Q1) is then the midpoint of the 13 numbers, which is the 7th number 108. Then, from the Median there are 13 points to the right, 112, 114, 114,…, 134. The median of those values (Q3) is the midpoint of these 13 numbers, or the 7th number counting backwards: 120.

Graphing the Five-Number Summary The Box Plot Graphing the Five-Number Summary (Min, Q1, Median, Q3, Max) Maximum (Largest Observation) Q3 (75th percentile) Values of the Variable Median M (50th percentile) Q1 (25th percentile) A box plot is a method of plotting the values of the five-number summary, from lowest to highest. ALWAYS PLOT YOUR DATA Minimum (Smallest Observation) Box plots can show very large datasets & highlight skewness Because they show less detail than histograms or stemplots, they are best used for side-by-side comparison of more than 1 dataset.

Read section 3.8 on summation notation.