CHAPTER 2 Modeling Distributions of Data

Slides:



Advertisements
Similar presentations
Changing the unit of measurement
Advertisements

CHAPTER 2 Modeling Distributions of Data
2.1 Describing Location in a Distribution. Measuring Position: Percentiles One way to describe the location of a value in a distribution is to tell what.
1.1 Displaying Distributions with Graphs
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.1 Describing.
Section 2.1 Describing Location in a Distribution
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.1 Describing.
H OW T ALL A RE YOU ?. 2.1A D ESCRIBING L OCATION IN A D ISTRIBUTION  Using percentiles  Making an Ogive  Calculating and Interpreting a z -score.
Chapter 2: Modeling Distributions of Data
2.1B D ESCRIBING L OCATION IN A D ISTRIBUTION TRANSFORM data DEFINE and DESCRIBE density curves.
Section 2.1 Part 2: Transforming Data, Density Curves.
+ Chapter 2: Modeling Distributions of Data Section 2.1 Describing Location in a Distribution The Practice of Statistics, 4 th edition - For AP* STARNES,
Chapter 3: Transformations of Graphs and Data Lesson 6: Scale Changes of Data Mrs. Parziale.
TRANSLATIONS OF DATA CHAPTER 3 LESSON 3 VOCABULARY Translation- a transformation that maps each x i to x i +h, where h is a constant Invariant- Unchanged.
 By the end of this section, you should be able to: › Find and interpret the percentile of an individual value within a distribution of data. › Estimate.
SWBAT: Describe the effect of transformations on shape, center, and spread of a distribution of data. Do Now: Two measures of center are marked on the.
Chapter 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Transforming Data.
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Sections 2.3 and 2.4.
CHAPTER 2 Modeling Distributions of Data
6.2 Transforming and Combining Random Variables
Describing Location in a Distribution
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Good Afternoon! Agenda: Knight’s Charge-please get started Good things
Reasoning in Psychology Using Statistics
Ninth grade students in an English class were surveyed to find out about how many times during the last year they saw a movie in a theater. The results.
Describing Location in a Distribution
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Linear transformations
1.2 Describing Distributions with Numbers
CHAPTER 6 Random Variables
Chapter 2: Modeling Distributions of Data
Day 2 - Chapter 2.1 Transformations & Density Curves.
EQ: What effect do transformations have on summary statistics?
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Describing Location in a Distribution
Chapter 2: Modeling Distributions of Data
2.1: Describing Location in a Distribution
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Linear Transformations
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

CHAPTER 2 Modeling Distributions of Data 2.1 (part 3) Describing Location in a Distribution

Transforming Data Transforming converts the original observations from the original units of measurements to another scale. Transformations can affect the shape, center, and spread of a distribution. Effect of Adding (or Subtracting) a Constant Adding / Subtracting the same number “a” to / from each individual observation: Is the same as if you add/subtract “a” to measures of center and location (mean, median, quartiles, percentiles) Does not change the shape of the distribution or measures of spread (range, IQR, standard deviation).

Example #1: Teachers’ Salaries A school system employs teachers at salaries between $30,000 and $60,000. The teacher’s union and the school board are negotiating the form of next year’s increase in the salary schedule. Suppose that every teacher is given a flat $1000 raise. How much will the MEAN salary increase? How much will the MEDIAN salary increase? Will a flat $1000 raise increase the spread as measured by the distance between the quartiles? Will a flat $1000 raise increase the spread as measured by the standard deviation of the salaries?

Example #2: Estimating Room Width Soon after the metric system was introduced in Australia, a group of students was asked to guess the width of their classroom to the nearest meter. The dotplot labeled “guess m” shows their responses. Later, the students found out that the actual width of the room was 13 meters. We can now examine the distribution of students’ guessing errors by defining a new variable as follows: error = guess − 13 That is, we’ll subtract 13 from each observation in the data set. Try to predict what the shape, center, and spread of this new distribution will be.

did NOT affect the shape or spread of the distribution. From the figure below, it seems clear that subtracting 13 from each observation did NOT affect the shape or spread of the distribution. But this transformation appears to have decreased the center of the distribution by 13 meters. The summary statistics in the table below confirm our beliefs. n Mean sx Min Q1 M Q3 Max IQR Range Guess(m) 44 16.02 7.14 8 11 15 17 40 6 32 Error (m) 3.02 -5 -2 2 4 27

Transforming Data Transforming converts the original observations from the original units of measurements to another scale. Transformations can affect the shape, center, and spread of a distribution. Effect of Multiplying (or Dividing) by a Constant Multiplying (or dividing) each observation by the same number “b”: Multiplies/divides measures of center and location (mean, median, quartiles, percentiles) by “b” Multiplies/divides measures of spread (range, IQR, standard deviation) by |b|, does not change the shape of the distribution

Example #3: Teachers’ Salaries (revisited) Suppose that instead of teachers getting a flat $1000 raise, they will receive a 5% raise, and the amount of the raise will vary from $1500 to $3000, depending on present salary. Will a 5% raise across the board increase the spread of the distribution as measured by the distance between the quartiles? Do you think it will increase the standard deviation?

Example #4: Estimating Room Width (revisited) Because our group of Australian students is having some difficulty with the metric system, it may not be helpful to tell them that their guesses tended to be about 2 to 3 meters too high. Let’s convert the error data to feet before we report back to them. There are roughly 3.28 feet in a meter. So for the student whose error was -5 meters, that translates to: To change the units of measurement from meters to feet, we multiply each of the error values by 3.28. What effect will this have on the shape, center, and spread of the distribution?

The dotplots below show the students’ guessing errors in meters and in feet, along with summary statistics from computer software. The shape of the two distributions is the same….they are both skewed to the right and they are both bimodal. The centers and spreads of the two distributions are quite different. Can you see that the measures of center were multiplied by 3.28? If we multiply the all the individual observations by 3.28, then the mean, median, standard deviation, IQR, and the range should also be multiplied by a factor of 3.28. (** The variance would be multiplied by )

Check your understanding: Temperature What happens if we transform a data set by both adding/subtracting a constant AND multiplying/dividing by a constant? Convert temperature data from Celsius to Farenheit, we have to use the formula Ex: If the mean temp is 8.43 degrees in Celsius, find the mean temperature in Fahrenheit. Ex: If the std dev is 2.27 degrees Celsius, calculate the std dev of the temperature in Fahrenheit. If the 93rd percentile is 12 degrees Celsius, calculate the 93rd percentile in degrees Fahrenheit.

Connecting Transformations & Z-scores To standardize an observation, you subtract the mean of the distribution, and then divide by the standard deviation. What if we standardized EVERY observation in a distribution? Let’s go back to the class of Statistics tests that had a mean of 80 and a std dev of 6.07. To convert the entire class’ test results to z-scores, we would subtract 80 from each test score, and then divide by 6.07. What effect would these transformations have on the shape, center, and spread of the distribution?

Check your understanding: Taxicabs In 2010, taxicabs in NYC charged an initial fee of $2.50 plus $2 per mile. Writing an equation, fare = 2.50 + 2(miles) At the end of the month, a businessman collects all his taxicab receipts and calculates the mean fare he paid was $15.45, and the standard deviation was $10.20. Problem: What are the mean and standard deviation of the lengths of his cab rides in MILES?

Describing Location in a Distribution FIND and INTERPRET the percentile of an individual value within a distribution of data. ESTIMATE percentiles and individual values using a cumulative relative frequency graph. FIND and INTERPRET the standardized score (z-score) of an individual value within a distribution of data. DESCRIBE the effect of adding, subtracting, multiplying by, or dividing by a constant on the shape, center, and spread of a distribution of data.