Lecture 6 Sections 2.1 – 2.2 Objectives: Measure of Center


Objectives:
Measure of Center for Data
Measure of Center for Distributions
Measure of Variability for Data
Measure of Variability for Distributions
The Empirical Rule (Normal Distribution)

The Sample Mean
To describe a “typical” or “representative” observation, we use the sample mean x̄ = (x1 + x2 + ... + xn)/n, the sum of the n observations divided by n.
It is the most frequently used measure of center.
It is sensitive to extreme observations (outliers).
It is useful for estimating the center when the distribution is symmetric and free of outliers.

Example
Caustic stress corrosion cracking of iron and steel has been studied because of failures around rivets in steel boilers and failures of steam rotors. Consider the accompanying observations on crack length (μm) as a result of constant load stress corrosion tests on smooth bar tensile samples for a fixed length of time. The data is from the article “On the Role of Phosphorus in the Caustic Stress Corrosion Cracking of Low Alloy Steels”, Corrosion Science, 1989: 53-68.
16.1 9.6 24.9 20.4 12.7 21.2 30.2 25.8 18.5 10.3 25.3 14.0 27.1 45.0 23.3 24.2 14.6 8.9 32.4 11.8 28.5
a. Find the mean of crack length.
b. Replace 45.0 by 295.0 and then find the mean of crack length.
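Parts (a) and (b) can be checked with a short Python sketch. The variable names are illustrative and not part of the lecture.

```python
from statistics import mean

# Crack lengths (μm) from the example.
crack_lengths = [16.1, 9.6, 24.9, 20.4, 12.7, 21.2, 30.2, 25.8, 18.5, 10.3, 25.3,
                 14.0, 27.1, 45.0, 23.3, 24.2, 14.6, 8.9, 32.4, 11.8, 28.5]

# (a) Sample mean of the original 21 observations.
mean_a = mean(crack_lengths)

# (b) Replace the largest value, 45.0, with 295.0 and recompute.
modified = [295.0 if x == 45.0 else x for x in crack_lengths]
mean_b = mean(modified)

print(f"(a) mean = {mean_a:.2f}")
print(f"(b) mean = {mean_b:.2f}")
```

A single extreme value moves the mean from about 21.2 to about 33.1, which illustrates its sensitivity to outliers.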

The Sample Median
The median is the midpoint of the observations in the ordered list, so about half of the data fall below it and half above.
If n is odd, the median is the middle value in the ordered list (that is, the ((n+1)/2)th observation).
If n is even, there is no unique middle value, and the median is the average of the middle pair of values.
It is much less sensitive to extreme observations (outliers) than the mean.
It is useful for estimating the center when the distribution is skewed.

Example
Consider the following 5 observations:
34 44 56 63 67
and the following 10 observations:
49.2 53.9 50.0 44.5 42.2 42.3 32.3 31.3 60.9 47.5
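A minimal Python sketch of the median rule follows. It treats the first five values and the remaining ten values as two separate data sets, which is an assumption about how the slide was laid out.

```python
from statistics import median

# Assumed split: a data set with odd n and a data set with even n.
odd_sample = [34, 44, 56, 63, 67]
even_sample = [49.2, 53.9, 50.0, 44.5, 42.2, 42.3, 32.3, 31.3, 60.9, 47.5]

# n = 5 is odd: the median is the 3rd value in the ordered list.
median_odd = median(odd_sample)

# n = 10 is even: the median is the average of the 5th and 6th ordered values.
ordered = sorted(even_sample)
middle_pair = (ordered[4], ordered[5])
median_even = median(even_sample)

print(median_odd, middle_pair, median_even)
```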

Trimmed Means
The 100r% trimmed mean is the mean of the observations that remain after trimming the largest 100r% and the smallest 100r% of the n observations (that is, the n·r largest and the n·r smallest values), where r is a number between 0 and 0.5.
Note: The trimmed mean is less sensitive to outliers than the mean but more sensitive than the median.
Example. Consider the following 20 observations, each representing the lifetime (hr) of a certain type of incandescent lamp:
612 623 666 744 883 898 964 970 983 1003 1016 1022 1029 1058 1085 1088 1122 1135 1197 1201
Find the 10% trimmed mean.
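The 10% trimmed mean for the lamp data can be sketched in Python as follows. The helper `trimmed_mean` is a hypothetical name, and it assumes that n·r is a whole number, as it is here (20 × 0.10 = 2).

```python
def trimmed_mean(data, r):
    """Mean after dropping the n*r smallest and n*r largest values.

    Assumes n*r is a whole number; other cases need an interpolation rule.
    """
    n = len(data)
    k = round(r * n)              # number of values trimmed from each end
    kept = sorted(data)[k:n - k]
    return sum(kept) / len(kept)

# Lamp lifetimes (hr) from the example.
lifetimes = [612, 623, 666, 744, 883, 898, 964, 970, 983, 1003,
             1016, 1022, 1029, 1058, 1085, 1088, 1122, 1135, 1197, 1201]

ordinary_mean = sum(lifetimes) / len(lifetimes)
trimmed = trimmed_mean(lifetimes, 0.10)   # drops 612, 623, 1197 and 1201

print(ordinary_mean, trimmed)
```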

Population Mean
Discrete Distributions
Definition. The mean (or expected value) of a discrete variable x is given by μ = Σ x·p(x), where the sum is taken over all possible values of x.
Example. Plastic parts manufactured using an injection molding process may exhibit one or more defects, including sinks, scratches, black spots, and so on. Let x represent the number of defects on a single part, and suppose the distribution of x is as follows:
x      0    1    2    3    4
p(x)  .80  .14  .03  .02  .01
Means for specific discrete distributions
1) If x ~ B(n, π), then μ = nπ.
2) If x ~ Poisson(λ), then μ = λ.
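As a sketch, the mean number of defects per part can be computed in Python by weighting each value of x by its probability. The dictionary name `pmf` is illustrative.

```python
# Distribution of the number of defects x on a single part (from the example).
pmf = {0: 0.80, 1: 0.14, 2: 0.03, 3: 0.02, 4: 0.01}

# The probabilities should sum to 1.
total_probability = sum(pmf.values())

# Expected value: sum of x * p(x) over all possible values of x.
mu = sum(x * p for x, p in pmf.items())

print(f"total probability = {total_probability:.2f}, mean = {mu:.2f}")
```

The mean works out to 0.30 defects per part.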

Population Mean
Continuous Distributions
Definition. The mean (or expected value) of a continuous variable x with density function f(x) is given by μ = ∫ x·f(x) dx, where the integral is taken over all possible values of x.
Example. The distribution of the amount of gravel (tons) sold by a particular construction supply company in a given week is a continuous variable x with density function f(x). Knowledge of the mean value of x will help the company decide on a price for the gravel. Find the mean of x.

Means for Specific Distributions
Continuous distributions
1) If x ~ N(μ, σ²), then the mean of x is μ.
2) If x has an exponential distribution with parameter λ, that is, density f(x) = λe^(−λx) for x ≥ 0, then the mean of x is 1/λ.
3) If x has a lognormal distribution with parameters μ and σ, then the mean of x is e^(μ + σ²/2).
4) The mean of a Weibull distribution is a somewhat complicated expression involving the parameters α and β.
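As a numerical check on the exponential case, the sketch below approximates μ = ∫ x·f(x) dx by the midpoint rule. It assumes the rate parameterization f(x) = λe^(−λx) for x ≥ 0, under which the mean is 1/λ. The function name and the truncation point 40/λ are illustrative choices.

```python
import math

def exponential_mean(lam, steps=200_000):
    """Midpoint-rule approximation of the integral of x * lam * exp(-lam * x).

    The integral is truncated at 40/lam, beyond which the density is negligible.
    """
    upper = 40.0 / lam
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h          # midpoint of the i-th subinterval
        total += x * lam * math.exp(-lam * x)
    return total * h

for lam in (0.5, 2.0):
    print(lam, exponential_mean(lam), 1 / lam)
```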

Population Median
Median for continuous distribution
The median of a continuous distribution divides the area under the density curve into two equal halves. The defining condition is ∫ f(x) dx = 0.5, with the integral taken from −∞ to m, where m denotes the median.
Example. Find the median for the distribution of weekly gravel sales.

Mean and Median
For a symmetric distribution, the mean and the median coincide. In a skewed distribution the mean is typically pulled toward the long tail: below the median for a left skew and above it for a right skew.
The median is a measure of center that is resistant to skew and outliers. The mean is not.
[Figure: density curves marking the mean and median for a symmetric distribution, a left-skewed distribution, and a right-skewed distribution.]

Measure of Variability for Data
A measure of the center is not enough to describe a distribution well.
Example. Suppose the heights of five starting basketball players on two men’s basketball teams are:
Team I (inches): 72 73 76 76 78
Team II (inches): 67 72 76 76 84
Range: the simplest measure of variability.
Range = the difference between the largest and the smallest sample values.
The range depends on only the two most extreme observations and disregards the positions of the remaining (n-2) values.
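The basketball example can be worked through in a few lines of Python. This is a sketch with illustrative names, showing that the two teams share the same center but differ in spread.

```python
from statistics import mean, median

team_1 = [72, 73, 76, 76, 78]   # heights in inches
team_2 = [67, 72, 76, 76, 84]

def sample_range(data):
    """Largest sample value minus smallest sample value."""
    return max(data) - min(data)

for name, heights in (("Team I", team_1), ("Team II", team_2)):
    print(name, mean(heights), median(heights), sample_range(heights))
```

Both teams have mean 75 and median 76, yet the ranges are 6 and 17 inches, so the center alone does not distinguish them.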

Sample Variance and Standard Deviation
The sample variance, denoted by s², is s² = Σ (xi − x̄)² / (n − 1), the sum of the squared deviations from the sample mean divided by n − 1.
The sample standard deviation, denoted by s, is the square root of the variance.
The sample variance is roughly the average squared distance of the observations from the mean.
The more spread out a distribution is around its mean, the larger its standard deviation.
The unit for s is the same as the unit for the data.
Both s² and s are sensitive to outliers.

Example
Strength is an important characteristic of materials used in prefabricated housing. Each of 11 prefabricated plate elements was subjected to a severe stress test, and the maximum width (mm) of the resulting cracks was recorded. The data is from the article “Prefabricated Ferrocement Ribbed Elements for Low-Cost Housing” (J. of Ferrocement, 1984: 347-364).
.684 2.540 .924 3.130 1.038 .598 .483 3.520 1.285 2.650 1.497
Find the sample variance and sample standard deviation.
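A Python sketch of this calculation follows. It computes the variance directly from the usual n − 1 definition and compares it with the standard library's `statistics.variance` and `statistics.stdev`, which use the same divisor.

```python
from statistics import mean, stdev, variance

# Maximum crack widths (mm) for the 11 plate elements.
widths = [0.684, 2.540, 0.924, 3.130, 1.038, 0.598,
          0.483, 3.520, 1.285, 2.650, 1.497]

n = len(widths)
x_bar = mean(widths)

# Sum of squared deviations from the mean, divided by n - 1.
s2_manual = sum((x - x_bar) ** 2 for x in widths) / (n - 1)

s2 = variance(widths)   # sample variance (divisor n - 1)
s = stdev(widths)       # sample standard deviation

print(f"mean = {x_bar:.3f}, s^2 = {s2:.4f}, s = {s:.4f}")
```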

Measure of Variability for Distributions
Population variance and standard deviation
Discrete distributions
Definition. The variance of a discrete variable x is given by σ² = Σ (x − μ)²·p(x), where the sum is taken over all possible values of x. The standard deviation is σ, the positive square root of the variance.
Example. Revisit the plastic part example. Find the variance and standard deviation.
Variances for specific discrete distributions
1) If x ~ B(n, π), then σ² = nπ(1 − π).
2) If x ~ Poisson(λ), then σ² = λ.
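For the plastic part distribution (x = 0, 1, 2, 3, 4 with probabilities .80, .14, .03, .02, .01), the variance and standard deviation can be sketched in Python using the standard definition σ² = Σ (x − μ)²·p(x). The names are illustrative.

```python
import math

# Distribution of the number of defects x on a single part.
pmf = {0: 0.80, 1: 0.14, 2: 0.03, 3: 0.02, 4: 0.01}

# Population mean: sum of x * p(x).
mu = sum(x * p for x, p in pmf.items())

# Population variance: probability-weighted squared deviations from mu.
sigma2 = sum((x - mu) ** 2 * p for x, p in pmf.items())

# Population standard deviation: positive square root of the variance.
sigma = math.sqrt(sigma2)

print(f"mu = {mu:.2f}, variance = {sigma2:.2f}, sd = {sigma:.3f}")
```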

Measure of Variability for Distributions
Population variance and standard deviation
Continuous distributions
Definition. The variance of a continuous variable x with density function f(x) is given by σ² = ∫ (x − μ)²·f(x) dx, where the integral is taken over all possible values of x. The standard deviation is σ, the positive square root of the variance.
Example. Revisit the gravel sales example. Find the variance and standard deviation of x.
Variances for specific continuous distributions
1) If x ~ N(μ, σ²), then the variance of x is σ².
2) If x has an exponential distribution with parameter λ, that is, density f(x) = λe^(−λx) for x ≥ 0, then the variance of x is 1/λ².
3) If x has a lognormal distribution with parameters μ and σ, then the variance of x is e^(2μ + σ²)·(e^(σ²) − 1).
4) The variance of a Weibull distribution is even more complicated.

The Empirical Rule
For any variable x whose distribution is well approximated by a normal curve:
Approximately 68% of the values are within 1 standard deviation of the mean.
Approximately 95% of the values are within 2 standard deviations of the mean.
Approximately 99.7% of the values are within 3 standard deviations of the mean.
[Figure: N(0,1) density curve.]

Example
Scores on an achievement test taken by all high school seniors in a certain state are known to have, approximately, a bell-shaped distribution with Mean (μ) = 64, Standard Deviation (σ) = 10.
About 68% of the scores lie in the interval from 54 to 74 (μ ± σ).
About 95% of the scores are between 44 and 84 (μ ± 2σ).
Almost all of the scores (about 99.7%) are between 34 and 94 (μ ± 3σ).
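The three intervals for the test-score example follow directly from μ ± kσ for k = 1, 2, 3. A minimal Python sketch, with illustrative names:

```python
mu, sigma = 64, 10   # mean and standard deviation of the test scores

# Approximate coverage stated by the empirical rule for k = 1, 2, 3.
coverage = {1: "68%", 2: "95%", 3: "99.7%"}

# Interval mu - k*sigma to mu + k*sigma for each k.
intervals = {k: (mu - k * sigma, mu + k * sigma) for k in coverage}

for k, (low, high) in intervals.items():
    print(f"about {coverage[k]} of scores lie between {low} and {high}")
```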

Example The time to complete a standardized exam is approximately bell shaped with a mean of 70 minutes and a standard deviation of 10 minutes. Using the empirical rule, what percent of students will complete the exam in under an hour?
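One way to reason about this question: 60 minutes is one standard deviation below the mean, about 68% of times fall within one standard deviation, and by symmetry the remaining 32% splits evenly between the two tails. The Python sketch below carries out that arithmetic and, as a cross-check that goes beyond the empirical rule, compares it with the exact normal probability from `statistics.NormalDist`.

```python
from statistics import NormalDist

mu, sigma = 70, 10   # mean and standard deviation in minutes
cutoff = 60          # "under an hour"

# Number of standard deviations between the cutoff and the mean.
z = (cutoff - mu) / sigma

# Empirical rule: about 68% within 1 sd, so half of the remaining 32% lies below.
within_one_sd = 0.68
below_empirical = (1 - within_one_sd) / 2

# Exact normal probability, assuming the distribution is exactly normal.
below_exact = NormalDist(mu, sigma).cdf(cutoff)

print(z, below_empirical, round(below_exact, 4))
```

Under the empirical rule, roughly 16% of students finish in under an hour.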