A.M EASURES OF LOCATION A.M EASURES OF LOCATION B.M EASURES OF SPREAD Central tendency and measures of dispersion &

Slides:



Advertisements
Similar presentations
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Advertisements

Class Session #2 Numerically Summarizing Data
SUMMARIZING DATA: Measures of variation Measure of Dispersion (variation) is the measure of extent of deviation of individual value from the central value.
Agricultural and Biological Statistics
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
1-3 Measure of Location Just as graphics can enhance the display of data, numerical descriptions are also of value. In this section and the next, we present.
1 Measures of Location or Central Tendency This idea of Central tendency refers to the extent to which all the data values group around a typical or central.
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Chapter 3 Numerically Summarizing Data
Slides by JOHN LOUCKS St. Edward’s University.
Central Tendency and Variability
Measures of Central Tendency Section 2.3 Statistics Mrs. Spitz Fall 2008.
The arithmetic mean of a variable is computed by determining the sum of all the values of the variable in the data set divided by the number of observations.
Measures of Central Tendency
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
MEASURES of CENTRAL TENDENCY.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Chapter 3 – Descriptive Statistics
Descriptive Statistics Anwar Ahmad. Central Tendency- Measure of location Measures descriptive of a typical or representative value in a group of observations.
Statistics 1 Measures of central tendency and measures of spread.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Descriptive Statistics: Numerical Methods
Chapter 3 Numerically Summarizing Data 3.2 Measures of Dispersion.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Basic Measurement and Statistics in Testing. Outline Central Tendency and Dispersion Standardized Scores Error and Standard Error of Measurement (Sm)
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School.
Summary Statistics: Measures of Location and Dispersion.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Symbol Description It would be a good idea now to start looking at the symbols which will be part of your study of statistics.  The uppercase Greek letter.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Class 1 Introduction Sigma Notation Graphical Descriptions of Data Numerical Descriptions of Data.
CHAPTER 2: Basic Summary Statistics
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Copyright © 2016 Brooks/Cole Cengage Learning Intro to Statistics Part II Descriptive Statistics Intro to Statistics Part II Descriptive Statistics Ernesto.
Statistics for Business
Descriptive Statistics
Measures of Central Tendency
Central Tendency and Variability
Summary descriptive statistics: means and standard deviations:
Variance Variance: Standard deviation:
Measures of Location Statistics of location Statistics of dispersion
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
CHAPTER 2: Basic Summary Statistics
Numerical Descriptive Measures
Presentation transcript:

A.M EASURES OF LOCATION A.M EASURES OF LOCATION B.M EASURES OF SPREAD Central tendency and measures of dispersion &

Measures of Location Spread Central tendency Dispersion tendency

Measures of Location (Central tendency) 1. Mean 2. Median 3. Mode Common measures of location are A

1. Mean a. Arithmetic Mean/Averageb. Harmonic Meanc. Geometric Mean Mean is of 3 types such as

Arithmetic Mean The most widely utilized measure of central tendency is the arithmetic mean or average. The population mean is the sum of the values of the variables under study divided by the total number of observations in the population. It is denoted by μ (‘mu’). Each value is algebraically denoted by an X with a subscript denotation ‘ i ’. For example, a small theoretical population whose objects had values 1,6,4,5,6,3,8,7 would be denoted X 1 =1, X 2 = 6, X 3 = 4……. X 8 =7 …….1.1

Mean…. N We would denote the population size with a capital N. In our theoretical population N=8. The pop. mean μ would be Formula 1.1: The algebraic shorthand formula for a pop. mean is μ =

Mean….. The Greek letter (sigma) indicates summation, the subscript i=1 means to start with the first observation, and the superscript N means to continue until and including the N th observation. For the example above, would indicate the sum of X 2 +X 3 +X 4 +X 5 or = 21. To reduce clutter, if the summation sign is not indexed, for example X i, it is implied that the operation of addition begins with the first observation and continues through the last observation in a population, that is, =

Mean… The sample mean is defined by = Where n is the sample size. The sample mean is usually reported to one more decimal place than the data and always has appropriate units associated with it. n The symbol (X bar) indicates that the observations of a subset of size n from a population have been averaged.

Mean…. μ is fundamentally different from μ because samples from a population can have different values for their sample mean, that is, they can vary from sample to sample within the population. The population mean, however, is constant for a given population.

Mean….. Again consider the small theoretical population 1,6,4,5,6,3,8,7. A sample size of 3 may consists of 5,3,4 with = 4 or 6,8,4 with = 6. Actually there are 56 possible samples of size 3 that could be drawn from the population 1.1. Only four samples have a sample mean the same as the population mean ie = μ.

Mean… SampleSum X 3, X 6, X X 2, X 3, X X 5, X 3, X X 8, X 6, X

Mean… Each sample mean is an unbiased estimate of μ but depends on the values included in the sample size for its actual value. We would expect the average of all possible ‘s to be equal to the population parameter, μ. This is in fact, the definition of an unbiased estimator of the pop. mean.

Mean… If you calculate the sample mean for each of the 56 possible samples with n=3 and then average these sample means, they will give an average value of 5, that is, the pop. mean, μ. Remember that most real populations are too large or too difficult to census completely, so we must rely on using a single sample to estimate or approximate the population characteristics.

Harmonic mean

Geometric mean n= no of obs., X 1, X 2, X 3 ……..X n are individual obs.

Median The second measure of central tendency is the MEDIAN. The median is the middle most value of an ordered list of observations. Though the idea is simple enough, it will prove useful to define in terms of an even simple notion. The depth of a value is its position relative to the nearest extreme (end) when the data are listed in order from smallest to largest.

Median: Example 2.1 Table below gives the circumferences at chest height (CCH) in cm and their corresponding depths for 15 sugar maples measured in a forest in Ohio. CCH Depth No. of obs. = 15 (odd) The population median M is the observation whose depth is d =, where N is the population size.

Median… A sample median M is the statistic used to approximate or estimate the population median. M is defined as the observation whose depth is d = where n is the sample size. In example 2.1 the sample size is n=15 so the depth of the sample median is d=8. the sample median X = X 8 = 38 cm.

Median: Example 2.2 The table below gives CCH (cm) for 12 cypress pines measured near Brown lake on North Stradebroke Island CCH Depth No. of observation = 12 (even) Since n=12, the depth of the median is = 6.5. Obviously no observation has depth 6.5, so this is the interpretation as the average of both observations whose depth is 6 in the list above. So M = = 62 cm.

Mode The mode is defined as the most frequently occurring value in a data set. The mode in example 2.2 would be 73 cm while example 2.1 would have a mode of 29 cm.

Mean, median and mode concide In symmetrical distributions (NORMAL DISTRIBUTION), the MEAN, MEDIAN and MODE coincide.

Exercise Hen egg sizes(ES,g) on 12 wks of lay were randomly measured in a layer flock as follows. Determine mean, median and mode of eggs. size. Hen No ES

Measures of Spread (dispersion) It measures variability of data. There are 4 measures in common. 1. Range 2. Variance 3. Standard Deviation (SD) 4. Standard Error (SE) B

Range Range: The simplest measure of dispersion or spread of data is the RANGE Formula: The difference between the largest and smallest observations (two extremes) in a group of data is called the RANGE. Sample range= X n – X 1 ; Population range=X N -X 1 The values X n and X 1 are called ‘sample range limits’.

Range: Example Marks of Biometry of 10 students are as follows (Full marks 100) Student ID Marks ObtainedMarks ordered Here, Range = X 1 -X 10 =80-25 = 55

Range… The range is a crude estimator of dispersion because it uses only two of the data points and is somewhat dependent on sample size. As sample size increases, we expect largest and smallest observations to become more extreme. Therefore, sample size to increase even though population range remains unchanged. It is unlikely that sample will include the largest and smallest values from the population, so the sample range usually underestimates the population range and is,therefore, a biased estimator.

Variance Suppose we express each observation as a distance from the mean x i = X i -. These differences are called deviates and will be sometimes positive (X i is above the mean) and sometimes negative (X i is below the mean). If we try to average the deviates, they always sum to zero. Because the mean is the central tendency or location, the negative deviates will exactly cancel out the positive deviates.

Variance… Example X MeanDeviates Sum0 = 0

Variance… Algebraically one can demonstrate the same result more generally, Since is a constant for any sample,

Variance… Since then, so

Variance… To circumvent the unfortunate property, the widely used measure of dispersion called the sample variance utilizes the square of the deviates. The quantity is the sum of these squared deviates and is referred to as the corrected sum of squares (CSS). Each observation is corrected or adjusted for its distance from the mean.

Variance… Formula: The CSS is utilized in the formula for the sample variance The sample variance is usually reported to two more decimal places than the data and has units that are the square of the measurement units.

Variance… Or With a similar deviation the population variance computational formula can be shown to be

Variance…Example(unit Kg) Data set 3.1, 17.0, 9.9, 5.1, 18.0, 3.8, 10.0, 2.9, 21.2 n=9

Variance… Remember, the numerator must always be a positive number because it is sum of squared deviations. Population variance formula is rarely used since most populations are too large to census directly.

Standard deviation (SD) Standard deviation is the positive square root of the variance And

Standard Error (SE) n= no. of observation

Exercise 2 Daily milk yield (L) of 12 cows are tabulated below. Calculate mean, median, mode, variance and standard error. Cow noMilk yieldCow noMilk yield

Problem 1 Two herds of cows located apart in Malaysia gave the following amount of milk/day (L). Compute arithmetic mean, median, mode, range, variance, SD and SE of daily milk yield in cows of the two herds. Put your comments on what have been reflected from two sets of milk records as regards to their differences.

Table Herd Cow no Herd Cow no

Problem 2 Sex adjusted weaning weight of lambs in two different breeds of sheep were recorded as follows. Compute mean, median, range, variance and SE in weaning weight of lambs in two breed groups. Put your comments on various differences between the two groups.

Weaning wt. (Kg) of lambs Breed Breed

Problem No 3 In a market study data on the price (RM) of 10 kg rice were collected from 2 different markets in Malaysia. Using descriptive statistics show the differences relating to price of rice in the two markets. Pasar 1: 20, 25, 22, 23, 22, 24, 23, 21, 25, 25,23,22,25,24,24 Pasar 2: 25, 24, 26, 23, 26, 25, 25, 26, 24, 26, 24, 23,22, 25, 26, 26, 24