PUAF 610 TA Session 2. Today: class review (summary statistics), STATA introduction. Reminder: HW this week.

Presentation transcript:

1 PUAF 610 TA Session 2

2 Today Class review: summary statistics. STATA introduction. Reminder: HW this week.

3 Review: Two types of Statistics Descriptive statistics summarize numerical information. Inferential statistics use a sample to draw conclusions about the population.

4 Summary statistic In descriptive statistics, summary statistics are used to summarize a set of observations. Typically, –What is the central value? –How widely are values spread from the center? –Are there data that are very atypical? –….

5 Summary statistic a measure of location, or central tendency a measure of statistical dispersion a measure of the shape of the distribution

6 Central tendency Central tendency relates to the way in which quantitative data tend to cluster around some value. A measure of central tendency is any of a number of ways of specifying the “central value”.

7 Basic measures of central tendency Mean Median Mode

8 Mean the sum of all measurements divided by the number of observations in the data set population mean (μ) vs. sample mean (x̄, “x-bar”)

9 Example Assume 4 people take PUAF 610, and their final exam scores are 95, 87, 93, 83. What’s the mean for exam score?

10 Example Mean = (95 + 87 + 93 + 83)/4 = 89.5
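The arithmetic in this example can be checked with a short Python sketch. Python is only an illustrative choice here (the session itself moves on to STATA), and the variable names are hypothetical.

```python
from statistics import mean

# Final exam scores of the four students in the example
scores = [95, 87, 93, 83]

# Mean = sum of all measurements / number of observations
manual_mean = sum(scores) / len(scores)
library_mean = mean(scores)

print(manual_mean)   # 89.5
print(library_mean)  # 89.5
```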

11 Median the middle observation, when data are ordered from smallest to largest the point of a distribution that divides the bottom 50% from the top 50% of the data. The median is the 50th percentile.

12 Median If there is an odd number of observations, the median is the middle observation If there is an even number of observations, the median is the average of the two middle observations If the dataset is arranged in increasing order the median is located at position (n+1)/2

13 Example Calculate the sample median for the following observations: 1, 5, 2, 8, 7. Start by sorting the values: 1, 2, 5, 7, 8. The median is located at position (n+1)/2=3, thus it is 5. An odd number of values.

14 Example Calculate the sample median for the following observations: 1, 5, 2, 8, 7, 2. Start by sorting the values: 1, 2, 2, 5, 7, 8. The median is located at position (n+1)/2 = 3.5, so it is the average of the two middle values (the 3rd and 4th): (2 + 5)/2 = 3.5. An even number of values.
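Both median examples (odd and even number of values) can be reproduced with the sketch below. The function name `sample_median` is hypothetical; it follows the sort-then-take-the-middle rule described above and is compared against the standard library's `statistics.median`.

```python
from statistics import median

def sample_median(values):
    """Median of a list: sort, then take the middle observation(s)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: position (n + 1) / 2 is a whole number (0-based index mid)
        return ordered[mid]
    # Even n: average the two middle observations
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_case = sample_median([1, 5, 2, 8, 7])
even_case = sample_median([1, 5, 2, 8, 7, 2])

print(odd_case)   # 5
print(even_case)  # 3.5
```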

15 Mode the most frequent value in the data set It is possible for a distribution to have more than one mode or not to have a mode at all.

16 Example Find the mode for each of the following data sets: (1) 1, 2, 2, 3, 4, 7, 9 (2) 12, 26, 26, 53, 84, 71, 71, 79 (3) 32, 46, 53, 94, 37, 29
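One way to work through the three data sets is the Python sketch below. The helper `modes` is hypothetical, and it assumes the convention that a data set in which no value repeats has no mode (returned here as an empty list).

```python
from collections import Counter

def modes(values):
    """Return all most-frequent values; an empty list if no value repeats."""
    counts = Counter(values)
    highest = max(counts.values())
    if highest == 1:
        # Assumed convention: every value occurs once, so there is no mode
        return []
    return sorted(v for v, c in counts.items() if c == highest)

set1 = modes([1, 2, 2, 3, 4, 7, 9])
set2 = modes([12, 26, 26, 53, 84, 71, 71, 79])
set3 = modes([32, 46, 53, 94, 37, 29])

print(set1)  # [2]       one mode
print(set2)  # [26, 71]  two modes
print(set3)  # []        no mode
```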

17 Comparing Mode, Median and Mean Pros and Cons For descriptive purposes, we might use whichever measure suits the data. If we would like to infer from samples to populations, the mean is the measure of choice because it can be manipulated mathematically.

18 Summary statistic a measure of location, or central tendency a measure of statistical dispersion, or variation a measure of the shape of the distribution

19 Measures of Variation Variation is variability or spread in a variable Measures of variation are lengths of intervals on the measurement scale that indicate the spread of values in a distribution.

20 Measures of Variation Range Quartiles Interquartile range Variance Standard Deviation

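21 Range the length of the smallest interval which contains all the data: highest value – lowest value. For integer-valued scores, some textbooks use the inclusive range instead: (highest value – lowest value) + 1.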
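As a quick illustration, the range can be computed for the exam scores from the earlier mean example. Reusing those scores is an assumption made for this sketch. Both the plain difference and the "+ 1" form are shown so the two can be compared.

```python
# Exam scores reused from the mean example (an assumption for this sketch)
scores = [95, 87, 93, 83]

# Plain range: highest value minus lowest value
simple_range = max(scores) - min(scores)

# Inclusive form with the "+ 1" adjustment
inclusive_range = simple_range + 1

print(simple_range)     # 12
print(inclusive_range)  # 13
```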

22 Quartiles any of the three values which divide the sorted data set into four equal parts, so that each part represents one fourth of the sampled population.

23 Quartiles first quartile (Q1) = lower quartile = cuts off lowest 25% of data = 25th percentile second quartile (Q2) = median = cuts data set in half = 50th percentile third quartile (Q3) = upper quartile = cuts off highest 25% of data, or lowest 75% = 75th percentile * The difference between the upper and lower quartiles is called the interquartile range.
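The quartiles and the interquartile range can be sketched in Python as follows, reusing the six observations from the median example (an assumption made for illustration). Statistical packages use different interpolation rules for quartiles, so Q1 and Q3 from other software may differ slightly from the values here. The `method="inclusive"` setting is just one such rule.

```python
from statistics import median, quantiles

# Observations reused from the median example
data = [1, 2, 2, 5, 7, 8]

# n=4 splits the sorted data into four parts and returns the three cut points
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

# Interquartile range: upper quartile minus lower quartile
iqr = q3 - q1

print(q1, q2, q3)  # 2.0 3.5 6.5
print(iqr)         # 4.5
```

Note that Q2 equals the median (3.5) found earlier for the same data.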

24 Variance Describes how far values lie from the mean. To get rid of the minus signs in the deviation scores, we can either take their absolute values or square them. Averages of absolute values are hard to use in more advanced analyses. –By averaging the squared deviations (the sum of squares divided by the number of observations) we get a measure that allows further algebraic manipulations that are difficult or impossible with absolute values.

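25 Variance Less intuitive and more difficult to interpret, because it is measured in squared units rather than original units. For that reason we do not report variance much on its own. $\sigma^2 = \frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2$ (in population) and $s^2 = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2$ (in sample), where μ is the population mean, N is the population size, x̄ is the sample mean, and n is the sample size.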
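A worked sketch of the variance calculation, using the exam scores from the mean example (reusing them here is an assumption). The sample version divides by n − 1, which is the usual textbook convention and is assumed in this sketch. The results are checked against `statistics.pvariance` and `statistics.variance`.

```python
from statistics import pvariance, variance

scores = [95, 87, 93, 83]
n = len(scores)
mean_score = sum(scores) / n  # 89.5

# Squared deviations from the mean, summed (the "sum of squares")
sum_of_squares = sum((x - mean_score) ** 2 for x in scores)

# Population variance divides by N; sample variance divides by n - 1
population_variance = sum_of_squares / n
sample_variance = sum_of_squares / (n - 1)

print(sum_of_squares)             # 91.0
print(population_variance)        # 22.75
print(round(sample_variance, 2))  # 30.33
```

The units are squared exam points, which is why the variance is harder to interpret than the original scores.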

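26 Standard deviation A widely used measure of variability or dispersion. It shows how much variation there is from the "average". The standard deviation is obtained by taking the square root of the variance, i.e. $\sigma = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2}$ (population) and $s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2}$ (sample).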

27 Standard deviation A low standard deviation indicates that the data points tend to be very close to the mean. A high standard deviation indicates that the data is spread out over a large range of values.
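The sketch below computes the standard deviation of the exam scores from the mean example and then contrasts two hypothetical data sets (`tight` and `spread`, invented for illustration) that share the same mean but differ in spread.

```python
from math import sqrt
from statistics import mean, pstdev, stdev

scores = [95, 87, 93, 83]

# Standard deviation = square root of the variance
population_sd = pstdev(scores)  # sqrt(22.75)
sample_sd = stdev(scores)       # sqrt(91 / 3)

print(round(population_sd, 2))  # 4.77
print(round(sample_sd, 2))      # 5.51

# Two hypothetical data sets with the same mean (89.5) but different spread
tight = [89, 89, 90, 90]
spread = [70, 80, 99, 109]

tight_sd = pstdev(tight)
spread_sd = pstdev(spread)

print(tight_sd)             # 0.5, values sit close to the mean
print(round(spread_sd, 2))  # 15.34, values are spread over a wide range
```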

28 Summary statistic a measure of location, or central tendency a measure of statistical dispersion, or variation a measure of the shape of the distribution

29 Shape of the distribution Skewness Kurtosis

30 Skewness a measure of the asymmetry of the distribution The skewness value can be positive or negative, or even undefined.

31 Skewness negative skew: The left tail is longer; the mass of the distribution is concentrated on the right of the figure. It has relatively few low values.

32 Skewness positive skew: The right tail is longer; the mass of the distribution is concentrated on the left of the figure. It has relatively few high values.

33 Skewness A zero value indicates that the values are relatively evenly distributed on both sides of the mean.
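The three cases (positive, negative, zero skew) can be illustrated with a small Python sketch. The slides do not give a formula, so this sketch assumes the moment-based coefficient (third central moment divided by the 1.5 power of the second central moment). Statistical packages may apply a sample-size correction and report slightly different numbers. The data sets are hypothetical.

```python
def skewness(values):
    """Moment-based skewness: m3 / m2 ** 1.5 (assumed definition)."""
    n = len(values)
    mean_value = sum(values) / n
    m2 = sum((x - mean_value) ** 2 for x in values) / n
    m3 = sum((x - mean_value) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

right_tailed = [1, 2, 2, 3, 10]  # a few high values: positive skew
left_tailed = [1, 8, 9, 9, 10]   # mirror image: negative skew
symmetric = [1, 2, 3, 4, 5]      # evenly spread around the mean

skew_right = skewness(right_tailed)
skew_left = skewness(left_tailed)
skew_sym = skewness(symmetric)

print(skew_right > 0)  # True
print(skew_left < 0)   # True
print(skew_sym)        # 0.0
```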

34 Kurtosis a measure of the "peakedness" of the distribution Higher kurtosis means more of the variance is the result of infrequent extreme deviations, as opposed to frequent modestly sized deviations
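A rough illustration of this idea, assuming the moment-based definition of kurtosis (fourth central moment divided by the squared second central moment, which equals 3 for a normal distribution). The slides do not state a formula, and the two data sets below are hypothetical.

```python
def kurtosis(values):
    """Moment-based kurtosis: m4 / m2 ** 2 (assumed definition)."""
    n = len(values)
    mean_value = sum(values) / n
    m2 = sum((x - mean_value) ** 2 for x in values) / n
    m4 = sum((x - mean_value) ** 4 for x in values) / n
    return m4 / m2 ** 2

# Frequent, modestly sized deviations from the mean
flat = [-2, -1, 0, 1, 2]

# Mostly zeros with two infrequent extreme deviations
heavy_tailed = [-10, 0, 0, 0, 0, 0, 0, 0, 0, 10]

kurt_flat = kurtosis(flat)
kurt_heavy = kurtosis(heavy_tailed)

print(round(kurt_flat, 2))   # 1.7
print(round(kurt_heavy, 2))  # 5.0
```

In the second data set all of the variance comes from the two extreme values, which is what pushes the kurtosis up.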

35 That’s all for class review. So far so good? Let’s go to STATA!