Introduction A Review of Descriptive Statistics. Charts When dealing with a larger set of data values, it may be clearer to summarize the data by presenting.

Slides:



Advertisements
Similar presentations
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Advertisements

Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Statistics It is the science of planning studies and experiments, obtaining sample data, and then organizing, summarizing, analyzing, interpreting data,
Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Table of Contents Exit Appendix Behavioral Statistics.
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Introduction to Summary Statistics
Exploratory Data Analysis (Descriptive Statistics)
Descriptive Statistics
2-5 : Normal Distribution
Statistics for the Social Sciences
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
PSY 307 – Statistics for the Behavioral Sciences
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
A Look at Means, Variances, Standard Deviations, and z-Scores
Chapter 3 – Descriptive Statistics
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 3 Statistics for Describing, Exploring, and Comparing Data
1.3 Psychology Statistics AP Psychology Mr. Loomis.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Reasoning in Psychology Using Statistics Psychology
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Psyc 235: Introduction to Statistics Lecture Format New Content/Conceptual Info Questions & Work through problems.
Chapter 2 Describing Data.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
14.1 Data Sets: Data Sets: Data set: collection of data values.Data set: collection of data values. Frequency: The number of times a data entry occurs.Frequency:
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
Categorical vs. Quantitative…
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
MDM4U Chapter 3 Review Normal Distribution Mr. Lieff.
Descriptive Statistics: Presenting and Describing Data.
Essential Statistics Chapter 31 The Normal Distributions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Chapter 3 Review MDM 4U Mr. Lieff. 3.1 Graphical Displays be able to effectively use a histogram name and be able to interpret the various types of distributions.
Copyright © 2011 Pearson Education, Inc. Describing Numerical Data Chapter 4.
Descriptive Statistics Review – Chapter 14. Data  Data – collection of numerical information  Frequency distribution – set of data with frequencies.
Descriptive Statistics Tabular and Graphical Displays –Frequency Distribution - List of intervals of values for a variable, and the number of occurrences.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Today: Standard Deviations & Z-Scores Any questions from last time?
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Descriptive Statistics(Summary and Variability measures)
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Exploratory Data Analysis
Descriptive Statistics: Presenting and Describing Data
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Laugh, and the world laughs with you. Weep and you weep alone
Descriptive Statistics
Presentation transcript:

Introduction A Review of Descriptive Statistics

Charts When dealing with a larger set of data values, it may be clearer to summarize the data by presenting a graphical image

Intervals Numerical data values may be grouped or classified by defining “class intervals”: Suppose the following data values represent the ACT test scores for 30 individuals. 8, 10, 11, 13, 13, 14, 14, 15, 15, 16, 16, 17, 17, 18, 18, 18, 18, 19, 20, 20, 21, 21, 21, 22, 22, 23, 25, 26, 28, 30 Define intervals so that each of the values fall into exactly one of the intervals.

Frequency Determine how many data scores fall in each of the intervals (the "frequency“)

Draw a bar chart (or "histogram") with the height of the bar on each interval determined by the frequency “Histogram”

Relative Frequency Alternatively, give the percentage of scores or "relative frequency". That is, if 5 of the 30 values fall in the interval, then the relative frequency is 5/30 =

Relative to each other, the bars are the same height and the histograms have the same shape.

Cumulative Frequency …or we could “keep a running total”, called a “cumulative frequency”, as we go from one interval to the next. if there are 2 values in the first interval and 5 in the next, then the cumulative frequency is = 7 for the second interval.

Cumulative Graph The increase in the height of the bar shows how many data values were contributed by a given interval.

The Middle In addition to the graphical summary also give numerical measurements which describe the distribution of the data The middle ?

Set of Heights the height (in inches) of 30 third graders How should we describe the "middle height"? For numerical data, we commonly compute the "arithmetic average" of the values, also called the mean value.

The Mean Value To compute the average: find the sum of the values and divide by the number of values in the set. For our 30 third-graders, we find the sum of the 30 heights and then divide by 30: Compare this to “the middle” of the histogram.

The “Middle Weight” Looks to be in the middle! Mean = 54.7

Sampling a Population We distinguish between a sample and the entire population. A population consists of all the members of the set under consideration (eg., all third-graders in the United States) A sample consists of a subset of members selected from a population (eg., 30 third-graders in our example)

Notation The notation used depends on if we’re using the entire population or a sample. If a selected sample is representative of the population, we expect the mean of the sample is nearly equal to the mean for the population.

Median Value The median value is literally defined to be the middle data value. You may need to "split the difference" by averaging two middle values. Half the data lies at or below the median and the other half lies at or above the median. Median is another “measure of the middle” but is less affected by non-typical data values.

Median third-grader? Consider our previous data for 30 third-graders An even number of data values, so we average the two middle values. The median is ( )/2 = inches.

Mean vs. Median In smaller samples, the median value is often a better measure; it is unaffected a non-typical score and is more representative of the middle. Suppose test scores were 23, 58, 64, 68, 75, 79, 83, 85, 87, 91, 94 median is 79 Mean equals about 73.36

The Spread Another characteristic of a data set is how widely the data values are spread. Find a way to measure how widely the values vary. The measurement we use is called the "standard deviation".

The Deviations Having determined the mean value, we can measure how far each data value varies from the middle. The difference or "deviation" from the middle, is computed as. Our goal is to compute a sort of average of these deviations from the middle.

“16 ounce drink” Suppose a sample of 8 medium colas were measured. The volumes, measured in ounces, are given by the data below Volumes have an average or mean value of ounces.

Deviations in Colas Recall the contents of our 8 colas where the mean value is ounces. data value deviation from middle = = = = = = = 0.45

Squared Deviations To prevent the negative and postive values from cancelling each other out, we square them. data deviation from middle deviation squared = (- 0.35) 2 = = (- 0.25) 2 = = (- 0.15) 2 = = (- 0.15) 2 = = 0.05 ( 0.05) 2 = = 0.15 = = 0.25 = = 0.45 =

Avg. of Squared Deviations To average the deviations: add the squared deviations and divide by one less than the number of data values in the sample. Finally, we "undo the squaring" by computing the square root.

data value deviation squared total = = sum of squared deviations

s = is a sort of average of how far the data values vary from the middle Average Spread

Notation As with the mean value, notation depends on the whether the data represents the population or a sample.

Compare The standard deviation describes the “distribution of the data”. Which of the following distributions would you expect to have the larger standard deviation?

Match the statistics with the histograms

Bell-shaped Distribution For reasonably large random samples, we often observe a "bell-shaped" distribution. In such cases, we expect to find about 68% of the data within one std. dev. of the mean. Also, about 95% of the data is expected to lie within 2 standard deviations of the mean.

“Empirical Rule”