Introduction to Statistics Measures of Central Tendency.

Slides:



Advertisements
Similar presentations
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Advertisements

Introduction to Summary Statistics
Descriptive (Univariate) Statistics Percentages (frequencies) Ratios and Rates Measures of Central Tendency Measures of Variability Descriptive statistics.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
PSY 307 – Statistics for the Behavioral Sciences
Descriptive Statistics
Measures of Dispersion
Measures of Central Tendency
Today: Central Tendency & Dispersion
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
What is statistics? STATISTICS BOOT CAMP Study of the collection, organization, analysis, and interpretation of data Help us see what the unaided eye misses.
Intra-Individual Variability Intra-individual variability is greater among older adults (Morse 1993) –May be an indicator of the functioning of the central.
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
Part II Sigma Freud & Descriptive Statistics
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Chapter 3 Descriptive Measures
1 Review Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central Measures (mean,
Statistics and Quantitative Analysis U4320 Segment 2: Descriptive Statistics Prof. Sharyn O’Halloran.
Descriptive Statistics: Numerical Methods
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
Measures of Central Tendency And Spread Understand the terms mean, median, mode, range, standard deviation.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Measures of Dispersion
Categorical vs. Quantitative…
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Measures of Central Tendency: The Mean, Median, and Mode
Part II Sigma Freud and Descriptive Statistics Chapter 2 Computing and Understanding Averages: Means to an End.
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
Part II  igma Freud & Descriptive Statistics Chapter 2 Means to an End: Computing and Understanding Averages.
Introduction to Statistics Santosh Kumar Director (iCISA)
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Describing Distributions with Numbers Chapter 2. What we will do We are continuing our exploration of data. In the last chapter we graphically depicted.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Introduction to Statistics Measures of Central Tendency and Dispersion.
LIS 570 Summarising and presenting data - Univariate analysis.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Introduction to Statistics. Objectives: Understand certain statistical concepts & terminology Describe types of measurement scales Differentiate between.
Presenting Data Descriptive Statistics. Chapter- Presentation of Data Mona Kapoor.
Averages and Variability
Chapter 2 Describing and Presenting a Distribution of Scores.
Descriptive Statistics(Summary and Variability measures)
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Descriptive Statistics (Part 2)
Statistical Reasoning in Everyday Life
IB Psychology Today’s Agenda: Turn in:
Description of Data (Summary and Variability measures)
T6.1 – Introduction to Statistics
Univariate Descriptive Statistics
Univariate Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Central Tendency.
Measures of Location Statistics of location Statistics of dispersion
Introduction to Statistics
Welcome!.
Presentation transcript:

Introduction to Statistics Measures of Central Tendency

Two Types of Statistics Descriptive statistics of a POPULATION Relevant notation (Greek): –  mean –N population size –  sum Inferential statistics of SAMPLES from a population. –Assumptions are made that the sample reflects the population in an unbiased form. Roman Notation: –X mean –n sample size –  sum

Be careful though because you may want to use inferential statistics even when you are dealing with a whole population. Measurement error or missing data may mean that if we treated a population as complete that we may have inefficient estimates. –It depends on the type of data and project. –Example of Democratic Peace.

Also, be careful about the phrase “descriptive statistics”. It is used generically in place of measures of central tendency and dispersion for inferential statistics. Another name is “summary statistics”, which are univariate: –Mean, Median, Mode, Range, Standard Deviation, Variance, Min, Max, etc.

Measures of Central Tendency These measures tap into the average distribution of a set of scores or values in the data. –Mean –Median –Mode

What do you “Mean”? The “mean” of some data is the average score or value, such as the average age of an MPA student or average weight of professors that like to eat donuts. Inferential mean of a sample: X=(  X)/n Mean of a population:  =(  X)/N

Problem of being “mean” The main problem associated with the mean value of some data is that it is sensitive to outliers. Example, the average weight of political science professors might be affected if there was one in the department that weighed 600 pounds.

Donut-Eating Professors ProfessorWeight Schmuggles165 Bopsey213 Pallitto Homer Schnickerson165 Levin148 Honkey-Doorey251 Zingers308 Boehmer151 Queenie132 Googles-Boop199 Calzone

The Median (not the cement in the middle of the road) Because the mean average can be sensitive to extreme values, the median is sometimes useful and more accurate. The median is simply the middle value among some scores of a variable. (no standard formula for its computation)

What is the Median? ProfessorWeight Schmuggles165 Bopsey213 Pallitto189 Homer187 Schnickerson165 Levin148 Honkey-Doorey251 Zingers308 Boehmer151 Queenie132 Googles-Boop199 Calzone Weight Rank order and choose middle value. If even then average between two in the middle

Percentiles If we know the median, then we can go up or down and rank the data as being above or below certain thresholds. You may be familiar with standardized tests. 90 th percentile, your score was higher than 90% of the rest of the sample.

The Mode (hold the pie and the ala) (What does ‘ala’ taste like anyway??) The most frequent response or value for a variable. Multiple modes are possible: bimodal or multimodal.

Figuring the Mode ProfessorWeight Schmuggles165 Bopsey213 Pallitto189 Homer187 Schnickerson165 Levin148 Honkey-Doorey251 Zingers308 Boehmer151 Queenie132 Googles-Boop199 Calzone227 What is the mode? Answer: 165 Important descriptive information that may help inform your research and diagnose problems like lack of variability.

Measures of Dispersion Measures of Dispersion (not something you cast…) Measures of dispersion tell us about variability in the data. Also univariate. Basic question: how much do values differ for a variable from the min to max, and distance among scores in between. We use: –Range –Standard Deviation –Variance

Remember that we said in order to glean information from data, i.e. to make an inference, we need to see variability in our variables. Measures of dispersion give us information about how much our variables vary from the mean, because if they don’t it makes it difficult infer anything from the data. Dispersion is also known as the spread or range of variability.

The Range The Range (no Buffalo roaming!!) r = h – l –Where h is high and l is low In other words, the range gives us the value between the minimum and maximum values of a variable. Understanding this statistic is important in understanding your data, especially for management and diagnostic purposes.

The Standard Deviation A standardized measure of distance from the mean. Very useful and something you do read about when making predictions or other statements about the data.

=square root  =sum (sigma) X=score for each point in data _ X=mean of scores for the variable n=sample size (number of observations or cases S = Formula for Standard Deviation

We can see that the Standard Deviation equals pounds. The weight of Zinger is still likely skewing this calculation (indirectly through the mean).

Example of S in use Boehmer- Sobek paper. –One standard deviation increase in the value of X variable increases the Probability of Y occurring by some amount.

Table 2: Development and Relative Risk of Territorial Claim Probability*% Change Baseline development pop density pop growth Capability Openness Capability and pop growth % Change in prob after 1 sd change in given x variable, holding others at their means

Let’s go to computers! Type in data in the Excel sheet.

Variance Note that this is the same equation except for no square root taken. Its use is not often directly reported in research but instead is a building block for other statistical methods =

Organizing and Graphing Data

Goal of Graphing? 1.Presentation of Descriptive Statistics 2.Presentation of Evidence 3.Some people understand subject matter better with visual aids 4.Provide a sense of the underlying data generating process (scatter- plots)

What is the Distribution? Gives us a picture of the variability and central tendency. Can also show the amount of skewness and Kurtosis.

Graphing Data: Types

Creating Frequencies We create frequencies by sorting data by value or category and then summing the cases that fall into those values. How often do certain scores occur? This is a basic descriptive data question.

Ranking of Donut-eating Profs. (most to least) Zingers308 Honkey-Doorey251 Calzone227 Bopsey213 Googles-boop199 Pallitto189 Homer187 Schnickerson165 Smuggle165 Boehmer151 Levin148 Queeny132

Here we have placed the Professors into weight classes and depict with a histogram in columns.

Here it is another histogram depicted as a bar graph.

Pie Charts:

Actually, why not use a donut graph. Duh! See Excel for other options!!!!

Line Graphs: A Time Series

Scatter Plot (Two variable)