1 Describing distributions with numbers William P. Wattles Psychology 302.

Slides:



Advertisements
Similar presentations
Quantitative Methods in HPELS 440:210
Advertisements

Describing Quantitative Variables
DESCRIBING DISTRIBUTION NUMERICALLY
Introduction to Data Analysis
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Statistics for the Social Sciences
The goal of data analysis is to gain information from the data. Exploratory data analysis: set of methods to display and summarize the data. Data on just.
Chapter 1 Introduction Individual: objects described by a set of data (people, animals, or things) Variable: Characteristic of an individual. It can take.
PSY 307 – Statistics for the Behavioral Sciences
Intro to Descriptive Statistics
Basic Business Statistics 10th Edition
Introduction to Educational Statistics
Edpsy 511 Homework 1: Due 2/6.
Data observation and Descriptive Statistics
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 1 Exploring Data
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Statistics.
Methods for Describing Sets of Data
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Describing distributions with numbers
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Tutorial 2 GE 5 Tutorial 2  rules of engagement no computer or no power → no lesson no computer or no power → no lesson no SPSS → no lesson no SPSS.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Chapter 3 Looking at Data: Distributions Chapter Three
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Copyright © 2011 Pearson Education, Inc. Describing Numerical Data Chapter 4.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
LIS 570 Summarising and presenting data - Univariate analysis.
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
Why do we analyze data?  To determine the extent to which the hypothesized relationship does or does not exist.  You need to find both the central tendency.
Chapter 2 Describing and Presenting a Distribution of Scores.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Chapter 1: Exploring Data
Descriptive Statistics
Describing distributions with numbers
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Biostatistics Lecture (2).
Presentation transcript:

1 Describing distributions with numbers William P. Wattles Psychology 302

2 Measuring the Center of a distribution n Mean – The arithmetic average – Requires measurement data n Median – The middle value n Mode – The most common value

3 Measuring the center with the Mean

4 Our first formula

5 The Mean n One number that tells us about the middle using all the data. n The group not the individual has a mean.

Population Sample

6 Sample mean

7 Mu, the population mean

Population Sample

8 Calculate the mean with Excel n Save the file psy302 to your hard drive –right click on the file –save to desktop or temp n Open file psy302 n Move flower trivia score to new sheet

9 Calculate the mean with Excel n Rename Sheet – double click sheet tab, type flower n Calculate the sum – type label: total n Calculate the mean – type label: mean n Check with average function

10 Measuring the center with the Median n Rank order the values n If the number of observations is odd the median is the center observation n If the number of observations is even the median is the mean of the middle two observations. (half way between them)

11 Measuring the center with the Median

12 The mean versus the median n The Mean – uses all the data – has arithmetic properties n The Median – less influenced by Outliers and extreme values

Mean vs. Median

5 The Mean n The mean uses all the data. n The group not the individual has a mean. n We calculate the mean on Quantitative Data Three things to remember

n The mean tells us where the middle of the data lies. n We also need to know how spread out the data are.

Measuring Spread n Knowing about the middle only tells us part of the story of the data. n We need to know how spread out the data are.

Variability n Variety is the spice of life n Without variability things are just boring

Why is the mean alone not enough to describe a distribution? n Outliers is NOT the answer!!!!

The mean tells us the middle but not how spread out the scores are.

14 Example of Spread n New York n mean annual high temperature 62

14 Example of Spread n San Francisco n mean annual high temperature 65

16 Example of Spread n New York n meanmaxminrangesd n n San Francisco n

Example of Variability

17 Measuring Spread n Range n Quartiles n Five-number summary – Minimum – first quartile – median – third quartile – Maximum n Standard Deviation

n Mean 50.63% n Mean 33.19% Std Dev 21.4% Std Dev 13.2%

19 Deviation score n Each individual has a deviation score. It measures how far that individual deviates from the mean. n Deviation scores always sum to zero. n Deviation scores contain information. – How far and in which direction the individual lies from the mean

18 Measuring spread with the standard deviation n Measures spread by looking at how far the observations are from their mean. n The standard deviation is the square root of the variance. n The variance is also a measure of spread

Individual deviation scores

Standard deviation n One number that tells us about the spread using all the data. n The group not the individual has a standard deviation. Note !!

23 Standard Deviation

22 Variance

24 Properties of the standard deviation n s measures the spread about the mean n s=0 only when there is no spread. This happens when all the observations have the same value. n s is strongly influenced by extreme values

n New Column headed deviation n Deviation score = X – the mean

25 Calculate Standard Deviation with Excel n In new column type heading: dev2 n Enter formula to square deviation n Total squared deviations – type label: sum of squares n Divide sum of squares by n-1 – type label: variance

Moore page 50

n To Calculate Standard Deviation: n Total raw scores n divide by n to get mean n calculate deviation score for each subject (X minus the mean) n Square each deviation score n Sum the deviation scores to obtain sum of squares n Divide by n-1 to obtain variance n Take square root of variance to get standard deviation.

Population Sample

26 Sample variance

27 Population variance

Population Variance Sample Variance

28 Little sigma, the Population standard deviation

29 Sample standard deviation

Population Standard Deviation Sample Standard Deviation

To analyze data n 1. Make a frequency distribution and plot the data n Look for overall pattern and outliers or skewness n Create a numerical summary: mean and standard deviation.

41 Start with a list of scores

42 Make a frequency distribution

43 Frequency distribution

44 Represent with a chart (histogram)

45 Represent with line chart

Density Curve n Replaces the histogram when we have many observations.

Transform a score n Hotel Atlantico n 200 pesos n Peso a unit of measure

Transform a score n 1 dollar = pesos n 200/28.38=$7.05 n Dollar a unit of measure

31 n standardized observations or values. n To standardize is to transform a score into standard deviation units. n Frequently referred to as z-scores n A z-score tells how many standard deviations the score or observation falls from the mean and in which direction

32 Standard Scores (Z-scores) n individual scores expressed in terms of the mean and standard deviation of the sample or population. n Z = X minus the mean/standard deviation

33 Z-score

34 new symbols

35 Calculate Z-scores for trivia data n Label column E as Z-score n Type formula deviation score/std dev n Make std dev reference absolute (use F4 to insert dollar signs) n Copy formula down. n Check: should sum to zero

File extensions n Word.doc n Excel.xls n Text files.txt

To view File extensions n Open Windows Explorer n Choose Tools/Folder Options/View n uncheck “hide extensions for known file types.

37 Z Scores n Height of young women – Mean = 64 – Standard deviation = 2.7 n How tall in deviations is a woman 70 inches? n A woman 5 feet tall (60 inches) is how tall in standard deviations?

38 Z scores n Height of young women – Mean = 64 – Standard deviation = 2.7 n How tall in deviations is a woman 70 inches? z = 2.22 n A woman 5 feet tall (60 inches) is how tall in standard deviations? z = -1.48

39 Calculating Z scores

Calculating X from Z scores

72 Types of data n Categorical or Qualitative data –Nominal: Assign individuals to mutually exclusive categories. F exhaustive: everyone is in one category –Ordinal: Involves putting individuals in rank order. Categories are still mutually exclusive and exhaustive, but the order cannot be changed.

73 Types of data n Measurement or Quantitative Data –Interval data: There is a consistent interval or difference between the numbers. Zero point is arbitrary –Ratio data: Interval scale plus a meaningful zero. Zero means none. Weight, money and Celsius scales exemplify ratio data –Measurement data allows for arithmetic operations.

Review n Video2 Video2

60 The End

Mean vs. Median