Describing data with graphics and numbers. Types of Data Categorical Variables –also known as class variables, nominal variables Quantitative Variables.

Slides:



Advertisements
Similar presentations
Introduction to Summary Statistics
Advertisements

Frequency Distribution and Variation Prepared by E.G. Gascon.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
Business 205. Review of last class NOIR Validity Reliability.
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Measures of Dispersion CJ 526 Statistical Analysis in Criminal Justice.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Edpsy 511 Homework 1: Due 2/6.
Data observation and Descriptive Statistics
Quiz 2 Measures of central tendency Measures of variability.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Objective To understand measures of central tendency and use them to analyze data.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Methods for Describing Sets of Data
Chapter 3 Averages and Variations
Statistics Recording the results from our studies.
Introduction to Summary Statistics. Statistics The collection, evaluation, and interpretation of data Statistical analysis of measurements can help verify.
Descriptive Statistics Roger L. Brown, Ph.D. Medical Research Consulting Middleton, WI Online Course #1.
Tuesday, March 18, 2014MAT Tuesday, March 18, 2014MAT 3122.
Chapter 2 Describing Data.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Skewness & Kurtosis: Reference
Thursday, February 6, 2014MAT 312. Thursday, February 6, 2014MAT 312.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
MDM4U Chapter 3 Review Normal Distribution Mr. Lieff.
Tuesday, February 11, 2014MAT 312. Tuesday, February 11, 2014MAT 312.
Thursday, February 13, 2014MAT 312. Thursday, February 13, 2014MAT 312.
Practice Page 65 –2.1 Positive Skew Note Slides online.
Thursday, February 27, 2014MAT 312. Thursday, February 27, 2014MAT 312.
Psy 230 Jeopardy Measurement Research Strategies Frequency Distributions Descriptive Stats Grab Bag $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Math 145 September 11, Recap  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals.
Tuesday, February 3, 2014MAT 312. Tuesday, February 3, 2014MAT 312.
Descriptive Statistics Review – Chapter 14. Data  Data – collection of numerical information  Frequency distribution – set of data with frequencies.
RESEARCH & DATA ANALYSIS
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
The field of statistics deals with the collection,
Distinguish between quantitative, qualitative Computing central value of set of data: mean, median and mode Ways to report the variation in a set of data:
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Questions?? Example: Exercise set B #1 p. 38 –Set up table with intervals, frequencies & percent per year –Graph appears on p. 39 Figure 5 –For discrete.
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
CHAPTER 2: Basic Summary Statistics
Why do we analyze data?  To determine the extent to which the hypothesized relationship does or does not exist.  You need to find both the central tendency.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
Chapter 3 Review MDM 4U Mr. Lieff. 3.1 Graphical Displays be able to effectively use a histogram name and be able to interpret the various types of distributions.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Data Presentation Numerical Summary Measures Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Welcome to MM305 Unit 2 Seminar Dr. Bob Statistical Foundations for Quantitative Analysis.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Methods for Describing Sets of Data
Measures of Central Tendency
Warm Up What is the mean, median, mode and outlier of the following data: 16, 19, 21, 18, 18, 54, 20, 22, 23, 17 Mean: 22.8 Median: 19.5 Mode: 18 Outlier:
Module 6: Descriptive Statistics
Shoe Sizes.
IB Psychology Today’s Agenda: Turn in:
Description of Data (Summary and Variability measures)
Univariate Descriptive Statistics
Summary descriptive statistics: means and standard deviations:
Review for Exam 1 Ch 1-5 Ch 1-3 Descriptive Statistics
CHAPTER 2: Basic Summary Statistics
Presentation transcript:

Describing data with graphics and numbers

Types of Data Categorical Variables –also known as class variables, nominal variables Quantitative Variables –aka numerical nariables –either continuous or discrete.

Graphing categorical variables

Ten most common causes of death in Americans between 15 and 19 years old in 1999.

Bar graphs

Graphing numerical variables

Heights of BIOL 300 students (cm)

Stem-and-leaf plot

Frequency table Height GroupFrequency

Frequency table Height GroupFrequency

Histogram

Frequency distribution

Histogram with more data

90th percentile 50th percentile (median)

Associations between two categorical variables

Association between reproductive effort and avian malaria

Mosaic plot

Grouped Bar Graph

Associations between categorical and numerical variables

Multiple histograms

Associations between two numerical variables

Scatterplots

Evaluating Graphics Lie factor Chartjunk Efficiency

Don’t mislead with graphics

Better representation of truth

Lie Factor Lie factor = size of effect shown in graphic size of effect in data

Lie Factor Example Effect in graphic: 2.33/0.08 = 29.1 Effect in data: 6748/5844 = 1.15 Lie factor = 29.1 / 1.15 = 25.3

Chartjunk

Needless 3D Graphics

Summary: Graphical methods for frequency distributions

Summary: Associations between variables

Great book on graphics

Describing data

Two common descriptions of data Location (or central tendency) Width (or spread)

Measures of location Mean Median Mode

Mean n is the size of the sample

Mean Y 1 =56, Y 2 =72, Y 3 =18, Y 4 =42

Mean Y 1 =56, Y 2 =72, Y 3 =18, Y 4 =42 = ( ) / 4 = 47

Median The median is the middle measurement in a set of ordered data.

The data:

The data: can be put in order: Median is 25.

Mean vs. median in politics 2004 U.S. Economy Republicans: times are good –Mean income increasing ~ 4% per year Democrats: times are bad –Median family income fell Why?

Measures of width Range Standard deviation Variance Coefficient of variation

Range

Range The range is = 22

Population Variance

Sample variance n is the sample size

Shortcut for calculating sample variance

Standard deviation (SD) Positive square root of the variance  is the true standard deviation s is the sample standard deviation

In class exercise Calculate the variance and standard deviation of a sample with the following data: 6, 1, 2

Answer Variance=7 Standard deviation =

Coefficient of variance (CV) CV = 100 s /.

Equal means, different variances

Manipulating means The mean of the sum of two variables: E[X + Y] = E[X]+ E[Y] The mean of the sum of a variable and a constant: E[X + c] = E[X]+ c The mean of a product of a variable and a constant: E[c X] = c E[X] The mean of a product of two variables: E[X Y] = E[X] E[Y] if and only if X and Y are independent.

Manipulating variance The variance of the sum of two variables: Var[X + Y] = Var[X]+ Var[Y] if and only if X and Y are independent. The variance of the sum of a variable and a constant: Var[X + c] = Var[X] The variance of a product of a variable and a constant: Var[c X] = c 2 Var[X]

Parents’ heights MeanVariance Father Height Mother Height Father Height +Mother Height