Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
Chapter 4 Displaying and Summarizing Quantitative Data.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 4 Displaying and Summarizing Quantitative Data.
1 Chapter 1: Sampling and Descriptive Statistics.
IB Math Studies – Topic 6 Statistics.
Displaying & Summarizing Quantitative Data
Exploratory Data Analysis (Descriptive Statistics)
Chapter 1 & 3.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Summarising and presenting data
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
Chapter 2: Exploring Data with Graphs and Numerical Summaries
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Programming in R Describing Univariate and Multivariate data.
Describing distributions with numbers
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Definitions Data: A collection of information in context.
Chapter 1 Exploring Data
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Descriptive Statistics
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Methods for Describing Sets of Data
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Chapter 2 Describing Data.
Describing distributions with numbers
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
1 Chapter 2: Exploring Data with Graphs and Numerical Summaries Section 2.1: What Are the Types of Data?
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
LIS 570 Summarising and presenting data - Univariate analysis.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.1 Different Types of.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Descriptive Statistics
Exploring Data Descriptive Data
Graphing options for Quantitative Data
Exploratory Data Analysis
CHAPTER 1 Exploring Data
Exploring Data Descriptive Data
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
ISE 261 PROBABILISTIC SYSTEMS
Statistics Unit Test Review
Statistical Reasoning
Laugh, and the world laughs with you. Weep and you weep alone
Histograms: Earthquake Magnitudes
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
Honors Statistics Review Chapters 4 - 5
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Presentation transcript:

Descriptive Statistics Unit 6

Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton count Can be classifies as either: -categorical -quantitative: *discrete *continuous

Categorical Variable Data belongs to one of a set of categories Exs: 1.Gender (Male or Female) 2.Pets owned (dog, cat, great white…) 3.Type of food imported (beef, pork, shellfish …) 4.Engage in 30 minutes of exercise daily (Yes or No) Type of graph(s): bar, pie

Pie Charts Summarizes categorical variable Drawn as circle where each category is a slice The size of each slice is proportional to the percentage in that category

Bar Graphs Summarizes categorical variable Vertical bars for each category Height of each bar represents either counts or percentages Easier to compare categories with bar graph than with pie chart Called Pareto Charts when ordered from tallest to shortest

Quantitative Variable Data is given numerical values for different magnitudes. Exs: 1.Age of test subjects 2.Number of siblings 3.Seasonal changes in pH of pond water Type of graph: scatter-plot, line, stem and leaf

Quantitative vs. Categorical For Quantitative variables, key features are the center (a representative value) and spread (variability). For Categorical variables, a key feature is the percentage of data in each of the categories

Discrete Quantitative Variable Quantitative variable is discrete if its possible values form a set of separate numbers: 0,1,2,3,…. Exs: 1.Number of calico cats sold 2.Number of nests with down linings 3.Number of students who fall asleep in Stats class

Continuous Quantitative Variable Quantitative variable is continuous if its possible values form an interval Measurements Examples: 1.Height/Weight 2.Age 3.Blood pressure

Most Common Way to Describe Data Central tendency Statistical variation

Central Tendency Used to represent entire data set Highlights distribution of data Measures one of the following: mode, mean, and median

Mode Value that occurs most often Highest bar in the histogram Mode is most often used with categorical data Best if not used alone

12, 12, 13, 14, 14, 15, 15, 15, 15, 37, 38 2, 3, 3, 4, 5, 5- bimodal 65, 68, 69, 71, 72, 73, 75, 77- mode?

Mean The sum of the observations divided by the number of observations Measure of centermost point when there is a symmetrical distribution of values in a data set Mean = Σx Σ- sum n n- total number of values

8g/cm³, 10g/cm³, 7g/cm³, 9g/cm³ 8g/cm³ + 10g/cm³ + 7g/cm³ + 9g/cm³ 4 34g/cm³ 4 8.5g/cm³

Median Midpoint of the observations when ordered from least to greatest Used when there are extremes in data 1. Order observations 2. If the number of observations is: a)Odd, the median is the middle observation b)Even- the median is the average of the two middle observations

Central Tendency If data set has normal distribution: mean, median and mode are the same value If data set is not distributed normally: values of central tendency will vary. *requires inferential statistics: t-test, ANOVA

Comparing the Mean and Median Mean and median of a symmetric distribution are close In a skewed distribution: the mean is farther out than the median

Statistical Variation Shows how scores differ from one another AKA: variation, dispersion, spread Represent average difference from the mean Four measures of variation: range, interquartile range, standard deviation, variance

Range Most general measure of variation Measures difference between highest and lowest values: spread of data Ex. pH 6, 6, 6, 7, 7, 7, 7, 5, 3 range: 7-3 = pH 4

Range range is strongly affected by outliers.

Interquartile Range- IQR AKA mid-fifty or midspread Organizes data into 4 quartiles, each with 25% of data To calculate IQR: 1. Find median of entire data set 2. Find median of lower half of set- lower quartile 3. Find median of upper half of set- upper quartile

Quartiles

M = median = 3.4 Q 1 = first quartile = 2.2 Q 3 = third quartile = 4.35 Measure of Spread: Quartiles * 25% of the data at or below Q 1 and 75% above * 50% of the obs are above the median and 50% are below * 75% of the data at or below Q 3 and 25% above

Calculating Interquartile Range Interquartile range: distance between the third and first quartile, giving spread of middle 50% of the data: IQR = Q3 - Q1

Standard Deviation Each data value has an associated deviation from the mean, A deviation is positive if it falls above the mean and negative if it falls below the mean The sum of the deviations is always zero

Standard deviation: summarizes the deviations of each observation from the mean and calculates an adjusted average of these deviations: Standard Deviation 1. Find mean 2. Find each deviation 3. Square deviations 4. Sum squared deviations 5. Divide sum by n-1 6. Take square root

Outlier An outlier falls far from the rest of the data

Graphs for Quantitative Data 1.Dot Plot: shows a dot for each observation placed above its value on a number line 2.Stem-and-Leaf Plot: displays individual observations 3.Histogram: uses bars to portray the data

Which Graph? Dot-plot and stem-and-leaf plot: More useful for small data sets Data values are retained Histogram More useful for large data sets Most compact display More flexibility in defining intervals content.answers.com

Dot Plots To construct a dot plot 1.Draw and label horizontal line 2.Mark regular values 3.Place a dot above each value on the number line Sodium in Cereals

Stem-and-leaf plots Summarizes quantitative variables Separates each observation into a stem (first part of #) and a leaf (last digit) Write each leaf to the right of its stem; order leaves if desired Sodium in Cereals