Module 21 Module 2: Terminology of Data Sets Attributes of Data Sets (Mean and Spread) Melinda Ronca-Battista, ITEP Catherine Brown, U.S. EPA.

Slides:



Advertisements
Similar presentations
Sampling Distributions and Sample Proportions
Advertisements

Introduction to Summary Statistics
4.2.2 Inductive Statistics 1 UPA Package 4, Module 2 INDUCTIVE STATISTICS.
Math 20: Foundations FM20.6 Demonstrate an understanding of normal distribution, including standard deviation and z-scores. FM20.7 Demonstrate understanding.
Psy302 Quantitative Methods
Sampling Distributions
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
Chapter 11: Inference for Distributions
Jeff Hosea : University of Surrey, Physics Dept, Level 1 Labs, Oct Level 1 Laboratories A Rough Summary of Key Error Formulae for samples of random.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Descriptive Statistics
Measures of Variability: Range, Variance, and Standard Deviation
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area.
Unit 9: Statistics By: Jamie Fu and Neha Surapaneni.
BPT 2423 – STATISTICAL PROCESS CONTROL.  Frequency Distribution  Normal Distribution / Probability  Areas Under The Normal Curve  Application of Normal.
Sampling Distributions
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
BASIC NOTATION. Summation (  ) X i = The number of meals I have on day “i” X= 1,2,3,2,1  X i = ???  X i 2 = ??? (  X i ) 2 = ???
Get out your Density Curve WS! You will be able to describe a Normal curve. You will be able to find percentages and standard deviations based on a Normal.
Review – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation.
Confidence Intervals Confidence Interval for a Mean
Estimation: Sampling Distribution
OA Cancelled due to technical difficulties… OA Cancelled due to technical difficulties… Tonight’s HW is A2: 541(11b = PANIC and 11d) We will start using.
Introduction to Summary Statistics
A P STATISTICS LESSON 2 – 2 STANDARD NORMAL CALCULATIONS.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Density Curves and the Normal Distribution.
Developed at Utah State University Dept of Engr & Tech Educ — Materials and Processes 5.6 calculate the mean and standard deviation of.
Normal Curves and Sampling Distributions Chapter 7.
Module 13: Normal Distributions This module focuses on the normal distribution and how to use it. Reviewed 05 May 05/ MODULE 13.
Statistics PSY302 Quiz One Spring A _____ places an individual into one of several groups or categories. (p. 4) a. normal curve b. spread c.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Module 7: Comparing Datasets and Comparing a Dataset with a Standard How different is enough?
Normal Curves Often good representation of real data Often good approximations of chance outcomes.
Section 2.1 Density Curves. Get out a coin and flip it 5 times. Count how many heads you get. Get out a coin and flip it 5 times. Count how many heads.
Copyright © 2012 Pearson Education, Inc. All rights reserved Chapter 9 Statistics.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Descriptive Statistics Review – Chapter 14. Data  Data – collection of numerical information  Frequency distribution – set of data with frequencies.
Modeling Distributions
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
Confidence Intervals for a Population Proportion Excel.
Chapter 4 Exploring Chemical Analysis, Harris
Module 11 Module I: Terminology— Data Quality Indicators (DQIs) Melinda Ronca-Battista ITEP Catherine Brown U.S. EPA.
m/sampling_dist/index.html.
Sampling Distributions Chapter 18. Sampling Distributions If we could take every possible sample of the same size (n) from a population, we would create.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Comparing Datasets and Comparing a Dataset with a Standard How different is enough?
In your groups, go through all four steps of a confidence interval: The “Country Taste” bread making company wants to estimate the actual weight of their.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Sampling Distributions Chapter 18
Statistical Methods Michael J. Watts
Statistical Methods Michael J. Watts
Objectives The student will be able to:
Stat 2411 Statistical Methods
Comparing Groups April 6-7, 2017 CS 160 – Section 10.

Section 2: Estimating with Small Samples
Statistics PSY302 Review Quiz One Fall 2018
Algebra 1/4/17
Normal Distribution.
Descriptive and inferential statistics. Confidence interval
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Statistics PSY302 Review Quiz One Spring 2017
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

module 21 Module 2: Terminology of Data Sets Attributes of Data Sets (Mean and Spread) Melinda Ronca-Battista, ITEP Catherine Brown, U.S. EPA

module 22 Histogram  a.k.a. “frequency distribution”  Many types of datasets form “bell”-shaped histograms  a.k.a. “normal,” “standard,” “Gaussian” curves

module 23 Typical Histogram

module 24 Only 2 Factors  Mean (center of data, where most data are)  Spread (how far from mean is how much of the data)

module 25 Center  Mean = average  Outliers can strongly affect the mean  Distribution may not be symmetrical

module 26 Many Environmental Distributions

module 27 Normal Distribution Useful

module 28Spread  Sample standard deviation  STDEV(range)  The bigger the STDEV is, compared to the mean, the wider the spread  COV = STDEV/mean

module 29 How can we use normal distribution?  “Map” our distribution onto a normal distribution, using our mean and stdev  Then can predict how many values in different degrees of “spread” away from mean

module 210 Sample vs. Population  Our sample is a subset  We assume our subset is subset of “real” population  The closer our subset is to the real population, the better our prediction will be  Good sampling plans produce better representations of the “real” distribution

module 211 Subsets Might be Biased

module 212 Terminology  Mu =  mean  Sigma = s = standard deviation

module 213 How is this useful?  Calculate mean and stdev  NOW can predict reality!  Put any “x” value in context of how many STDEVs away from mean it is

module 214 Standard Deviation  STDEV(range)

module 215 Z Score  Z shows how far away from the mean is the “x” value you are interested in

module 216 Z scores are Proportions of Spread

module 217

module 218 Sample Size Affects Confidence:  The more N, the better your estimate (STDEV) reflects the real spread

module 219 Air Quality  Daily sampling is best estimate of reality  Compromise with 1 in 3 day sampling  Worse estimate with 1 in 6 day  Even worse with 1 in 12 day  How well does one location estimate all air in airshed?  Compromise in both frequency and number of sites

module 220 Module 2 Summary  Data sets estimate reality  Mean (average)  Spread (stdev)  N  Good sampling plans produce good estimates of reality