Stat 512 Day 4: Quantitative Data


Last Time
- p-values and statistical significance
  - What p-values tell us (and do not tell us)
- For now, approximating the p-value by simulating the randomization process
  - How small p-values provide evidence that the difference we observed did not occur “just by chance” (randomization)
- Assume there is no treatment effect…
  - If a randomized experiment, then can also draw cause-and-effect conclusions

Practice Problem
- In (a), “controlling variables”
  - Specify the explanatory variable
- In (d), if no association…
  - If no relationship, same “success proportion” in each group
  - Not 1/2 since not equal group sizes
- (“significant”)
  - No inference here
- Role of randomization test
  - Don’t have to have equal sample sizes
- In (f), causal vs. relationship
- Don’t panic, sorry for my biased comments

Statistical Methods
- Design: Planning and carrying out research studies
  - Observational units, number and types of variables
- Descriptive: Summarizing and exploring data
- Inference: Making predictions or generalizing about phenomena represented by data
What conclusions can we draw based on each of these three steps?

Repeat the Process – Quantitative Data
- Consider data collection issues
- Consider appropriate numerical and graphical summaries
  - Several measures, what does each tell you?
  - How do we get Minitab to do all the work?
- Simulation of p-values to determine statistical significance
  - Interpretation of p-values
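As a rough illustration of simulating a p-value with a quantitative response, the sketch below re-randomizes the group labels many times and counts how often the difference in means is at least as large as the one observed. The function name, the number of repetitions, and the rainfall-like values are all hypothetical (they are not the cloud seeding data), and the course itself uses Minitab rather than Python.

```python
import random
from statistics import mean


def randomization_p_value(group_a, group_b, reps=5000, seed=0):
    """Approximate a one-sided p-value for mean(group_a) - mean(group_b).

    Assumes no treatment effect, so every response value would have been
    the same no matter which group the unit was assigned to.
    """
    rng = random.Random(seed)
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    at_least_as_large = 0
    for _ in range(reps):
        rng.shuffle(pooled)  # re-do the random assignment
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if diff >= observed:
            at_least_as_large += 1
    return observed, at_least_as_large / reps


# Hypothetical values for illustration only
seeded = [10, 12, 11, 13, 12]
unseeded = [1, 2, 3, 2, 1]
observed, p = randomization_p_value(seeded, unseeded)
print(f"observed difference = {observed:.2f}, approximate p-value = {p:.4f}")
```

A small approximate p-value says that a difference this large rarely arises from the random assignment alone. The group sizes do not need to be equal for this to work.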

Example 1: Cloud Seeding
“A Bayesian analysis of a multiplicative treatment effect in weather modification”
- Simpson, Olsen, Eden
- Technometrics, 17, (1975)

Example 1
(a) Type of study, observational units? Experiment, since the clouds were randomly assigned
(b) EV and RV
[Diagram: clouds, randomized to seeding or no seeding; compare rainfall]

Example 1 With a quantitative response variable, can compare the groups through parallel dotplots

What to look for
- Center
- Spread
- Shape
- Unusual observations

Numerical Summaries
Five-number summary
Variable   treatment   Minimum   Q1   Median   Q3   Maximum
rainfall   seeded
           unseeded
[Table values not preserved in the transcript]
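A five-number summary can be computed along the following lines. This is only a sketch: the function name and the data are made up for illustration, and the quartile convention chosen here (`method="exclusive"` in Python's `statistics.quantiles`) is an assumption. Software packages differ in how they interpolate quartiles, so results may not match Minitab exactly.

```python
from statistics import quantiles


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    data = sorted(values)
    # n=4 asks for the three cut points that split the data into quarters
    q1, median, q3 = quantiles(data, n=4, method="exclusive")
    return data[0], q1, median, q3, data[-1]


# Hypothetical values, not the rainfall data
example = [1, 2, 3, 4, 5, 6, 7, 8, 9]
print(five_number_summary(example))
```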

Numerical Summaries
Five-number summary
Min, Q1, median, Q3, outliers

Mean vs. Median

Properties The University of North Carolina took a survey of the students who had graduated as geology majors. In 1998, the average annual salary of geology majors who graduated from UNC was more than $500,000. The next year it was less than $100,000.
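The salary story turns on the mean not being resistant to a single extreme value. A small sketch with invented salaries (these are not UNC figures) shows the mean jumping while the median barely moves:

```python
from statistics import mean, median

# Hypothetical salaries in dollars, for illustration only
typical_salaries = [35_000, 38_000, 40_000, 42_000, 45_000]
with_one_extreme = typical_salaries + [3_000_000]

print("without extreme value: mean =", mean(typical_salaries),
      "median =", median(typical_salaries))
print("with extreme value:    mean =", round(mean(with_one_extreme)),
      "median =", median(with_one_extreme))
```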

Summary
Comparing the distribution of a quantitative variable between two or more groups
- Graphical summaries: (parallel) dotplots, boxplots, side-by-side stemplots
  - Center, spread, shape (skewed?), outliers
- Numerical summaries
  - Center: mean, median (five-number summary)
    - Mean = average of all values (not “resistant”)
    - Median = “typical” value
  - Outliers: 1.5 × IQR criterion
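The 1.5 × IQR criterion flags any value more than 1.5 IQRs below Q1 or above Q3. A minimal sketch, with a hypothetical function name and made-up data, and assuming the "exclusive" quartile convention from Python's standard library (other software may place the quartiles slightly differently):

```python
from statistics import quantiles


def iqr_outliers(values):
    """Return the values flagged by the 1.5 x IQR criterion."""
    q1, _, q3 = quantiles(sorted(values), n=4, method="exclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]


# Hypothetical data: nine ordinary values and one unusually large one
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]
print(iqr_outliers(data))
```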

Old Faithful

Histograms

Geyser Eruptions
- 1978: Range = 53 minutes
- 2003: Range = 54 minutes
  - Without outliers: Range = 40 minutes

Geyser Eruptions
- 1978: IQR = 23 minutes
- 2003: IQR = 11 minutes
  - Without outliers: IQR = 11 minutes

Standard Deviation
Want to compare the distance of the observations from the mean
- Deviation from mean: $y_i - \bar{y}$
- Absolute deviations
- Squared deviations
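For reference, averaging the squared deviations and taking a square root gives the usual sample standard deviation. The notation here ($n$ observations $y_1, \dots, y_n$ with sample mean $\bar{y}$) is assumed, since the slide's own symbols did not survive in the transcript:

```latex
\bar{y} = \frac{1}{n}\sum_{i=1}^{n} y_i,
\qquad
s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n} \left(y_i - \bar{y}\right)^2}
```

Squaring keeps the deviations from cancelling out, and the square root returns the result to the original units (minutes, in the geyser example).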

Old Faithful
- 1978: SD = 13 minutes
- 2003: SD = 8.5 minutes
  - Without outliers: SD = 6.9 minutes (so SD is not resistant!)
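To see the same pattern on a small scale, the sketch below computes the range, IQR, and SD for a made-up set of waiting times with and without one unusually short value. The numbers are hypothetical (not the Old Faithful data), and the quartile convention is again an assumption.

```python
from statistics import quantiles, stdev


def spread_measures(values):
    """Return (range, IQR, sample SD) for a list of numbers."""
    data = sorted(values)
    q1, _, q3 = quantiles(data, n=4, method="exclusive")
    return data[-1] - data[0], q3 - q1, stdev(data)


# Hypothetical times in minutes
without_outlier = [70, 72, 75, 78, 80, 82, 85, 88]
with_outlier = without_outlier + [40]

for label, values in [("with outlier", with_outlier),
                      ("without outlier", without_outlier)]:
    value_range, iqr, sd = spread_measures(values)
    print(f"{label}: range = {value_range}, IQR = {iqr}, SD = {sd:.1f}")
```

The range and SD change noticeably when the single extreme value is dropped, while the IQR moves only a little.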

Example 3 What do we mean by variability?

Notes on histograms
- Left-hand endpoint rule
- Choice of interval widths
- Also watch use of “even” in describing shape (flat vs. symmetric)
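Under the left-hand endpoint rule, a value that falls exactly on a boundary is counted in the interval that starts at that boundary. A minimal sketch of that binning, with a hypothetical function name, made-up values, and equal-width intervals assumed:

```python
def bin_counts(values, start, width, n_bins):
    """Count values in intervals [start, start + width), [start + width, ...).

    Each interval includes its left endpoint and excludes its right one.
    """
    counts = [0] * n_bins
    for v in values:
        index = int((v - start) // width)
        if not 0 <= index < n_bins:
            raise ValueError(f"{v} is outside the binned range")
        counts[index] += 1
    return counts


# Intervals are [10, 20), [20, 30), [30, 40); 20 and 30 sit on boundaries
print(bin_counts([10, 19, 20, 30, 39], start=10, width=10, n_bins=3))
```

Changing `width` in this sketch is a quick way to see how the choice of interval width can alter the apparent shape of a histogram.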

Notes on Using Minitab
- Worksheets vs. Projects
- Saving graph windows
- Stacked vs. unstacked data
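Stacked data keep all response values in one column with a second column of group labels; unstacked data use one column per group. The sketch below converts between the two layouts in plain Python. It only illustrates the idea, it is not how Minitab does it, and the function names and values are hypothetical.

```python
from collections import defaultdict


def unstack(values, labels):
    """Turn one column of values plus one column of labels into one list per group."""
    columns = defaultdict(list)
    for value, label in zip(values, labels):
        columns[label].append(value)
    return dict(columns)


def stack(columns):
    """Turn one list per group back into a values column and a labels column."""
    values, labels = [], []
    for label, column in columns.items():
        for value in column:
            values.append(value)
            labels.append(label)
    return values, labels


# Hypothetical stacked data
rainfall = [5, 7, 6, 9]
treatment = ["seeded", "unseeded", "seeded", "unseeded"]
print(unstack(rainfall, treatment))
```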

To Do
- For Tuesday: PP 4
- For Thursday: PP 5 and reading
- HW 3 by Friday
  - Heavy Minitab component
  - Favor
- Upcoming: Project proposal