Stat 512 Day 4: Quantitative Data. Last Time p-values and statistical significance  What p-values tell us (and do not tell us) For now, approximating.

Slides:



Advertisements
Similar presentations
CHAPTER 1 Exploring Data
Advertisements

1 Chapter 1: Sampling and Descriptive Statistics.
Describing Data: One Quantitative Variable
Stat 217 – Day 9 Topic 9: Measures of spread. Announcements HW 3 returned  Working on Lab 2 Topic 9 today and tomorrow Review on Wednesday  HWs, Labs.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Stat 301 – Day 9 Fisher’s Exact Test Quantitative Variables.
Stat 512 – Day 5 Statistical significance with quantitative response variable.
CHAPTER 2: Describing Distributions with Numbers
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Describing distributions with numbers
Skewness & Kurtosis: Reference
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Statistics Lecture 3. Last class: types of quantitative variable, histograms, measures of center, percentiles and measures of spread…well, we shall.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Standard Deviation. Warm-up Do girls study more than boys? We asked the students in three AP Statistics classes how many minutes they studied on a typical.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Describing Data: One Quantitative Variable SECTIONS 2.2, 2.3 One quantitative.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
UNIT ONE REVIEW Exploring Data.
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Lecture #3 Tuesday, August 30, 2016 Textbook: Sections 2.4 through 2.6
Statistics 200 Lecture #4 Thursday, September 1, 2016
Mean, Median, Mode and Standard Deviation (Section 11-1)
CHAPTER 1 Exploring Data
MATH-138 Elementary Statistics
1.3 Measuring Center & Spread, The Five Number Summary & Boxplots
CHAPTER 1 Exploring Data
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 2: Describing Distributions with Numbers
Laugh, and the world laughs with you. Weep and you weep alone
Displaying Distributions with Graphs
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
DAY 3 Sections 1.2 and 1.3.
Please take out Sec HW It is worth 20 points (2 pts
Warmup What is the shape of the distribution? Will the mean be smaller or larger than the median (don’t calculate) What is the median? Calculate the.
Displaying and Summarizing Quantitative Data
CHAPTER 1 Exploring Data
Organizing Data AP Stats Chapter 1.
1.3 Describing Quantitative Data with Numbers
Describing Quantitative Data with Numbers
Exploratory Data Analysis
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
Elementary Statistics: Looking at the Big Picture
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Honors Statistics Review Chapters 4 - 5
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Ten things about Descriptive Statistics
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

Stat 512 Day 4: Quantitative Data

Last Time p-values and statistical significance  What p-values tell us (and do not tell us) For now, approximating the p-value through simulating the randomization process  How small p-values provide evidence that the difference we observed did not occur “just by chance” (randomization) Assume there is no treatment effect…  If a randomized experiment, then can also draw cause and effect conclusions

Practice Problem In (a), “controlling variables”  Specify the explanatory variable In (d), if no association…  If no relationship, same “success proportion” in each group  Not 1/2 since not equal group sizes (“significant”)  No inference here Role of randomization test  Don’t have to have equal sample sizes In (f), Causal vs. relationship Don’t panic, sorry for my biased comments

Statistical Methods Design: Planning and carrying out research studies  Observational units, Number and types of variables Descriptive: Summarizing and exploration data Inference: Making predictions or generalizing about phenomena represented by data What conclusions can we draw based on each of these three steps?

Repeat the Process – Quantitative Data Consider data collection issues Consider appropriate numerical and graphical summaries  Several measures, what does each tell you?  How do we get Minitab to do all the work? Simulation of p-values to determine statistical significance  Interpretation of p-values

Example 1: Cloud Seeding “A Bayesian analysis of a multiplicative treatment effect in weather modification”  Simpson, Alsen, Eden  Technometrics, 17, (1975)

Example 1 (a) Type of study, observational units? Experiment since randomly assigned the clouds (b) EV and RV seeding Cloudscompare rainfall no seeding randomized

Example 1 With a quantitative response variable, can compare the groups through parallel dotplots

What to look for Center Spread Shape Unusual observations

Numerical Summaries Five number summary Variable treatment Minimum Q1 Median Q3 Maximum rainfall seeded unseeded

Numerical Summaries Five number summary Min, Q1, median, Q3,outliers

Mean vs. Median

Properties The University of North Carolina took a survey of the students who had graduated as geology majors. In 1998, the average annual salary of geology majors who graduated from UNC was more than $500,000. The next year it was less than $100,000.

Summary Comparing the distribution of a quantitative variable between two or more groups  Graphical summaries: (parallel) dotplots, boxplots, side by side stemplots Center, spread, shape (skewed?), outliers  Numerical summaries Center: mean, median (five-number summary)  Mean = average of all values (not “resistant”)  Median = “typical” value Outliers: 1.5IQR criterion

Old Faithful

Histograms

Geyser Eruptions  1978 Range = = 53 minutes  2003 Range = = 54 minutes Without outliers:  = 40 minutes

Geyser Eruptions  1978 IQR = = 23 minutes  2003 IQR = = 11 minutes Without outliers IQR = = 11 minutes

Standard Deviation Want to compare the distance of the observations from the mean  Deviation from mean: y i -  Absolute deviations  Squared deviations

Old Faithful  1978 SD = 13 minutes  2003 SD = 8.5 minutes Without outliers SD=6.9 (  SD is not resistant!)

Example 3 What do we mean by variability?

Notes on histograms Left-hand endpoint rule Choice of interval widths Also watch use of “even” in describing shape (flat vs. symmetric)

Notes on Using Minitab Worksheets vs. Projects Saving graph windows Stacked vs. unstacked data

To Do For Tuesday: PP 4 For Thursday: PP 5 and reading HW 3 by Friday  Heavy Minitab component  Favor Upcoming: Project proposal