Stat 155, Section 2, Last Time Numerical Summaries of Data: –Center: Mean, Medial –Spread: Range, Variance, S.D., IQR 5 Number Summary & Outlier Rule Transformation.

Slides:



Advertisements
Similar presentations
Stor 155, Section 2, Last Time Distributions (how are data “spread out”?) Visual Display: Histograms –Binwidth is critical Time Plots = Time Series Course.
Advertisements

Class Session #2 Numerically Summarizing Data
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 1 Agenda: 1) Course web page 2) Greensheet 3) Numerical Descriptive Measures.
Ch. 6 The Normal Distribution
QBM117 Business Statistics
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Describing Data: Numerical
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
1.1 Displaying Distributions with Graphs
Chapter 1: Exploring Data AP Stats, Questionnaire “Please take a few minutes to answer the following questions. I am collecting data for my.
3.3 Density Curves and Normal Distributions
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Density Curves and the Normal Distribution.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
MAT 1000 Mathematics in Today's World. Last Time 1.Three keys to summarize a collection of data: shape, center, spread. 2.Can measure spread with the.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Guide to Using Excel For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 6th Ed. Chapter 3: Describing Data.
Summary Five numbers summary, percentiles, mean Box plot, modified box plot Robust statistic – mean, median, trimmed mean outlier Measures of variability.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Stat 31, Section 1, Last Time Time series plots Numerical Summaries of Data: –Center: Mean, Medial –Spread: Range, Variance, S.D., IQR 5 Number Summary.
1 2 nd Pre-Lab Quiz 3 rd Pre-Lab Quiz 4 th Pre-Lab Quiz.
Statistics – OR 155, Section 2 J. S. Marron, Professor Department of Statistics and Operations Research.
Last Time Normal Distribution –Density Curve (Mound Shaped) –Family Indexed by mean and s. d. –Fit to data, using sample mean and s.d. Computation of Normal.
Section 2.1 Density Curves. Get out a coin and flip it 5 times. Count how many heads you get. Get out a coin and flip it 5 times. Count how many heads.
Stat 31, Section 1, Last Time Distributions (how are data “spread out”?) Visual Display: Histograms Binwidth is critical Bivariate display: scatterplot.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Administrative Matters Midterm II Results Take max of two midterm scores:
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Stat 31, Section 1, Last Time Course Organization & Website What is Statistics? Data types.
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Copyright © 2009 Pearson Education, Inc. Slide 4- 1 Practice – Ch4 #26: A meteorologist preparing a talk about global warming compiled a list of weekly.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Section 2.1 Density Curves
Stat 31, Section 1, Last Time Linear transformations
Last Time Proportions Continuous Random Variables Probabilities
Probability and Statistics
Analysis and Empirical Results
CHAPTER 2 Modeling Distributions of Data
Normal Distribution and Parameter Estimation
CHAPTER 2 Modeling Distributions of Data
Stat 31, Section 1, Last Time Sampling Distributions
Chapter 2: Describing Location in a Distribution
Good Afternoon! Agenda: Knight’s Charge-please wait for direction
STAT 206: Chapter 6 Normal Distribution.
Chapter 2: Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
The normal distribution
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Chapter 2 Data Analysis Section 2.2
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Statistics for Managers Using Microsoft® Excel 5th Edition
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
The Normal Distribution
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

Stat 155, Section 2, Last Time Numerical Summaries of Data: –Center: Mean, Medial –Spread: Range, Variance, S.D., IQR 5 Number Summary & Outlier Rule Transformation & Summaries Course Organization & Website

Reading In Textbook Approximate Reading for Today’s Material: Pages Approximate Reading for Next Class: Pages ,

And now for something completely different Collect data (into Spreadsheet): Years stamped on coins (chosen denomination) Many as person has Enter into spreadsheet Look at “distribution” using histogram

And now for something completely different Unfortunately I lost the data… Didn’t save file??? Saved to Strange Location??? Anyway, I can’t find it… So won’t be able to finish this

A Special Request Professor Marron, I am having a lot of trouble creating time plots. Is there any way that you could walk me through creating one again or demonstrate on Tuesday? I read over the notes and the book but that didn't help. Thanks!

Exploratory Data Analysis 3 “Time Plots”, i.e. “Time Series: Idea: when time structure is important, plot variable as a function of time: variable time Often useful to “connect the dots”

Airline Passengers Example A look under the hood Use Chart Wizard Chart Type: Line (or could do XY) Use subtype for points & lines Use menu for first log10 Although could just type it in Drag down to repeat for whole column

Modelling Distributions Text: Section 1.3 Idea: Approximate histograms by: an “idealized curve” i.e. a “density curve” that represents the underlying population

Idealized Curve Example Recall Hidalgo Stamps Data, Shifting Bin Movie (made # modes change): Add idealized curve: Note: “population curve” shows why histogram modes appear and disappear

Interpretation of Density Areas under density curve, give “relative frequency” Proportion of data between = = Area under =

Interpretation of Density Note: Total Area under density = 1 (since relative freq. of everything is 1) HW: 1.80 (a: l = w = 1 b: 0.25 c: 0.5), 1.81, 1.83 Work with pencil and paper, not EXCEL

Most Useful Density “Normal Curve” = “Gaussian Density” Shape: “like a mound” E.g. of “sand dumped from a truck” Older, worse, description: “bell shaped”

Normal Density Example Winter Daily Maximum Temperatures in Melbourne, Australia Notes: Top Histogram is “mound shaped” Plus “small scale random variation” So model with “Normal Density”?

Normal Density Curves Note: there is a family of normal curves, indexed by: i.“Center”, i.e. Mean = ii.“Spread”, i.e. Stand. Deviation = Terminology: & are called “parameters” Greek “mu” ~ m Greek “sigma” ~ s

Family of Normal Curves Think about: “Shifts” (pans) indexed by “Scales” (zooms) indexed by Nice interactive graphical example: (note area under curve is always 1)

Normal Curve Mathematics The “normal density curve” is: usual “function” of circle constant = 3.14… natural number = 2.7…

Normal Curve Mathematics Main Ideas: Basic shape is: “Shifted to mu”: “Scaled by sigma”: Make Total Area = 1: divide by as, but never

Normal Model Fitting Idea: Choose to give: “good” fit to data. Approach: IF the distribution is “mound shaped” & outliers are negligible THEN a “good” choice of normal model is:

Normal Fitting Example Revisit Melbourne Daily Max Temps Fit curve, using “Visually good” approximation

Normal Fitting Example A look under the hood Use chosen (not default) histogram bins for nice comparison bins Use longer range to avoid the “More” bin Can compute with density formula (Two steps, in cols F and G) Or use NORMDIST function (col J, check same as col G)

Normal Curve HW C5: A study of distance runners found a mean weight of 63.1 kg, with a standard deviation of 4.8 kg. Assuming that the distribution of weights is normal, use EXCEL to draw the density curve of the weight distribution.