Stat 31, Section 1, Last Time Time series plots Numerical Summaries of Data: –Center: Mean, Median –Spread: Range, Variance, S.D., IQR 5 Number Summary.

Presentation transcript:

Stat 31, Section 1, Last Time: Time series plots. Numerical Summaries of Data: –Center: Mean, Median –Spread: Range, Variance, S.D., IQR. 5 Number Summary & Outlier Rule. Course Organization & Website

Comments From Grader: I encountered some problems in the grading. These problems are: 1. The homework pages are not stapled together. 2. The answers are not in the same order as the questions. 3. The results, especially in EXCEL tables, are not highlighted. Could you please emphasize the above problems in your class? If the students follow the rules, the grading will be much easier. In the grading of homework #2, I also hope that you can allow me to enforce the rules by giving zero points.

Linear Transformations Idea: What happens to data & summaries, when data are: “shifted and scaled” i.e. “panned and zoomed” Math: $x_i \mapsto a + b x_i$ (Shifted by $a$, Scaled by $b$)

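Linear Transformations Effect on linear summaries: Centerpoints (mean and median) “follow data”: the mean of $a + b x_i$ is $a + b\bar{x}$. Spreads (S.D. and IQR) “feel scale, not shift”: the S.D. of $a + b x_i$ is $|b| s$.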
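A quick numerical check of these two facts, as a minimal sketch. It assumes the transformation is written as y = a + b * x; the data values and the choice a = 32, b = 1.8 (Celsius to Fahrenheit) are made up for illustration and are not from the lecture.

```python
import statistics

def linear_transform(data, a, b):
    """Shift by a and scale by b: y = a + b * x (assumed form of the transformation)."""
    return [a + b * x for x in data]

x = [2.0, 4.0, 4.0, 5.0, 10.0]   # made-up illustration data
a, b = 32.0, 1.8                 # hypothetical shift and scale
y = linear_transform(x, a, b)

mean_x, mean_y = statistics.mean(x), statistics.mean(y)
med_x, med_y = statistics.median(x), statistics.median(y)
sd_x, sd_y = statistics.stdev(x), statistics.stdev(y)

# Centerpoints follow the data; spreads feel only the scale.
print(mean_y, a + b * mean_x)
print(med_y, a + b * med_x)
print(sd_y, abs(b) * sd_x)
```

Each printed pair should agree up to rounding error.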

Most Useful Linear Transformation: “Standardization” Goal: put data sets on “common scale” Approach: 1. Subtract Mean, to “center at 0” 2. Divide by S.D., to “give common SD = 1”

Standardization Result is called “z-score”: $z_i = \frac{x_i - \bar{x}}{s}$. Note that $x_i = \bar{x} + z_i s$. Thus $z_i$ is interpreted as: “number of SDs from the mean”

Standardization Example Buffalo Snowfall Data: Standardized data have same (EXCEL default) histogram shape as raw data. (Since axes and bin edges just follow the transformation) i.e. “shape” doesn’t depend on “scaling”

Standardization Example A look under the hood: 1. Compute AVERAGE and SD 2. Standardize by: a. Create Formula in cell B2 b. Drag downwards c. Keep Mean and SD cells fixed using $s 3. Check stand’d data have mean 0 & SD 1; note that “8.247E-16 = 0” (i.e. 0 up to rounding error)
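The same steps can be sketched outside EXCEL. This is only an illustration: the data values are made up (they are not the Buffalo snowfall data), and the sample S.D. with divisor n - 1 is assumed.

```python
import statistics

def standardize(data):
    """Return z-scores: subtract the mean, then divide by the sample S.D."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)  # sample S.D. (n - 1 divisor), an assumption here
    return [(x - mean) / sd for x in data]

values = [25.0, 39.8, 46.7, 79.0, 110.5, 126.4]  # hypothetical data, for illustration only
z = standardize(values)

z_mean = statistics.mean(z)
z_sd = statistics.stdev(z)

# The mean prints as 0 or as something tiny (floating-point rounding error);
# the S.D. prints as 1 up to rounding.
print(z_mean, z_sd)
```

As on the slide, a tiny nonzero mean is rounding error and should be read as 0.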

Standardization HW C6: For the 18 female scores in 1.49, use EXCEL to: a. Give the list of standardized scores b. Give the Z-score for: (i) the mean (0) (ii) the median ( ) (iii) the smallest (-1.52) (iv) the largest (2.23) 1.79

Modelling Distributions Text: Section 1.3 Idea: Approximate histograms by: an “idealized curve” i.e. a “density curve” that represents the population

Idealized Curve Example Recall Hidalgo Stamps Data, Shifting Bin Movie (made # modes change): Add idealized curve: Note: “population curve” shows why histogram modes appear and disappear

Interpretation of Density: Areas under the density curve give “relative frequency”. Proportion of data between two values = Area under the density curve between those values

Interpretation of Density Note: Total Area under density = 1 (since relative freq. of everything is 1) HW: 1.78 (b: 0.8), 1.79 Work with pencil and paper, not EXCEL
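To make "area = relative frequency" concrete, here is a small sketch using a hypothetical triangular density on [0, 2] (chosen only because its areas are easy to check by hand; it does not come from the text). Areas are approximated with the trapezoid rule.

```python
def area_under(f, lo, hi, n=10_000):
    """Trapezoid-rule approximation of the area under f between lo and hi."""
    h = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, n):
        total += f(lo + i * h)
    return total * h

def density(x):
    """Hypothetical triangular density: rises to height 1 at x = 1, then falls to 0 at x = 2."""
    if 0.0 <= x <= 1.0:
        return x
    if 1.0 < x <= 2.0:
        return 2.0 - x
    return 0.0

total_area = area_under(density, 0.0, 2.0)    # relative frequency of everything
prop_middle = area_under(density, 0.5, 1.5)   # proportion of data between 0.5 and 1.5

print(total_area, prop_middle)
```

By pencil and paper the total area is 1 and the middle proportion is 0.75 (the two small triangles removed each have area 0.125), which the numerical values should match closely.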

Most Useful Density “Normal Curve” = “Gaussian Density” Shape: “like a mound” E.g. of “sand dumped from a truck” Older, worse, description: “bell shaped”

Normal Density Example Winter Daily Maximum Temperatures in Melbourne, Australia Notes: Top Histogram is “mound shaped” Plus “small scale random variation” So model with “Normal Density”?

Normal Density Curves Note: there is a family of normal curves, indexed by: i. “Center”, i.e. Mean = $\mu$ ii. “Spread”, i.e. Stand. Deviation = $\sigma$ Terminology: $\mu$ & $\sigma$ are called “parameters” ($\mu$ = Greek “mu”, $\sigma$ = Greek “sigma” ~ s)

Family of Normal Curves Think about: “Shifts” (pans) indexed by $\mu$, “Scales” (zooms) indexed by $\sigma$. Nice interactive graphical example: (note area under curve is always 1)

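Normal Curve Mathematics The “normal density curve” is: $f(x) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}$, the usual “function” of $x$, where $\pi$ = circle constant = 3.14… and $e$ = natural number = 2.7…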

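Normal Curve Mathematics Main Ideas: Basic shape is: $e^{-\frac{1}{2}x^2}$. “Shifted to mu”: $e^{-\frac{1}{2}(x-\mu)^2}$. “Scaled by sigma”: $e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}$. Make Total Area = 1: divide by $\sigma\sqrt{2\pi}$. The curve $\to 0$ as $x \to \pm\infty$, but never $= 0$.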
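The standard normal density formula can be written out directly, as in the sketch below. The function name `normal_density` and the crude Riemann-sum area check are illustrative choices, not part of the lecture.

```python
import math

def normal_density(x, mu=0.0, sigma=1.0):
    """Normal density: basic shape exp(-z^2 / 2), shifted to mu, scaled by sigma,
    and divided by sigma * sqrt(2 * pi) so the total area is 1."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def approx_total_area(mu, sigma, n=20_000):
    """Midpoint Riemann sum over mu +/- 10 sigma (the tails beyond are negligible)."""
    lo, hi = mu - 10.0 * sigma, mu + 10.0 * sigma
    h = (hi - lo) / n
    return sum(normal_density(lo + (i + 0.5) * h, mu, sigma) for i in range(n)) * h

peak = normal_density(0.0)             # height of the standard normal curve at its center
area = approx_total_area(63.1, 4.8)    # parameters borrowed from the runners' weights in HW C7
far_tail = normal_density(10.0)        # very small, but still positive

print(peak, area, far_tail)
```

The area stays at 1 for any choice of mu and sigma > 0, and the far-tail value shows the curve getting close to 0 without reaching it.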

Normal Model Fitting Idea: Choose $\mu$ & $\sigma$ to give: “good” fit to data. Approach: IF the distribution is “mound shaped” & outliers are negligible THEN a “good” choice of normal model is: $\mu = \bar{x}$, $\sigma = s$

Normal Fitting Example Revisit Melbourne Daily Max Temps. Fit curve, using the sample mean and S.D. as $\mu$ and $\sigma$: a “Visually good” approximation

Normal Fitting Example A look under the hood: Use chosen (not default) histogram bins, for nice comparison bins. Use longer range, to avoid the “More” bin. Can compute with density formula (two steps, in cols F and G), or use NORMDIST function (col J, check same as col G)
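The "formula versus built-in function" check can be sketched in Python as well. Here `statistics.NormalDist(...).pdf` plays the role that NORMDIST plays in the spreadsheet; the temperature values are made up and are not the Melbourne data.

```python
import math
import statistics

temps = [12.1, 13.4, 14.0, 14.8, 15.2, 15.9, 16.3, 17.5, 18.8]  # hypothetical data

# Fit the normal model: mu = sample mean, sigma = sample S.D.
mu = statistics.mean(temps)
sigma = statistics.stdev(temps)
model = statistics.NormalDist(mu, sigma)

def density_formula(x, mu, sigma):
    """Normal density computed directly from the formula."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

# Evaluate both versions on a small grid around the mean and compare.
grid = [mu + k * sigma for k in (-2, -1, 0, 1, 2)]
by_formula = [density_formula(x, mu, sigma) for x in grid]
by_library = [model.pdf(x) for x in grid]

for x, f, g in zip(grid, by_formula, by_library):
    print(round(x, 2), f, g)
```

The two density columns should agree up to rounding, which is the same check as comparing cols G and J.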

Normal Curve HW C7: A study of distance runners found a mean weight of 63.1 kg, with a standard deviation of 4.8 kg. Assuming that the distribution of weights is normal, use EXCEL to draw the density curve of the weight distribution.

2 Views of Normal Fitting 1. “Fit Model to Data”: Choose $\mu$ & $\sigma$. 2. “Fit Data to Model”: First Standardize Data, Then use Normal with mean 0 & SD 1. Note: same thing, just different rescalings (choose scale depending on need)
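A small sketch of why the two views give the same answer: the area between two values under the fitted normal curve equals the area between their z-scores under the normal curve with mean 0 and SD 1. The parameter values and endpoints below are hypothetical.

```python
from statistics import NormalDist

mu, sigma = 20.0, 5.0      # hypothetical fitted mean and S.D.
lo, hi = 15.0, 30.0        # hypothetical interval of interest

# View 1: fit the model to the data, then take areas on the original scale.
fitted = NormalDist(mu, sigma)
p_view1 = fitted.cdf(hi) - fitted.cdf(lo)

# View 2: standardize the endpoints, then take areas under the mean 0, SD 1 curve.
standard = NormalDist(0.0, 1.0)
z_lo, z_hi = (lo - mu) / sigma, (hi - mu) / sigma
p_view2 = standard.cdf(z_hi) - standard.cdf(z_lo)

print(p_view1, p_view2)
```

Both proportions should match up to rounding, so the choice of scale is one of convenience.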

Normal Distribution Notation The “normal distribution, with mean $\mu$ & standard deviation $\sigma$” is abbreviated as: $N(\mu, \sigma)$

Interpretation of Z-scores Idea: Z-scores are on the mean 0, SD 1 scale, so use areas to interpret. Important Areas: 1. Within 1 sd of mean: about 68%, “the majority”

Interpretation of Z-scores 2. Within 2 sd of mean: about 95%, “really most” 3. Within 3 sd of mean: about 99.7%, “almost all”

Interpretation of Z-scores Interactive Version (used for above pics) From Webster West’s Website:

Interpretation of Z-scores Summary: These relations are called the “68 - 95 - 99.7 % Rule” HW: 1.82 (a: , b: 234, 298), 1.83
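The three areas can be checked numerically. The sketch below uses the standard identity that, for a normal distribution, the proportion within k SDs of the mean is erf(k / sqrt(2)); the helper name `within_k_sd` is just an illustrative choice.

```python
import math

def within_k_sd(k):
    """Proportion of a normal distribution lying within k SDs of its mean."""
    return math.erf(k / math.sqrt(2.0))

areas = {k: within_k_sd(k) for k in (1, 2, 3)}

for k, p in areas.items():
    print(f"within {k} sd of mean: {100 * p:.1f}%")
```

The printed percentages are 68.3%, 95.4% and 99.7%, which the rule rounds to its usual three numbers.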