Stat 31, Section 1, Last Time Linear transformations

Slides:



Advertisements
Similar presentations
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
Advertisements

Chapter 8 Linear Regression © 2010 Pearson Education 1.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
Covariance and correlation
Chapter 2.2 STANDARD NORMAL DISTRIBUTIONS. Normal Distributions Last class we looked at a particular type of density curve called a Normal distribution.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Stat 155, Section 2, Last Time Numerical Summaries of Data: –Center: Mean, Medial –Spread: Range, Variance, S.D., IQR 5 Number Summary & Outlier Rule Transformation.
Stat 155, Section 2, Last Time Reviewed Excel Computation of: –Time Plots (i.e. Time Series) –Histograms Modelling Distributions: Densities (Areas) Normal.
Chapter 10 Correlation and Regression
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Stat 31, Section 1, Last Time Time series plots Numerical Summaries of Data: –Center: Mean, Medial –Spread: Range, Variance, S.D., IQR 5 Number Summary.
Chapter 6 The Normal Distribution and Other Continuous Distributions.
Last Time Normal Distribution –Density Curve (Mound Shaped) –Family Indexed by mean and s. d. –Fit to data, using sample mean and s.d. Computation of Normal.
Last Time Interpretation of Confidence Intervals Handling unknown μ and σ T Distribution Compute with TDIST & TINV (Recall different organization) (relative.
Stat 31, Section 1, Last Time Distributions (how are data “spread out”?) Visual Display: Histograms Binwidth is critical Bivariate display: scatterplot.
Administrative Matters Midterm II Results Take max of two midterm scores:
Stat 31, Section 1, Last Time Course Organization & Website What is Statistics? Data types.
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Case Closed The New SAT Chapter 2 AP Stats at LSHS Mr. Molesky The New SAT Chapter 2 AP Stats at LSHS Mr. Molesky.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
The Normal Distribution Chapter 3. When Exploring Data Always start by plotting your individual variables Look for overall patterns (shape, centre, spread)
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Lecture Slides Elementary Statistics Twelfth Edition
Describing Data: Two Variables
Last Time Proportions Continuous Random Variables Probabilities
Different Types of Data
Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B.
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Logistic Regression APKC – STATS AFAC (2016).
CHAPTER 2 Modeling Distributions of Data
Stat 31, Section 1, Last Time Sampling Distributions
Good Afternoon! Agenda: Knight’s Charge-please wait for direction
Module 11 Math 075. Module 11 Math 075 Bivariate Data Proceed similarly as univariate distributions … What is univariate data? Which graphical models.
Exploring Relationships Between Variables
9th Grade Making sense of data
Elementary Statistics
(Residuals and
CHAPTER 2 Modeling Distributions of Data
Regression and Residual Plots
Day 13 Agenda: DG minutes.
Lecture Slides Elementary Statistics Thirteenth Edition
Describing Bivariate Relationships
BUS7010 Quant Prep Statistics in Business and Economics
Review for Exam 2 Some important themes from Chapters 6-9
Unit 3 – Linear regression
Chapter 2 Data Analysis Section 2.2
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Diagnostics and Remedial Measures
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)
Day 8 Agenda: Quiz 1.1 & minutes Begin Ch 3.1
Bivariate Data Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change.
The Normal Distribution
Linear Regression and Correlation
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

Stat 31, Section 1, Last Time Linear transformations Standardization (subt. mean / div. be SD) 5 Number Summary & Outlier Rule Modelling distributions - density Normal distributions Density Interpretation Computation

Computation of Normal Areas Classical Approach: Tables See inside covers of text Summarizes area computations Because can’t use calculus Constructed by “computers” (a job description in the early 1900’s!)

Computation of Normal Areas EXCEL Computation: works in terms of “lower areas” E.g. for Area < 1.3

Computation of Normal Areas Interactive Version (used for above pic) From Webster West’s Website: http://www.stat.sc.edu/~west/applets/normaldemo.html

Computation of Normal Areas EXCEL Computation: (of above e.g.) Enter parameters x is “cutoff point” Return is Area below x

Computation of Normal Areas Computation of areas over intervals: (use subtraction) = -

Computation of Normal Areas Computation of areas over intervals: (use subtraction for EXCEL too) E.g. Use Excel to check 68 - 95 - 99.7% Rule https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg10.xls

Normal Area HW HW: 1.89 1.92 (Hint: the % above 500 = 100% - % below 500) 1.97 1.104 (50%, 9.18%, 0.38%, 40.82%) Caution: Don’t just “twiddle EXCEL until answer appears”. Understand it!!!

Inverse of Area Function Inverse of Frequencies: “Quantiles” Idea: Given area, find “cutoff” x I.e. for Area = 80% This x is the “quantile”

Inverse of Area Function EXCEL Computation of Quantiles: Use NORMINV Continue Class Example: https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg10.xls “Probability” is “Area” Enter mean and SD parameters

Inverse Area Example When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. The machine is “out of control” when it overfills. Choose an “alarm level”, which will give only 1 % false alarms. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99% https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg10.xls

Inverse Area HW 1.105 1.106 (-0.675, 0.675, 89.9, 110.1, 1.35, 0.7%) 1.107

Normal Diagnostic When is the Normal Model “good”? Useful Graphical Device: Q-Q plot = Normal Quantile Plot Won’t devote class time: Useful info in text about this

Variable Relationships Chapter 2 in Text Idea: Look beyond single quantities, to how quantities relate to each other. E.g. How do HW scores “relate” to Exam scores? Section 2.1: Useful graphical device: Scatterplot

Recall Scatterplot E.g. Toy Example: (1,2) (3,1) (-1,0) (2,-1)

Scatterplot E.g. Data from related Intro. Stat. Class (actual scores) How does HW score predict Final Exam? = HW, = Final Exam https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg11.xls In top half of HW scores: Better HW  Better Final For lower HW: Final is much more “random”

Scatterplots Common Terminology: When thinking about “X causes Y”, Call X the “Explanatory Var.” or “Indep. Var.” Call Y the “Response Var.” or “Dep. Var.” (think of “Y as function of X”) (although not always sensible)

Scatterplots Note: Sometimes think about causation, Other times: “Explore Relationship” HW: 2.1

Class Scores Scatterplots https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg11.xls How does HW predict Midterm 1? = HW, = MT1 Still better HW  better Exam But for each HW, wider range of MT1 scores I.e. HW doesn’t predict MT1 as well as Final “Outliers” in scatterplot may not be outliers in either individual variable e.g. HW = 72, MT1 = 94 (bad HW, but good MT1?, fluke???)

Class Scores Scatterplots https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg11.xls How does MT1 predict MT2? = MT1, = MT2 Idea: less “causation”, more “exploration Still higher MT1 associated with higher MT2 For each MT1, wider range of MT2 i.e. “not good predictor” Interesting Outliers: MT1 = 100, MT2 = 56 (oops!) MT1 = 23, MT2 = 74 (woke up!)

Important Aspects of Relations Form of Relationship Direction of Relationship Strength of Relationship

I. Form of Relationship Linear: Data approximately follow a line Previous Class Scores Example https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg11.xls Final vs. High values of HW is “best” Nonlinear: Data follows different pattern Nice Example: Bralower’s Fossil Data https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg12.xls

Bralower’s Fossil Data https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg12.xls From T. Bralower, formerly of Geological Sci. Studies Global Climate, millions of years ago: Ratios of Isotopes of Strontium Reflects Ice Ages, via Sea Level (50 meter difference!) As function of time Clearly nonlinear relationship

II. Direction of Relationship Positive Association X bigger  Y bigger Negative Association X bigger  Y smaller E.g. X = alcohol consumption, Y = Driving Ability Clear negative association

III. Strength of Relationship Idea: How close are points to lying on a line? Revisit Class Scores Example: https://www.unc.edu/~marron/UNCstat31-2005/Stat31Eg11.xls Final Exam is “closely related to HW” Midterm 1 less closely related to HW Midterm 2 even related to Midterm 1

Linear Relationship HW 2.3, 2.5, 2.7, 2.9