June 2, 2008Stat 111 - Lecture 18 - Review1 Final review Statistics 111 - Lecture 18.

Slides:



Advertisements
Similar presentations
Data: Quantitative (Histogram, Stem & Leaf, Boxplots) versus Categorical (Bar or Pie Chart) Boxplots: 5 Number Summary, IQR, Outliers???, Comparisons.
Advertisements

AP Statistics Course Review.
June 9, 2008Stat Lecture 8 - Sampling Distributions 1 Introduction to Inference Sampling Distributions Statistics Lecture 8.
Inference for Regression
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
July 1, 2008Lecture 17 - Regression Testing1 Testing Relationships between Variables Statistics Lecture 17.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 10 Simple Regression.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Chapter 7 Sampling and Sampling Distributions
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Monday, 4/29/02, Slide #1 MA 102 Statistical Controversies Monday, 4/29/02 Today: CLOSING CEREMONIES!  Discuss HW #3  Review for final exam  Evaluations.
The Simple Regression Model
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Chapter 11: Inference for Distributions
Inferences About Process Quality
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
June 23, 2008Stat Lecture 13 - One Mean1 Inference for a Population Mean Confidence Intervals and Tests with unknown variance and Two- sample Tests.
AP Stats Test Review  What are the four parts of the course? Inference, Experimental Design, Probability, and Data Analysis  How many multiple choice.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Things that I think are important Chapter 1 Bar graphs, histograms Outliers Mean, median, mode, quartiles of data Variance and standard deviation of.
● Midterm exam next Monday in class ● Bring your own blue books ● Closed book. One page cheat sheet and calculators allowed. ● Exam emphasizes understanding.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Mid-Term Review Final Review Statistical for Business (1)(2)
June 25, 2008Stat Lecture 14 - Two Means1 Comparing Means from Two Samples Statistics 111 – Lecture 14 One-Sample Inference for Proportions and.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Review ~ Data & Displays Chapters 1-6 Key Terms: –Data : Categorical versus Quantitative –Frequency Table/Contingency Table/Marginal Distribution/Conditional.
© Copyright McGraw-Hill 2004
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
CHAPTER 12 More About Regression
AP Stat Review Know Your Book!
Inference for Regression
Comparing Means from Two Samples
CHAPTER 12 More About Regression
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Elementary Statistics: Looking at the Big Picture
When You See (This), You Think (That)
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
CHAPTER 12 More About Regression
Introductory Statistics
Presentation transcript:

June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18

June 2, 2008Stat Lecture 18 - Review2 Administrative Notes Pass in homework Final exam on Thursday

June 2, 2008Stat Lecture 18 - Review3 Final Exam Tomorrow from 10:40-12:10 Show up a bit early, if you arrive after 10:40 you will be wasting time for your exam Calculators are definitely needed! Single 8.5 x 11 cheat sheet (two-sided) allowed Sample final is posted on the website Solutions for the final are posted on the website

June 2, 2008Stat Lecture 18 - Review4 Final Exam The test is going to be long – 6 or 7 questions Pace yourself Do NOT leave anything blank! I want to give you credit for what you know give me an excuse to give you points! The questions are NOT in order of difficulty If you get stuck on a question put it on hold and try a different question

June 2, 2008Stat Lecture 18 - Review5 Outline Collecting Data (Chapter 3) Exploring Data - One variable (Chapter 1) Exploring Data - Two variables (Chapter 2) Probability (Chapter 4) Sampling Distributions (Chapter 5) Introduction to Inference (Chapter 6) Inference for Population Means (Chapter 7) Inference for Population Proportions (Chapter 8) Inference for Regression (Chapter 10)

June 2, 2008Stat Lecture 18 - Review6 Experiments Try to establish the causal effect of a treatment Key is reducing presence of confounding variables Matching: ensure treatment/control groups are very similar on observed variables eg. race, gender, age Randomization: randomly dividing into treatment or control leads to groups that are similar on observed and unobserved confounding variables Double-Blinding: both subjects and evaluators don’t know who is in treatment group vs. control group Population Experimental Units Treatment Group Control Group Treatment No Treatment

June 2, 2008Stat Lecture 18 - Review7 Just like in experiments, we must be cautious of potential sources of bias in our sampling results Voluntary response samples, undercoverage, non- response, untrue-response, wording of questions Simple Random Sampling: less biased since each individual in the population has an equal chance of being included in the sample Population Sample Parameter Statistic Sampling Inference Estimation ? Sampling and Surveys

June 2, 2008Stat Lecture 18 - Review8 Different Types of Graphs A distribution describes what values a variable takes and how frequently these values occur Boxplots are good for center,spread, and outliers but don’t indicate shape of a distribution Histograms much more effective at displaying the shape of a distribution

June 2, 2008Stat Lecture 18 - Review9 Measures of Center and Spread Center: Mean Spread: Standard Deviation For outliers or asymmetry, median/IQR are better Center: Median - “middle number in distribution” Spread: Inter-Quartile Range IQR = Q3 - Q1 We use mean and SD more since most distributions we use are symmetric with no outliers(eg. Normal)

June 2, 2008Stat Lecture 18 - Review10 Relationships between continuous var. Scatterplot examines relationship between response variable (Y) and a explanatory variable (X): Positive vs. negativeassociations Correlation is a measure of the strength of linear relationship between variables X and Y r near 1 or -1 means strong linear relationship r near 0 means weak linear relationship Linear Regression: come back to later… Education and Mortality: r = -0.51

June 2, 2008Stat Lecture 18 - Review11 Probability Random process: outcome not known exactly, but have probability distribution of possible outcomes Event: outcome of random process with prob. P(A) Probability calculations: combinations of rules Equally likely outcomes rule Complement rule Additive rule for disjoint events Multiplication rule for independent events Random variable: a numerical outcome or summary of a random process Discrete r.v. has a finite number of distinct values Continuous r.v. has a non-countable number of values Linear transformations of variables

June 2, 2008Stat Lecture 18 - Review12 The Normal Distribution The Normal distribution has center  and spread  2 Have tables for any probability from the standard normal distribution (  = 0 and  2 = 1) Standardization: converting X which has a N( ,  2 ) distribution to Z which has a N(0,1) distribution: Z = (x-μ)/σ Reverse standardization: converting a standard normal Z into a non-standard normal X X = μ+zσ N(0,1) N(2,1) N(0,2)N(-1,2)

June 2, 2008Stat Lecture 18 - Review13 Inference using Samples Continuous: pop. mean estimated by sample mean Discrete: pop. proportion estimated by sample proportion Key for inference: Sampling Distributions Distribution of values taken by statistic in all possible samples from the same population Population Sample Parameters:  or p Statistics: or Sampling Inference Estimation ?

June 2, 2008Stat Lecture 18 - Review14 Sampling Distribution of Sample Mean The center of the sampling distribution of the sample mean is the population mean: mean( ) = μ Over all samples, the sample mean will, on average, be equal to the population mean (no guarantees for 1 sample!) The standard deviation of the sampling distribution of the sample mean is sd( )=σ/√n As sample size increases, standard deviation of the sample mean decreases! Central Limit Theorem: if the sample size is large enough, then the sample mean has an approximately Normal distribution

June 2, 2008Stat Lecture 18 - Review15 Binomial/Normal Dist. For Proportions Sample count Y follows Binomial distribution which we can calculate from Binomial tables in small samples If the sample size is large (n·p and n(1-p)≥ 10), sample count Y follows a Normal distribution: mean(Y) = np var(Y) = np(1-p) If the sample size is large, the sample proportion also approximately follows a Normal distribution: Mean( ) = p Var( ) = p(1-p)/n

June 2, 2008Stat Lecture 18 - Review16 Type of Data Unknown Parameter Statistic Variability of Statistic Distribution of Statistic Continuous  σ/√n Normal (if n large) Count X i = 0 or 1 p √(p(1-p)/n) Binomal (if n small) Normal (if n large) Summary of Sampling Distribution

June 2, 2008Stat Lecture 18 - Review17 Introduction to Inference Use sample estimate as center of a confidenceinterval of likely values for population parameter All confidence intervals have the same form: Estimate ± Margin of Error The margin of error is always some multiple of the standard deviation (or standard error) of statistic Hypothesis test: data supports specific hypothesis? 1.Formulate your Null and Alternative Hypotheses 2.Calculate the test statistic: difference between data and your null hypothesis 3.Find the p-value for the test statistic: how probable is your data if the null hypothesis is true?

June 2, 2008Stat Lecture 18 - Review18 Inference: Single Population Mean  Known SD  : confidence intervals and test statistics involve standard deviation and normal critical values Unknown SD  : confidence intervals and test statistics involve standard error and critical values from a t distribution with n-1 degrees of freedom t distribution has wider tails (more conservative)

June 2, 2008Stat Lecture 18 - Review19 Inference: Comparing Means  1 and  2 Known  1 and  2 : two-sample Z statistic uses normal distribution Unknown  1 and  2 : two-sample T statistic uses t distribution with degrees of freedom = min(n 1 -1,n 2 -1) Matched pairs: instead of difference of two samples X 1 and X 2, do a one-sample test on the difference d

June 2, 2008Stat Lecture 18 - Review20 Inference: Population Proportion p Confidence interval for p uses the Normal distribution and the sample proportion: Hypothesis test for p = p 0 also uses the Normal distribution and the sample proportion:

June 2, 2008Stat Lecture 18 - Review21 Inference: Comparing Proportions p 1 and p 2 Hypothesis test for p 1 - p 2 = 0 uses Normal distribution and complicated test statistic with pooled standard error: Confidence interval for p 1 = p 2 also uses Normal distribution and sample proportions

June 2, 2008Stat Lecture 18 - Review22 Linear Regression Use best fit line to summarize linear relationship between two continuous variables X and Y: The slope ( ): average change you get in the Y variable if you increased the X variable by one The intercept( ):average value of the Y variable when the X variable is equal to zero Linear equation can be used to predict response variable Y for a value of our explanatory variable X

June 2, 2008Stat Lecture 18 - Review23 Significance in Linear Regression Does the regression line show a significant linear relationship between the two variables? H 0 :  = 0 versus H a :  0 Uses the t distribution with n-2 degrees of freedom and a test statistic calculated from JMP output Can also calculate confidence intervals using JMP output and t distribution with n-2 degrees of freedom b +/- t n-2 * SE(b) a +/- t n-2 * SE(a)

June 2, 2008Stat Lecture 18 - Review24 Good Luck on Final!