June 11, 2008Stat 111 - Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics 111 - Lecture 10.

Slides:



Advertisements
Similar presentations
Data: Quantitative (Histogram, Stem & Leaf, Boxplots) versus Categorical (Bar or Pie Chart) Boxplots: 5 Number Summary, IQR, Outliers???, Comparisons.
Advertisements

Chapter 3 Properties of Random Variables
AP Statistics Course Review.
June 9, 2008Stat Lecture 8 - Sampling Distributions 1 Introduction to Inference Sampling Distributions Statistics Lecture 8.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Statistics Psych 231: Research Methods in Psychology.
Ch. 6 The Normal Distribution
1.2: Describing Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
AP Stats Test Review  What are the four parts of the course? Inference, Experimental Design, Probability, and Data Analysis  How many multiple choice.
STAT 211 – 019 Dan Piett West Virginia University Lecture 2.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Describing distributions with numbers
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Numerical Descriptive Techniques
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
June 10, 2008Stat Lecture 9 - Proportions1 Introduction to Inference Sampling Distributions for Counts and Proportions Statistics Lecture 9.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Transformations, Z-scores, and Sampling September 21, 2011.
Biostat. 200 Review slides Week 1-3. Recap: Probability.
Wednesday, May 13, 2015 Report at 11:30 to Prairieview.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Fall Final Topics by “Notecard”.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Numerical Measures of Variability
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
12/7/20151 Probability Introduction to Probability, Conditional Probability and Random Variables.
AP Statistics Semester One Review Part 2 Chapters 4-6 Semester One Review Part 2 Chapters 4-6.
AP Statistics Semester One Review Part 1 Chapters 1-3 Semester One Review Part 1 Chapters 1-3.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
AP Stats Review day 1 April 2, Basics Two Parts (90 Minutes each part) – 40 Multiple Choice Content Questions (10-15) Calculation Questions(25-30)
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Midterm Review IN CLASS. Chapter 1: The Art and Science of Data 1.Recognize individuals and variables in a statistical study. 2.Distinguish between categorical.
Descriptive Statistics ( )
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Stat Lecture 7 - Normal Distribution
MECH 373 Instrumentation and Measurements
MATH-138 Elementary Statistics
Review 1. Describing variables.
AP Stat Review Know Your Book!
Statistics.
CHAPTER 29: Multiple Regression*
Review of Hypothesis Testing
Descriptive and inferential statistics. Confidence interval
BUS173: Applied Statistics
When You See (This), You Think (That)
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Fall Final Topics by “Notecard”.
Descriptive Statistics
Introductory Statistics
Presentation transcript:

June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10

June 11, 2008Stat Lecture 10 - Review2 Administrative Notes Homework 3 is due Monday –Covers material from Chapter 5, so worth doing as practice for the midterm! Exam on Monday –Starts exactly at 10:40 – get here early

June 11, 2008Stat Lecture 10 - Review3 Some Topics Not Covered on Midterm Continuity correction for binomial calculations (chapter 5) Normal quintile plots(chapter 1)

June 11, 2008Stat Lecture 10 - Review4 Experiments Used to examine effect of a treatment eg. medical trials, education interventions Different from an observational study, where no treatment is imposed Observational studies can only examine associations between variables, whereas experiments try to establish causal effects Experiments can still be biased though! Population Experimental Units Treatment Group Control Group Treatment No Treatment Result 1 234

June 11, 2008Stat Lecture 10 - Review5 Just like in experiments, we must be cautious of potential sources of bias in our sampling results Voluntary response samples, undercoverage, non- response, untrue-response, wording of questions Simple Random Sampling: less biased since each individual in the population has an equal chance of being included in the sample Population Sample Parameter Statistic Sampling Inference Estimation ? Sampling and Surveys

June 11, 2008Stat Lecture 10 - Review6 Distributions A distribution describes what values a variable takes and how frequently these values occur Boxplots are good for center and spread, but don’t indicate shape of a distribution Histograms much more effective at displaying the shape of a distribution

June 11, 2008Stat Lecture 10 - Review7 Numerical Measures of Center Mean: Median: “middle number in distribution” Mean is more affected by large outliers and asymmetry than the median Symmetric: Mean ≈ Median Skewed Left: Mean<Median Skewed Right: Mean>Median

June 11, 2008Stat Lecture 10 - Review8 Variance: average of the squared deviations of each observation Standard Deviation = Inter-Quartile Range: IQR = Q3 - Q1 First Quartile (Q1) is the median of the smaller half of data Third Quartile (Q3) is the median of the larger half of data With outliers or asymmetry, median and IQR are better but we will use mean and SD more since most distributions we use (eg. normal distribution) are symmetric with no outliers Numerical Measures of Spread

June 11, 2008Stat Lecture 10 - Review9 Scatterplots of two variables Positiveassociation vs Negative association Some associations are not just positive or negative, but also appear to be linear Correlation is a measure of the strength of linear relationship between variables X and Y r near 1 or -1 means strong linear relationship r near 0 means weak linear relationship Negative r means negative association

June 11, 2008Stat Lecture 10 - Review10 Linear Regression Best fit line between X and Y: Y = a + b·X The slope b( ): average change you get in the Y variable if you increased the X variable by one The intercept a ( ):average value of the Y variable when the X variable is equal to zero Regression equation used to predict response variable Y for a value of our explanatory variable X

June 11, 2008Stat Lecture 10 - Review11 Probability Random process: outcome not known exactly, but have probability distribution of possible outcomes Event: outcome of random process with prob. P(A) Additive Rule for Disjoint Events: P(A or B) = P(A) + P(B) if A and B are disjoint Multiplication Rule for Independent Events: P(A and B) = P(A) x P(B) if A and B are independent Need to combine different rules (Eg. Lecture 8)

June 11, 2008Stat Lecture 10 - Review12 Probability and Random Variables Conditional Probability: Random variable: numerical outcome or summary of a random process A discrete random variable has a finite number of distinct values Continuous random variables can have a non- countable number of values

June 11, 2008Stat Lecture 10 - Review13 Discrete vs. Continuous RV’s Probability histogram for distribution of discrete r.v. Calculate probabilities by adding up bars of histogram Density curve used for distribution of continuous r.v. Calculate probabilities by integrating area under curve

June 11, 2008Stat Lecture 10 - Review14 Linear Transformations of Variables Same rules for both data and random variables: mean(a·X + c) = a·mean(X) + c variance(a·X + c) = a 2 ·variance(X) SD(a·X + c) = |a|· SD(X) Adding constants does not change spread measures Can also do combinations of more than one variable: If X and Y are variables and Z = a·X + b·Y + c mean(Z) = a·mean(X) + b·mean(Y) + c If X and Y are also independent then Variance(Z) = a 2 ·Variance(X) + b 2 ·Variance(Y)

June 11, 2008Stat Lecture 10 - Review15 The Normal Distribution The Normal distribution has the shape of a “bell curve” with parameters  and  2,denoted N( ,  2 ) StandardNormal:  = 0 and  2 = 1 Normal distribution follows the rule: 68% of observations are between  -  and  +  95% of observations are between  - 2  and  + 2  99.7% of observations are between  - 3  and  + 3  Have tables for any probability from the standard normal distribution N(0,1) N(2,1) N(0,2) N(-1,2)

June 11, 2008Stat Lecture 10 - Review16 Standardization For non-standard normal probabilities, need to transform to a standard normal distribution If X has a N( ,  2 ) distribution, then we can convert to Z which follows a N(0,1) distribution: Can then calculate P(Z < k) using table Reverse standardization: converting a standard normal Z into a non-standard normal X X = σZ + μ Practice makes perfect!

June 11, 2008Stat Lecture 10 - Review17 Inference for Continuous Data Continuous data is summarized by sample mean Sample mean is used as our estimate of the population mean, but how does sample mean vary between samples? Population Parameters:  and  2 Distribution of these values? Sample 1 of size n x Sample 2 of size n x Sample 3 of size n x Sample 4 of size n x Sample 5 of size n x Sample 6 of size n x.

June 11, 2008Stat Lecture 10 - Review18 Sampling Distribution of Sample Mean The center of the sampling distribution of the sample mean is the population mean: Over all samples, the sample mean will, on average, be equal to the population mean (no guarantees for 1 sample!) The spread of the sampling distribution of the sample mean is As sample size increases, variance of the sample mean decreases! Central Limit Theorem: if the sample size is large enough, then the sample mean X has an approximately Normal distribution

June 11, 2008Stat Lecture 10 - Review19 Inference for Count Data Goal for count data is to estimate the population proportion p From a sample of size n, we can calculate two statistics: 1. sample count Y 2. sample proportion Use sample proportion as our estimate of population proportion p Sampling Distribution of the Sample Proportion how does sample proportion change over different samples? Population Parameter: p Distribution of these values? Sample 1 of size n Sample 2 of size n Sample 3 of size n Sample 4 of size n Sample 5 of size n Sample 6 of size n.

June 11, 2008Stat Lecture 10 - Review20 Sampling Distribution for Proportion For small samples, use the Binomial distribution to calculate probabilities for the sample count or sample proportion Definition of “small”: n·p < 10 or n·(1-p) < 10 For large samples, we use the Normal approximation to the Binomial distribution for the sample count or sample proportion

June 11, 2008Stat Lecture 10 - Review21 Next Week - Lecture 11 Chapter 6 Good luck on midterm next Monday!