Lecture 12 Brownian motion, chi-square distribution, d.f. Adjusted schedule ahead Chi-square distribution (lot of supplementary material, come to class!!!)

Slides:



Advertisements
Similar presentations
Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis.
Advertisements

Stat 13 Final Review A. Probability tables to use. B. variance algebra, correlation, covariance, regression C. Probability and Conditional probability.
Rules for Means and Variances
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 13: The Chi-Square Test
Chapter 10 Inference on Two Samples 10.4 Inference on Two Population Standard Deviations.
Econ 140 Lecture 41 More on Univariate Populations Lecture 4.
Evaluation (practice). 2 Predicting performance  Assume the estimated error rate is 25%. How close is this to the true error rate?  Depends on the amount.
Probability Densities
1 The Chi squared table provides critical value for various right hand tail probabilities ‘A’. The form of the probabilities that appear in the Chi-Squared.
Class notes for ISE 201 San Jose State University
Simple Linear Regression Analysis
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Inferences About Process Quality
BCOR 1020 Business Statistics
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 6 Sampling and Sampling.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Statistical Analysis I have all this data. Now what does it mean?
Testing Distributions Section Starter Elite distance runners are thinner than the rest of us. Skinfold thickness, which indirectly measures.
Stat13-lecture 25 regression (continued, SE, t and chi-square) Simple linear regression model: Y=  0 +  1 X +  Assumption :  is normal with mean 0.
What if the population standard deviation, , is unknown? We could estimate it by the sample standard deviation s which is the square root of the sample.
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Chapter 6 Confidence Intervals.
Chapter 6: Probability Distributions
1 G Lect 6b G Lecture 6b Generalizing from tests of quantitative variables to tests of categorical variables Testing a hypothesis about a.
Chapter 10 The Normal and t Distributions. The Normal Distribution A random variable Z (-∞ ∞) is said to have a standard normal distribution if its probability.
(c) 2007 IUPUI SPEA K300 (4392) Outline Normal Probability Distribution Standard Normal Probability Distribution Standardization (Z-score) Illustrations.
Statistical Analysis I have all this data. Now what does it mean?
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Lecture 13 Chi-square and sample variance Finish the discussion of chi-square distribution from lecture 12 Expected value of sum of squares equals n-1.
Statistical estimation, confidence intervals
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Discrete distribution word problems –Probabilities: specific values, >, =, … –Means, variances Computing normal probabilities and “inverse” values: –Pr(X
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Chi- square test x 2. Chi Square test Symbolized by Greek x 2 pronounced “Ki square” A Test of STATISTICAL SIGNIFICANCE for TABLE data.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Confidence Intervals for Variance and Standard Deviation.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Chapter 11: Additional Topics Using Inferences 11.1 – Chi-Square: Tests of Independence 11.2 – Chi-Square: Goodness of Fit 11.3 – Testing a Single Variance.
Review of Probability Concepts Prepared by Vera Tabakova, East Carolina University.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 5: The Basic Concepts of Statistics. 5.1 Population and Sample Definition 5.1 A population consists of the totality of the observations with which.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Theoretical distributions: the other distributions.
Sampling and Sampling Distributions
Statistical Inferences for Population Variances
Inference concerning two population variances
Chapter 4. Inference about Process Quality
Chapter 15 Linear Regression
Quantitative Methods Simple Regression.
Data Analysis for Two-Way Tables
Probability Key Questions
Confidence Intervals for a Standard Deviation
PROBABILITY DISTRIBUTION
6.3 Sampling Distributions
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Chapter 7 Lecture 3 Section: 7.5.
Presentation transcript:

Lecture 12 Brownian motion, chi-square distribution, d.f. Adjusted schedule ahead Chi-square distribution (lot of supplementary material, come to class!!!) 1 lecture Hypothesis testing (about the SD of measurement error)and P-value ( why n-1?supplement) 1 lecture Chi-square test for Model validation (chapter 11) Probability calculation (chapter 4) Binomial distribution and Poisson (chapter 5, supplement, horse-kick death cavalier data, hitting lottery, SARS infection) Correlation, prediction, regression (supplement) t-distribution, F-distribution

Brownian motion-random zigzag movement of small particles dispersed in fluid medium, R. Brown ( , Brit. Botanist) Molecule movement is unpredictable Position along each coordinate axis is modeled as a random variable with normal distribution (like pollen in water) 2- D case. A particle moves randomly on the glass surface, starting from (0, 0) position. The position one minute later is at (X,Y). Suppose EX=0, EY=0, SD(X)=SD(Y)=1 mm. Find the probability that the particle is within 2mm of the original. How about within 3mm? Or in general within c mm? That is, what is Pr( distance< c)? How big a circle has be drawn in order to have 95% chance of containing the particle?

X ~ normal(0,1), Y~normal(0,1) independent Pythagorean theorem: right triangle : c 2 = a 2 + b 2 Distance between two points on a plane with coordinates (a, b) and (c.d) is equal to square root of (a-c) 2 + (b-d) 2 ; give examples Squared distance between (X,Y) to (0,0) is D 2 =X 2 +Y 2 (why?) The name of the distribution of this random variable is known as a chi-square distribution with 2 degrees of freedom. (denoted by ~ chisq, df=2; or    d.f=2; or ~     ; computer programs are available for Finding Pr (X 2 + Y 2 a).

Use simulation : Suppose there are 1000 particles released at The origin and move independent of each other; one minute later, record their locations. Mark locations; Draw histogram of squared distance from the origin Find the proportion of particles that are within 1mm, 2 mm, 3 mm, and so on..

How to use table of chi-square distribution on page 567 At the first column, find df =2 The row corresponding to df is relevant to chi- quared distribution with two degrees of freedom Find the value at the rightmost end Look up for the column header :.001 This means that there is only a probability of.001 that the chi-squared random variable will be greater than 18.82

> (sqrt 13.82) > (sqrt 10.60) > (sqrt 9.210) > (sqrt 7.378) > (sqrt 5.991) > (sqrt 4.605) > (sqrt 3.794) > (sqrt 3.219)

Movement in three dimension space A point with Coordinate (X, Y, Z); squared distance from origin is D 2 =X 2 + Y 2 + Z 2 Then D 2 follows a Chi-square distribution with three degrees of freedom In general, movement in n dimension space; a point with coordinate (X 1, X 2,..X n ); squared distance from origin is D 2 =X 1 2 +X ….+ X n 2 If X 1,..X n are independent, normal (0,1), then D 2 follows a Chi-square distribution with n degrees of freedom If X, Y,Z are independent, normal (0,1)

3-Dimension case Distance of a point (x,y) from diagonal line : Look at the symmetric point (y,x) The projection must be at the middle; so squared distance is Projection to the diagonal (x=y=z) line Losing one degree of freedom due to projection constraint (from another viewpoint, one equation (X-C) + (Y-C)=0 to hold; ) R 2 = (X- C) 2 + (Y-C) 2, where C=(X+Y)/2 Follows a chi-squared with one degree of freedom R 2 = (X-C) 2 + (Y-C) 2 +(Z-C) 2 ; C= (X+Y+Z)/3 Follows a chi-square distribution with (3-1)=2 degrees of freedom N-dimensional case :

If variance of normal each X is  2 Then D 2 /  2 follows a chi-square distribution with n degrees of freedom R 2 /  2 follows a chi-square distribution with n-1 degrees of freedom ; this is also true even if the mean of the normal distribution (for each X) is not zero (why?) R 2 = (X 1 -C) 2 + (X 2 -C) 2 + …+ (X n -C) 2 ; C= (X X n )/n =average Follows a chi-square distribution with n-1 degrees of freedom