Download presentation
Presentation is loading. Please wait.
Published byEmily Parrish Modified over 9 years ago
2
Lectures prepared by: Elchanan Mossel Yelena Shvets Introduction to probability Stat 134 FAll 2005 Berkeley Follows Jim Pitman’s book: Probability Section 5.3
3
The normalization of the normal Recall: N(0,1) has density f(x) = Ce -1/2x 2 We will calculate the value of C using X,Y » N(0,1) that are independent. (X,Y) have joint density f(x,y) = C 2 e -1/2 ( x 2 + y 2 ) ; And: Question: what is the value of C? Answer:
4
Rotational Invariance Note: The joint density f(x,y) = C 2 e -1/2 ( x 2 + y 2 ) is rotationally invariant – the height depends only on the radial distance from (0,0) and not on the angle. Let: X Y R
5
Rotational invariance r r+dr Note that R 2 (r,r + dr) if (X,Y) is in the annulus A(r,r+dr) of circumference 2 r, and area 2 r dr. In A(r,r+dr) we have: f(x,y) » C 2 e -1/2 r 2. Hence: Therefore the density of R is: So: x y
6
The Variance of N(0,1) By the change of variables formula S = R 2 ~ Exp(1/2): The probability distribution of R is called the Rayleigh distribution. It has the density Therefore the Variance of N(0,1) is given by:
7
Radial Distance A dart is thrown at a circular target by an expert. The point of contact is distributed over the target so that approximately 50% of the shots are in the bull’s eye. Assume that the x and y-coordinates of the hits measured from the center, are distributed as (X,Y), where X,Y are independent N(0, ). What’s the % of the shots that land within the radius twice that of the bull’s eye? What’s the average distance of the shot from the center? Questions: What’s the radius of the bull’s eye?
8
Radial Distance r The hitting distance R has Rayleigh distribution. Therefore: What’s the radius r of the bull’s eye? P(A) ¼ 0.5 A What’s the % of the shots that land within the radius twice that of the bull’s eye?
9
Radial Distance What’s the approximate average distance of the shot from the center? The average is given by: (by symmetry,)
10
Linear Combinations of Independent Normal Variables Suppose that X, Y » N(0,1) and independent. Question: What is the distribution of Z = aX + bY ? Solution: Assume first that a 2 +b 2 = 1. Then there is an angle such that Z = cos X + sin Y.
11
Linear Combinations of Independent Normal Variables Z = cos X + sin Y. By rotational symmetry: P(x<Z<x+ x) = P(x<X<x+ x) So: Z ~ N(0,1). X Y sin Y cos X Z x x x x
12
Linear Combinations of Independent Normal Variables If Z = aX + bY, where a and b are arbitrary, we can define a new variable: So Z’ » N(0,1) and Z » N(0, a 2 + b 2 2 ). If X » N( , ) and Y » N(, ) then So X + Y » N( + , 2 + 2 2 ).
13
N independent Normal Variables Claim: If X 1,…, X N are independent N( i, i 2 ) variables then Z = X 1 +X 2 +…+X N » N( 1 +…+ N, ( 1 2 +…+ N 2 ) ). Proof: By induction. Base case is trivial: Z 1 = X 1 » N( 1, 1 2 ) Assuming the claim for N-1 variables we get Z N-1 » N( 1 +…+ N-1, ( 1 2 +…+ N-1 2 ) ). Now: Z N = Z N-1 + X N, where X N and Z N-1 are independent Normal variables. So by the previous result: Z N » N( 1 +..+ N, ( 1 2 +…+ N 2 ) ).
14
square Distribution Claim: The joint density of n independent N(0,1) variables is: This follows from the fact that a shell of radius r and thickness dr in n dimensions has volume c n r n-1 dr, where c n denotes the surface area of a unit sphere. Note: The density is spherically symmetric it depends on the radial distance: Claim:
15
square Distribution This distribution is also called the -square distribution with n degrees of freedom. Claim: The distribution of R 2 satisfies:
16
Applications of square Distribution Claim: Consider an experiment that is repeated independently n times where the i th outcomes has the probability p i for 1 · i · m. Let N i = # of outcomes of the i th type (N 1 +…+N m = n). Then for large n: 10 draws with replacement P b =6/20; P i =4/20; P c =10/20. N b =3; N i =1; N c =6. has approximately a -square distribution with m-1 degrees of freedom. R 2 2 = (3–3) 2 /3 + (1-2) 2 /2 + (6-5) 2 /5 =1/2 + 1/5 = 0.7
17
Note: The claim allows to “test” to what extent an outcome is consistent with an a priory guess about the actual probabilities. 10 draws with replacement P b =6/20; P i =4/20; P c =10/20. N b =3; N i =1; N c =6. = (3–3) 2 /3 + (1-2) 2 /2 + (6-5) 2 /5 =1/2 + 1/5 = 0.7 square Distribution 2 = 0.7 and the probability of observing a statistic of this size or larger is about 60%, so the sample is consistent with the box.
18
square Example SandalsSneakersLeather shoes BootsOtherTotals Male observed 617139550 Male expected 9.5111012.57 Female observed 135716950 Female expected 9.5111012.57 Total1922202514100 We have a sample of male and female college students and we record what type of shoes they are wearing. We would like to test the hypothesis that men and women are not different in their shoe habits, so we set the expected number in each category to be the average of the two observed values.
19
–square Example (Again, because of our balanced male/female sample, our row totals were the same, so the male and female observed-expected frequency differences were identical. This is usually not the case.) The total chi square value for Table 1 is 14.026 the number of degrees of freedom is 4. This gives This allows to reject the null hypothesis F/Sandals: ((13 - 9.5) 2 /9.5) =1.289 F/Sneakers: ((5 - 11) 2 /11) =3.273 F/L. Shoes: ((7 - 10) 2 /10) =0.900 F/Boots: ((16 - 12.5) 2 /12.5) =0.980 F/Other: ((9 - 7) 2 /7) =0.571 M/Sandals: ((6 - 9.5) 2 /9.5) =1.289 M/Sneakers: ((17 - 11) 2 /11) =3.273 M/L. Shoes: ((13 - 10) 2 /10) =0.900 M/Boots: ((9 - 12.5) 2 /12.5) =0.980 M/Other: ((5 - 7) 2 /7) =0.571
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.