tom.h.wilson Department of Geology and Geography West Virginia University Morgantown, WV
The probability of occurrence of specific values in a sample often takes on a bell-shaped appearance as in the case of our pebble mass distribution.
The Gaussian or normal distribution p(x) is a mathematical representation of this bell-shaped characteristic. This mathematical representation yields a bell shaped curve of probabilities whose form and extent is uniquely defined by the mean and variance derived form a sample.
The Gaussian distribution can also be written using the standard normal variable z. The standard normal variable z = (x-x)/ represents the number of standard deviations the value x is from the mean value.
The Gaussian (normal) distribution of pebble masses looks a bit different from the probability distribution we derived directly from the sample.
In the probability histogram, each bar represents a discrete sum of masses over a 50 gram range divided by the total number of the pebbles.
Consider question 7.4 (page 124) of Waltham - In this question Waltham evaluates the equivalent probability that a pebble having a mass somewhere between 401 and 450 grams will be drawn from a normal distribution having the same mean and standard deviation as the sample.
Note that 401grams lies ( )/48 or standard deviations from the mean. 450 grams lies ( )/48 or standard deviations from the mean value and 2.08 are z-values or standard normal representations of the pebble masses associated with this sample. Note that Waltham rounds off the mean and standard deviation to 350 and 48, respectively.
How can we estimate the area between p (z = 1.06) and p (z=2.08)? Note that area corresponds to the probability that a sample drawn at random from this population will have a value somewhere between 401 and 450 grams.
Note that we can express that area as one half the difference of areas. This Area The area we want to find The areas we get from tables
yields -... one half the combined areas. X
Waltham goes through a weighted average determination of the area under the curve between + and standard deviations. He obtains the area Confirm for yourself that the area out to + and is The difference is Now we take one-half of that to get Question 7.4 is a little more complicated. We no longer have numbers listed in the table.
P( 1 )=0.683 P( 1.1 )=0.729 P( 1.06 ) is six-tenths of the way from P(1) and P(1.1) or plus 0.6 times the difference (0.046) = = 0.71 This method of linear interpolation assumes linearity in the curve between 1 and 1.1 p=0.046
P( 2 )=0.954 P( 2.1 )=0.964 P( 2.08 ) is eight-tenths of the way from P(2) and P(2.1) or plus 0.8 times the difference (0.01) = = p=0.01
0.126 is the normal probability of obtaining a pebble with mass between 401 and 450 grams from the beach under investigation. The probability of finding pebbles with masses in the range 401 to 450 grams is ½ the differences in two areas- i.e. ½ ( ) =0.126 …
Note that the value derived from the normal distribution compares nicely with that observed in the sample (0.126 vs. 0.14).
Read section 7.5 carefully and be prepared to confirm the probabilities listed in Table 7.7 from Waltham.