Download presentation
Presentation is loading. Please wait.
Published byAbraham Stafford Modified over 9 years ago
1
Last Time Normal Distribution –Density Curve (Mound Shaped) –Family Indexed by mean and s. d. –Fit to data, using sample mean and s.d. Computation of Normal Probabilities –Using Excel function, NORMDIST –And Big Rules of Probability
2
Reading In Textbook Approximate Reading for Today’s Material: Pages 61-62, 66-70, 59-61, 322-326 Approximate Reading for Next Class: Pages 337-344, 488-498
3
Normal Density Fitting Idea: Choose μ and σ to fit normal density to histogram of data, Approach: IF the distribution is “mound shaped” & outliers are negligible THEN a “good” choice of normal model is:
4
Normal Density Fitting Melbourne Average Temperature Data
5
Computation of Normal Probs EXCEL Computation: probs given by “lower areas” E.g. for X ~ N(1,0.5) P{X ≤ 1.3} = 0.726
6
Computation of Normal Probs Computation of upper areas: (use “1 –”, i.e. “not” formula) = 1 -
7
Computation of Normal Probs Computation of areas over intervals: (use subtraction) = -
8
Z-score view of populations Idea: Reproducible view of “where data point lies in population”
9
Z-score view of populations Idea: Reproducible view of “where data point lies in population” Context 1: List of Numbers Context 2: Probability distribution
10
Z-score view of Lists of #s Idea: Reproducible view of “where data point lies in population”
11
Z-score view of Lists of #s Idea: Reproducible view of “where data point lies in population” Thought model: population is Normal
12
Z-score view of Lists of #s Idea: Reproducible view of “where data point lies in population” Thought model: population is Normal Population mean: μ
13
Z-score view of Lists of #s Idea: Reproducible view of “where data point lies in population” Thought model: population is Normal Population mean: μ Population standard deviation: σ
14
Z-score view of Lists of #s Idea: Reproducible view of “where data point lies in population” Thought model: population is Normal Population mean: μ Population standard deviation: σ Interpret data as “s.d.s away from mean”
15
Z-score view of Lists of #s Approach: Transform data
16
Z-score view of Lists of #s Approach: Transform data By subtracting mean & dividing by s.d
17
Z-score view of Lists of #s Approach: Transform data By subtracting mean & dividing by s.d. To get
18
Z-score view of Lists of #s Approach: Transform data By subtracting mean & dividing by s.d. To get (gives mean 0, s.d. 1)
19
Z-score view of Lists of #s Approach: Transform data By subtracting mean & dividing by s.d. To get (gives mean 0, s.d. 1) Interpret as
20
Z-score view of Lists of #s Approach: Transform data By subtracting mean & dividing by s.d. To get (gives mean 0, s.d. 1) Interpret as I.e. “ is sd’s above the mean”
21
Z-score view of Normal Dist. Approach: For
22
Z-score view of Normal Dist. Approach: For Subtract mean & divide by s.d
23
Z-score view of Normal Dist. Approach: For Subtract mean & divide by s.d. To get
24
Z-score view of Normal Dist. Approach: For Subtract mean & divide by s.d. To get (gives mean 0, s.d. 1, i.e. Standard Normal)
25
Z-score view of Normal Dist. Approach: For Subtract mean & divide by s.d. To get (gives mean 0, s.d. 1, i.e. Standard Normal) Interpret as
26
Z-score view of Normal Dist. Approach: For Subtract mean & divide by s.d. To get (gives mean 0, s.d. 1, i.e. Standard Normal) Interpret as I.e. “ is sd’s above the mean”
27
Z-score view of Normal Dist. HW: 1.117
28
Interpretation of Z-scores Z-scores
29
Interpretation of Z-scores Z-scores are on N(0,1) scale,
30
Interpretation of Z-scores Z-scores are on N(0,1) scale,
31
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them
32
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas:
33
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 1.Within 1 sd of mean
34
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 1.Within 1 sd of mean
35
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 1.Within 1 sd of mean “the majority”
36
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 1.Within 1 sd of mean “the majority” ≈ 68%
37
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 2.Within 2 sd of mean “really most” ≈ 95%
38
Interpretation of Z-scores Z-scores are on N(0,1) scale, so use areas to interpret them Important Areas: 3.Within 3 sd of mean “almost all” ≈ 99.7%
39
Interpretation of Z-scores Summary: these are called the “68 - 95 - 99.7 % Rule”
40
Interpretation of Z-scores Summary: these are called the “68 - 95 - 99.7 % Rule” Mean +- 1 - 2 – 3 sd’s
41
Interpretation of Z-scores Summary: “68 - 95 - 99.7 % Rule” Excel Calculation From Class Example 9: http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg9.xls
42
Interpretation of Z-scores Summary: “68 - 95 - 99.7 % Rule” Excel Calculation
43
Interpretation of Z-scores HW: 1.115, 1.116 (50%, 2.5%, 0.18-0.22) 1.119
44
Inverse Normal Probs Idea, for a given cutoff value, x
45
Inverse Normal Probs Idea, for a given cutoff value, x Calculated P{X < x}
46
Inverse Normal Probs Idea, for a given cutoff value, x Calculated P{X < x} as Area under normal density
47
Inverse Normal Probs Idea, for a given cutoff value, x Calculated P{X < x} as Area under normal density Using Excel function: NORMDIST
48
Inverse Normal Probs Now for a given P{X < x}, i.e. Area
49
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find corresponding cutoff x
50
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find corresponding cutoff x Terminology:
51
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find corresponding cutoff x Terminology: Quantile
52
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find corresponding cutoff x Terminology: Quantile Percentile
53
Inverse Normal Probs E.g. Given area = 80%
54
Inverse Normal Probs E.g. Given area = 80% This x is the
55
Inverse Normal Probs E.g. Given area = 80% This x is the 0.8-quantile
56
Inverse Normal Probs E.g. Given area = 80% This x is the 0.8-quantile 80-th percentile
57
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find: Quantile Percentile
58
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find: Quantile Percentile Excel Computation: NORMINV
59
Inverse Normal Probs Excel Computation: NORMINV
60
Inverse Normal Probs Excel Computation: NORMINV (very similar to other Excel functions)
61
Inverse Normal Probs Excel Computation: NORMINV (very similar to other Excel functions) (and reasonably well organized)
62
Inverse Normal Probs Excel Computation: NORMINV Examples in: http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg9.xls
63
Inverse Normal Probs Excel Computation: NORMINV
64
Inverse Normal Probs Excel Computation: NORMINV Set: Mean = 0
65
Inverse Normal Probs Excel Computation: NORMINV Set: Mean = 0 s.d. = 1 prob = 0.8
66
Inverse Normal Probs Excel Computation: NORMINV Set: Mean = 0 s.d. = 1 prob = 0.8 Get answer
67
Inverse Normal Probs Excel Computation: NORMINV or can just type in formula
68
Inverse Normal Probs Excel Computation: NORMINV or can just type in formula Get answer
69
Inverse Normal Probs Now for a given P{X < x}, i.e. Area Find: Quantile Percentile = 0.84
70
Inverse Normal Probs Excel Computation: NORMINV Another example: for X ~ N(100,20)
71
Inverse Normal Probs Excel Computation: NORMINV Another example: for X ~ N(100,20)
72
Inverse Normal Probs Excel Computation: NORMINV Another example: for X ~ N(100,20) Find x, so that 30% = P{X < x}
73
Inverse Normal Probs Excel Computation: NORMINV Another example: for X ~ N(100,20) Find x, so that 30% = P{X < x} i.e. the 30-th percentile
74
Inverse Normal Probs Excel Computation: NORMINV Another example: for X ~ N(100,20) Find x, so that 30% = P{X < x} i.e. the 30-th percentile Answer: slightly less than mean
75
Inverse Normal Probs Example: Quality Control
76
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz.
77
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. The machine is “out of control” when it overfills.
78
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. The machine is “out of control” when it overfills. Choose an “alarm level”, which will give only 1 % false alarms.
79
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. The machine is “out of control” when it overfills. Choose an “alarm level”, which will give only 1 % false alarms. Want: cutoff, x, so that Area above = 1%
80
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. The machine is “out of control” when it overfills. Choose an “alarm level”, which will give only 1 % false alarms. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
81
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
82
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
83
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
84
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
85
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99%
86
Inverse Normal Probs When a machine works normally, it fills bottles with mean = 25 oz, and SD = 0.2 oz. Want: cutoff, x, so that Area above = 1% Note: Area below = 100% - Area above = 99% So set alarm threshold to 25.47
87
Inverse Normal Probs HW: 1.122 (-0.675, 0.385) 1.123 1.132 (1294) 1.133 1.139
88
And Now for Something Completely Different A fun idea. Can you read this?
89
And Now for Something Completely Different A fun idea. Can you read this? Olny srmat poelpe can raed this.
90
And Now for Something Completely Different A fun idea. Can you read this? Olny srmat poelpe can raed this. I cdnuolt blveiee that I cluod aulaclty uesdnatnrd what I was rdanieg.
91
And Now for Something Completely Different A fun idea. Can you read this? Olny srmat poelpe can raed this. I cdnuolt blveiee that I cluod aulaclty uesdnatnrd what I was rdanieg. The phaonmneal pweor of the hmuan mnid, aoccdrnig to rscheearch at Cmabrigde Uinervtisy.
92
And Now for Something Completely Different The phaonmneal pweor of the hmuan mnid, aoccdrnig to rscheearch at Cmabrigde Uinervtisy.
93
And Now for Something Completely Different The phaonmneal pweor of the hmuan mnid, aoccdrnig to rscheearch at Cmabrigde Uinervtisy. It deosn't mttaer in what oredr the ltteers in a word are, the olny iprmoatnt tihng is that the first and last ltteer be in the rghit pclae.
94
And Now for Something Completely Different The phaonmneal pweor of the hmuan mnid, aoccdrnig to rscheearch at Cmabrigde Uinervtisy. It deosn't mttaer in what oredr the ltteers in a word are, the olny iprmoatnt tihng is that the first and last ltteer be in the rghit pclae. The rset can be a taotl mses and you can still raed it wouthit a porbelm.
95
And Now for Something Completely Different The rset can be a taotl mses and you can still raed it wouthit a porbelm.
96
And Now for Something Completely Different The rset can be a taotl mses and you can still raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the word as a wlohe.
97
And Now for Something Completely Different The rset can be a taotl mses and you can still raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the word as a wlohe. Amzanig huh?
98
And Now for Something Completely Different The rset can be a taotl mses and you can still raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the word as a wlohe. Amzanig huh? Yaeh and I awlyas tghuhot slpeling was ipmorantt!
99
Checking Normality Idea: For which data sets, will the normal distribution be a good model?
100
Checking Normality Idea: For which data sets, will the normal distribution be a good model? Recall fitting normal density to data:
101
Normal Density Fitting Idea: Choose μ and σ to fit normal density to histogram of data, Approach: IF the distribution is “mound shaped” & outliers are negligible THEN a “good” choice of normal model is:
102
Normal Density Fitting Melbourne Average Temperature Data
103
Checking Normality Idea: For which data sets, will the normal distribution be a good model? Useful graphical device to check: IF the distribution is “mound shaped” & outliers are negligible
104
Checking Normality Useful graphical device:
105
Checking Normality Useful graphical device: Quantile – Quantile plot
106
Checking Normality Useful graphical device: Quantile – Quantile plot Varying Terminology:
107
Checking Normality Useful graphical device: Quantile – Quantile plot Varying Terminology: Q-Q plot
108
Checking Normality Useful graphical device: Quantile – Quantile plot Varying Terminology: Q-Q plot Normal Quantile plot (text book)
109
Checking Normality Q-Q plot
110
Checking Normality Q-Q plot Idea: graphical comparison
111
Checking Normality Q-Q plot Idea: graphical comparison of data distribution
112
Checking Normality Q-Q plot Idea: graphical comparison of data distribution vs. normal distribution
113
Checking Normality Q-Q plot Idea: graphical comparison of data distribution vs. normal distribution as data quantiles vs. normal quantiles
114
Checking Normality Q-Q plot, implementation:
115
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles
116
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities:
117
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities: (equally spaced, strictly between 0 and 1)
118
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities: Compute corresponding normal quantiles
119
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities: Compute corresponding normal quantiles (using NORMINV)
120
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities: Compute corresponding normal quantiles (using NORMINV) Make plot with x-axis
121
Checking Normality Q-Q plot, implementation: Sort data, to find data quantiles Assign corresponding probabilities: Compute corresponding normal quantiles (using NORMINV) Make plot with x-axis & y-axis
122
Checking Normality Q-Q plot, interpretation:
123
Checking Normality Q-Q plot, interpretation: When distribution is normal:
124
Checking Normality Q-Q plot, interpretation: When distribution is normal: –Points lie close to a line
125
Checking Normality Q-Q plot, interpretation: When distribution is normal: –Points lie close to a line –For standard normal quantiles
126
Checking Normality Q-Q plot, interpretation: When distribution is normal: –Points lie close to a line –For standard normal quantiles Y-intercept of line is mean Slope of line is s.d.
127
Checking Normality Q-Q plot, interpretation: When distribution is normal: –Points lie close to a line –For standard normal quantiles Y-intercept of line is mean Slope of line is s.d. For non-normal distribution:
128
Checking Normality Q-Q plot, interpretation: When distribution is normal: –Points lie close to a line –For standard normal quantiles Y-intercept of line is mean Slope of line is s.d. For non-normal distribution: –Q-Q plot will curve away from line
129
Checking Normality Q-Q plot, e.g.
130
Checking Normality Q-Q plot, e.g. Excel analyses available in: http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg10.xls
131
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as:
132
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab
133
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis
134
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis Random Number Generation
135
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis Random Number Generation Set parameters
136
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis Random Number Generation Set parameters
137
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis Random Number Generation Set parameters
138
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Data simulated as: Data Tab Data Analysis Random Number Generation Set parameters
139
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next sort data Copy to another column
140
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next sort data Copy to another column Highlight
141
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next sort data Copy to another column Highlight Data Tab
142
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next sort data Copy to another column Highlight Data Tab Sort Button
143
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next sort data Copy to another column Highlight Data Tab Sort Button Gives Data Quantiles
144
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next compute Normal Quantiles
145
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next compute Normal Quantiles 1 st type indices Range of probs i / (n+1)
146
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Next compute Normal Quantiles 1 st type indices Range of probs i / (n+1) Normal quantiles
147
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Now plot Data Quantiles vs. Normal Quantiles
148
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Now plot Data Quantiles vs. Normal Quantiles
149
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Now plot Data Quantiles vs. Normal Quantiles Insert Tab
150
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Now plot Data Quantiles vs. Normal Quantiles Insert Tab Scatter Button
151
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Now plot Data Quantiles vs. Normal Quantiles Insert Tab Scatter Button Fill out menu (as before)
152
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Results: Looks very linear
153
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Results: Looks very linear As expected
154
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Results: Looks very linear As expected Y-intercept = 0 (= mean)
155
Checking Normality Q-Q plot, e.g. n = 1000 from N(0,1) Results: Looks very linear As expected Y-intercept = 0 (= mean) Slope = 1 (= s.d.)
156
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls
157
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Recall Histogram
158
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Recall Histogram - Roughly symmetric
159
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Recall Histogram - Roughly symmetric - Mound shaped
160
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Recall Histogram - Roughly symmetric - Mound shaped - Does Normal Curve fit the data?
161
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Approximately linear
162
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Approximately linear Suggests normal
163
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Approximately linear Suggests normal But some wiggles?
164
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Approximately linear Suggests normal But some wiggles? Due to natural sampling variation?
165
Checking Normality Q-Q plot, e.g. Buffalo Snowfalls Approximately linear Suggests normal But some wiggles? Due to natural sampling variation? Study with smaller simulation
166
Checking Normality Q-Q plot, e.g. n = 100 from N(0,1)
167
Checking Normality Q-Q plot, e.g. n = 100 from N(0,1) Approximately linear
168
Checking Normality Q-Q plot, e.g. n = 100 from N(0,1) Approximately linear Some wiggliness
169
Checking Normality Q-Q plot, e.g. n = 100 from N(0,1) Approximately linear Some wiggliness Suggests Buffalo variation is usual
170
Checking Normality Q-Q plot, e.g. n = 100 from N(0,1) Approximately linear Some wiggliness Suggests Buffalo variation is usual Make this more precise?
171
Checking Normality Q-Q plot, e.g. British Suicides
172
Checking Normality Q-Q plot, e.g. British Suicides Recall Histogram
173
Checking Normality Q-Q plot, e.g. British Suicides Recall Histogram Strong right skewness
174
Checking Normality Q-Q plot, e.g. British Suicides Recall Histogram Strong right skewness So mean >> median
175
Checking Normality Q-Q plot, e.g. British Suicides Recall Histogram Strong right skewness So mean >> median Not mound shaped
176
Checking Normality Q-Q plot, e.g. British Suicides
177
Checking Normality Q-Q plot, e.g. British Suicides Distinct non-linearity (curvature)
178
Checking Normality Q-Q plot, e.g. British Suicides Distinct non-linearity (curvature) Conclude data not normal
179
Checking Normality Q-Q plot, e.g. British Suicides Distinct non-linearity (curvature) Conclude data not normal Characteristic of right skewness
180
Checking Normality Q-Q plot, e.g. Log10 British Suicides Recall: log10 transformation resulted in mound shape
181
Checking Normality Q-Q plot, e.g. Log10 British Suicides Recall Histogram
182
Checking Normality Q-Q plot, e.g. Log10 British Suicides Recall Histogram: o Much more mound shaped
183
Checking Normality Q-Q plot, e.g. Log10 British Suicides Recall Histogram: o Much more mound shaped o Check for normality with Q-Q plot
184
Checking Normality Q-Q plot, e.g. Log10 British Suicides
185
Checking Normality Q-Q plot, e.g. Log10 British Suicides Looks very linear
186
Checking Normality Q-Q plot, e.g. Log10 British Suicides Looks very linear Indicates normal distribution is good fit
187
Checking Normality Q-Q plot, e.g. Log10 British Suicides Looks very linear Indicates normal distribution is good fit I.e. transformation worked!
188
Checking Normality HW: 1.143 1.145 1.146 (a. approx. normal + big outlier; b. close to normal; c. right skew + one big outlier; d. Non-normal with several clusters
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.