Managerial Economics & Decision Sciences Department random variables  density functions  cumulative functions  business analytics II Developed for ©

Slides:



Advertisements
Similar presentations
The Normal Distribution
Advertisements

Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
9.1 confidence interval for the population mean when the population standard deviation is known
Statistics and Quantitative Analysis U4320
Confidence Interval Estimation of Population Mean, μ, when σ is Unknown Chapter 9 Section 2.
Lab 4: What is a t-test? Something British mothers use to see if the new girlfriend is significantly better than the old one?
© 2010 Pearson Prentice Hall. All rights reserved Confidence Intervals for the Population Mean When the Population Standard Deviation is Unknown.
Normal Distributions (2). OBJECTIVES –Revise the characteristics of the normal probability distribution; –Use the normal distribution tables (revision);
Estimation Procedures Point Estimation Confidence Interval Estimation.
Chapter 6 Continuous Random Variables and Probability Distributions
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 22 = More of Chapter “Confidence Interval Estimation”
1 The t table provides critical value for various probabilities of interest. The form of the probabilities that appear in Appendix B are: P(t > t A, d.f.
Chapter 5 Continuous Random Variables and Probability Distributions
Inference on averages Data are collected to learn about certain numerical characteristics of a process or phenomenon that in most cases are unknown. Example:
Continuous probability distributions
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Chapter 4 Continuous Random Variables and Probability Distributions
Hypothesis Testing. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Linear Regression Inference
Confidence Interval Estimation
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
Theory of Probability Statistics for Business and Economics.
Ch.5 CONTINOUS PROBABILITY DISTRIBUTION Prepared by: M.S Nurzaman, S.E, MIDEc. ( deden )‏
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Lesson 2 - R Review of Chapter 2 Describing Location in a Distribution.
Chapter 7 Lesson 7.3 Random Variables and Probability Distributions 7.3 Probability Distributions for Continuous Random Variables.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Continuous Probability Distributions Statistics for Management and Economics Chapter 8.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Statistics What is statistics? Where are statistics used?
Mystery 1Mystery 2Mystery 3.
Hypothesis Testing. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Estimating the Value of a Parameter 9.
Managerial Economics & Decision Sciences Department hypotheses, test and confidence intervals  linear regression: estimation and interpretation  linear.
Managerial Economics & Decision Sciences Department tyler realty  old faithful  business analytics II Developed for © 2016 kellogg school of management.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
THE NORMAL DISTRIBUTION
Managerial Economics & Decision Sciences Department hypotheses  tests  confidence intervals  business analytics II Developed for © 2016 kellogg school.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
business analytics II ▌assignment three - solutions pet food 
business analytics II ▌assignment four - solutions mba for yourself 
business analytics II ▌assignment three - solutions pet food 
Estimating the Value of a Parameter Using Confidence Intervals
Normal Distribution and Parameter Estimation
assignment 7 solutions ► office networks ► super staffing
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
business analytics II ▌assignment one - solutions autoparts 
Chapter 6 Confidence Intervals.
Confidence Interval Estimation
Normal Probability Distributions
Interval Estimation and Hypothesis Testing
Estimating the Value of a Parameter
Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20 Skipping Topics 11 and 15.
Advanced Algebra Unit 1 Vocabulary
The Normal Distribution
Presentation transcript:

Managerial Economics & Decision Sciences Department random variables  density functions  cumulative functions  business analytics II Developed for © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II ▌ statistical models: hypotheses, tests & confidence intervals week 1 week 2 week 3 statistics appendix

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page1 random variables session one- statistics appendix ► Economic and business environments are “plagued” with uncertainty – a fancy name to capture the idea that there are several possible outcomes each occurring with a certain probability (likelihood). We represent this type of uncertainty through random variables. ► Roll the dice. Let X represent the side that shows up; for a fair dice each side shows up with equal probability 1/6, thus the representation: ► Stock returns. Let R represent the daily stock returns for IBM shares. The problem here is that the possible outcomes are “too many” to allow a representation as above. This is a continuous random variable and in such cases the representation consists of: - the range of the possible outcomes   R  +  - the likelihood of each possible outcome f ( r ) The likelihood is not a probability per se but has the interpretation of the how likely is a certain outcome to appear: if f ( r 1 ) > f ( r 2 ) then the returns are more likely to be around r 1 than around r ← possible outcomes 1/6 ← probability of occurrence

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page2 density functions session one- statistics appendix ► The function f is called the density function, and you are probably already familiar with one such function – the bell shape “distribution” for a normal random variable. STATA provides a very simple way to plot the standard normal (zero mean and unit standard deviation) density function: twoway function normalden( x ), range(-4 4) The normalden( x ) is the built-in standard normal density function in STATA, all you need to specify is the range over which you need to plot the function, i.e. range( a b ) will indicate that the function will be evaluate for x ranging from a to b. In order to obtain the value of normalden at a certain point x, say x = 1, use display normalden(1). Remark : The density is symmetric around the mean (which is 0 for the standard normal); a fact that is a bit more difficult to see (or check) is that the area under the curve is equal to 1 (this is the continuous counterpart of the property that the sum of probabilities of all possible outcomes is 1). STATA provides a very simple way to plot the standard normal (zero mean and unit standard deviation) density function: twoway function normalden(x), range(-4 4) Figure 1. Normal density function

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page3 density functions session one- statistics appendix ► Normal distribution. A more general normal distribution is one with mean  and standard deviation . The connection between the density functions (for general and standard) is given by twoway function miunormal = normalden(x-1), range(-4 4) Here we plot the normal density for mean 1 and standard deviation 1. The “miunormal” in front of normalden function is simply for labeling purposes. twoway function sigmanormal = ½*normalden(x/2), range(-4 4) Here we plot the normal density for mean 0 and standard deviation 2. The “sigmanormal” in front of normalden function is simply for labeling purposes. Don’t forget to add the range you want ! twoway function msnormal = 1/sigma*normalden((x-miu)/sigma), range(-4 4) Here we plot the normal density for mean 0 and standard deviation 2. The “msnormal” in front of normalden function is simply for labeling purposes. Don’t forget to add the range you want ! normalden((x –  )/  ) STATA provides a very simple way to plot the normal density function for various means and standard deviations:

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page4 density functions session one- statistics appendix ► Normal(1,1) – The shape is maintained as symmetric around the new mean – basically you perform a shift of the curve towards the new mean. ► Normal(0,2) – The shape is changed: lower “peak” at the mean and fatter tails. The intuition is fairly straightforward: a higher standard deviation means that the outcomes further away from the mean becomes more likely. Remark : The command syntax for plotting several cures on the same graph simply requires to list the functions you want to plot between parentheses. Figure 2. Normal density function (different means) Figure 3. Normal density function (different variances)

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page5 density functions session one- statistics appendix ► The t-distribution. This is a very useful distribution, also known as the Student distribution, for testing hypotheses. The t -distribution has one parameter df – degrees of freedom. For the moment let’s visualize the distribution using STATA. twoway function tden( df,x), range(-4 4) Remark : On the left df = 2 while on the right we added a t -distribution with df = 20. In both cases the resemblance to the (standard) normal distribution is quite striking! Figure 4. t -distribution density function (different means)Figure 5. t -distribution density function (different df )

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page6 cumulative functions session one- statistics appendix ► Given a random variable X the function F is called the cumulative distribution function associated to X and is defined as F X ( x 0 ) = Pr[ X  x 0 ] The function F basically provides the probability that the random variable will be less than a given cutoff level. Graphically, F X ( x 0 ) is the area under the f (density function) curve to the left of x 0. STATA allows you to obtain directly the cumulative distribution function. Using STATA ’s command normal(-1) we obtain which is really the shaded area in the diagram. The curve represents the density for the standard normal distribution (mean is zero and standard deviation is 1). normal(-1) quiz Using the above result suppose you are asked what is F X (0) when X is a normal with mean 1 and standard deviation 1. Will this number be greater or smaller than the result above? Figure 6. The Normal distribution density function

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page7 cumulative functions session one- statistics appendix Using the above result suppose you are asked what is F X (0) when X is a normal with mean 1 and standard deviation 1. Will this number be greater or smaller than the result above? twoway function normalden( x -1), range(-3 5) twoway function normalden( x ), range(-4 4) Answer A change in mean does not change the shape of the normal density just the “location”, i.e. it shifts the curve horizontally. In both cases we are calculating the area at the left of the point (mean - 1), in the first case at the left of = -1 for a normal with mean 0, and in the second case at the left of 1 – 1 = 0 for a normal with mean 1. The result is the same in both cases! quiz Figure 7. A shift in the Normal distribution density function

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page8 cumulative functions session one- statistics appendix ► The connection between a normal distribution with mean  and standard deviation  and a standard normal distribution: normal((x-miu)/sigma) ► It is very easy to calculate the probability that the random variable X is greater than a given cutoff x 0 : Pr[ X  x 0 ]  1  F X ( x 0 ) Remark : The relation above follows from the fact that the total area under the density curve f is 1. Since F X ( x 0 ) is the area under this curve to the left of x 0, the area to the right of x 0, which is really Pr[ X  x 0 ], must be 1  F X ( x 0 ). 1 – normal(  1) normal((x –  )/  ) Figure 8. Area to the right of a given cutoff

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page9 cumulative functions session one- statistics appendix ttail( df,x) ► For the t distribution STATA provides a command that allows you to calculate directly the probability that t is greater than the cutoff x (thus the “right tail” area): Remark : Using STATA ’s command ttail(2,1), which gives the probability that a t -variable with 2 degrees of freedom is greater than 1, we get the answer which is the area under the t -distribution density curve and to the right of 1. To calculate the area to the left of some given cutoff x, we use of course 1 – ttail( df,x) ttail(2,1) Figure 9. Area to the right of a given cutoff

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page10 cumulative functions session one- statistics appendix ► The cumulative function provides the answer to the question: “what is the area, under the density function, to the left of a given cutoff point x 0 ?” ► The inverse function will answer the question: “what is the cutoff x 0 for which the area, under the density function, to the left of x 0 is equal to a given area  ?” ► Obviously these two questions are really the “two-sides of the same coin”: the connection between the cutoff x 0 and area  is simply F X ( x 0 )   The cumulative function solves for  when x 0 is given while the inverse function solves for x 0 when  is given. In STATA, given a number  between 0 and 1 we find x 0 using: invnormal(  ) invttail( df,  ) Remark : If you use invttail( df,  ) you will obtain the x 0 such that the area, under the distribution curve, to the right of x 0 is exactly . Since the density curve for the t distribution is symmetric around 0, the area to the left of – x 0 exactly equals the area to the right of x 0.

Managerial Economics & Decision Sciences Department session one – statistics appendix statistical models: hypotheses, tests & confidence intervals business analytics II Developed for random variables ◄ density functions ◄ cumulative functions ◄ © 2016 kellogg school of management | managerial economics and decision sciences department | business analytics II | page11 cumulative functions session one- statistics appendix invttail(2, ) –invttail(2, ) ► Below is an example for  = (you may check that ttail(2,2) = ) Figure 10. Inverse cumulative function for t distribution