Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Estimation of Means and Proportions
“Students” t-test.
Hypothesis testing Another judgment method of sampling data.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Copyright © 2011 Pearson Education, Inc. Statistical Tests Chapter 16.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Sampling Distributions (§ )
Inferential Statistics & Hypothesis Testing
Probability - 1 Probability statements are about likelihood, NOT determinism Example: You can’t say there is a 100% chance of rain (no possibility of.
Chapter 10: Hypothesis Testing
Topic 6: Introduction to Hypothesis Testing
Evaluation (practice). 2 Predicting performance  Assume the estimated error rate is 25%. How close is this to the true error rate?  Depends on the amount.
1 MF-852 Financial Econometrics Lecture 4 Probability Distributions and Intro. to Hypothesis Tests Roy J. Epstein Fall 2003.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Lecture 8 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Topic 2: Statistical Concepts and Market Returns
Evaluating Hypotheses
Experimental Evaluation
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Hypothesis Testing.
AM Recitation 2/10/11.
Hypothesis Testing:.
Statistical Hypothesis Testing. Suppose you have a random variable X ( number of vehicle accidents in a year, stock market returns, time between el nino.
Statistical inference: confidence intervals and hypothesis testing.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Individual values of X Frequency How many individuals   Distribution of a population.
Mid-Term Review Final Review Statistical for Business (1)(2)
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 16 Statistical Tests.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Inference: Probabilities and Distributions Feb , 2012.
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Ch4: 4.3The Normal distribution 4.4The Exponential Distribution.
Continuous Random Variables Lecture 26 Section Mon, Mar 5, 2007.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Inferential Statistics Psych 231: Research Methods in Psychology.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
Econometric Models The most basic econometric model consists of a relationship between two variables which is disturbed by a random error. We need to use.
What are their purposes? What kinds?
Psych 231: Research Methods in Psychology
Sampling Distributions (§ )
Presentation transcript:

Estimation and Hypothesis Testing

The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for return Use statistics to estimate the correct PDF. Can do for discrete PDF’s. For continuous PDFs, beyond the scope of this course. 1)Assume the normal PDF 2)Use statistics to estimate E[r] and .

The Game of Investing Suppose you’re offered to play a game: Cost: $1.25 Flip a coin If heads, you get $2 (return is 60%) If tails, you get $0 (return is -100%) Coin may be biased Assume the true pdf for this investment is Of course, this information is unknown!

The Game of Investing You would like to know What will the coin flip turn out to be? Not possible What is the PDF (probability of getting heads/tails)? Not possible to know with certainty: need to estimate E[r] and  Not possible to know perfectly: need to estimate. How to estimate? Why not flip the coin a few times?

Random Sample Suppose you flip the coin 10 times and get H, H, T, H, H, H, H, H, H, T 8 heads, 2 tails This is an independent random sample A sample of observations generated by the same pdf Independent: One outcome does not affect others Nothing other than the pdf determines how observations are chosen. No “cherry picking”: picking certain observations you like, and eliminating others How do we use this information to estimate f(heads) E[r] 

Estimators Estimator – function of outcomes for a random sample. An estimator is a random variable It has it’s own PDF! Let be an estimator of E[r] Two important properties of estimators: Unbiased: Consistent: As the number of observations we observe becomes large, Suppose we use the following rule to get our estimator of E[r]: For any random sample, choose the first observation as our estimate of E[r]. Call this estimator r 1.

Estimator of E[r] Is r 1 unbiased? For any random sample of any size, r 1 is simply a random variable governed by the pdf So E[r 1 ]=12%=E[r] r 1 is therefore an unbiased estimator of E[r]

Estimator of E[r] Is r 1 consistent? For any random sample of any size, r 1 is simply a random variable governed by the pdf So Var[r 1 ]= for any sample size. r 1 is therefore not a consistent estimator of E[r]

Estimator of E[r] Suppose we use the following rule to get our estimator of E[r]: For any random sample, take the average return as our estimator of E[r]. Call this estimator.

Estimator of E[r] Is unbiased? What is ? Use stat rule #1 is therefore an unbiased estimator of E[r]

Estimator of E[r] Is consistent? How do we find the variance of a sum of random variables? We haven’t learned this yet, but we will later. We can generate random samples of estimators to get some idea of the properties of their PDF. For each outcome for an estimator, we need to generate a random sample of observations, and then compute the estimator. Use Excel Spreadsheet (posted on course website)

Comment Why do we use probability weights when calculating E[r] from a pdf, but when we estimate E[r] we just use an equally weighted average? Given the PDF above, we should expect in any random sample to see heads 70% of the time. Assume we draw a sample where exactly 70% are heads 70% of the returns in the sample will be % of the returns in the sample will be A simple average across observations is equal to Simple averages naturally put more weight on those outcomes which are more likely.

Estimator of Stdev[r] How do we use data to estimate Stdev[r]? Var[r]=E(r-E[r]) 2 = E[r 2 ]-E[r] 2 Stdev(r)=sqrt(Var(r)) Suppose we use the following rule to get our estimator of Stdev[r]: Is the estimator unbiased? Is the estimator consistent? Use Excel spreadsheet.

Average The same results apply to continuous PDFs For a given random sample:

Estimates When is the average a good estimate of E[r]? When is our estimator for standard deviation a good estimate of Stdev[r]? When you have a large sample of outcomes When the PDF doesn’t change mid-sample

Stat Rules Stat Rule 1.E Let x 1,…,x n be a random sample of the random variable X. Let y 1,…,y n be a random sample of the random variable Y. Let z i =ax i +by i, for i=1,…,n where a and b are constants. Then Stat Rule 2.E Let x 1,…,x n be a random sample of the random variable X. Let z i =ax i +c, for i=1,…,n where a is a constant Then

Estimated Sharpe Ratio The Sharpe Ratio may be estimated as where we use the yield on a t-bill as a proxy for the risk-free rate.

Time and E[r] and Stdev[r] E[r] and stdev[r] have a unit of time attached to them. E[r]=10% over a year is much different than E[r]=10% over a day.  [r]=0.16 over a year is much different than  [r]=0.16 over a day. Let p denote a “short” time period (e.g., a month) Let P denote a “long” time period (e.g., a year) Let N denote the number of “short” time periods in a “long” time period (e.g., 12) Let E p [r] and  p [r] be the appropriate parameters over the short time period Let E P [r] and  P [r] be the appropriate parameters over the long time period Then to a close approximation,

How Good Are the Estimates? Does the E[r] for a stock meet some pre-determined benchmark? You can’t observe the PDF to calculate the true E[r] Over the past 10 years, returns have been as follows: From this you estimate E[r] to be 18.3% Is this enough information to reject the hypothesis that the true E[r] for the PDF that generated this sample is 10%?

Hypothesis Testing Null Hypothesis The hypothesis to be tested E[r]=10% Alternative Hypothesis E[r]  10%

Hypothesis Testing c = distance of test statistic from null hypothesis that defines zone of acceptance.

Hypothesis Testing Standard Practice: Choose c so that we know the probability of making a type 1 error.

Hypothesis Tests of the Mean Let (X1, X2, …,X n ) be an independent random sample from any PDF True mean=  True standard deviation=  What PDF governs the outcome for the sample average? The laws of statistics say that the sample average is approximately Normally distributed True mean=  Standard Deviation= The standard deviation of the sample average is called the “standard error”

Hypothesis Testing Null Hypothesis E[r]=10% Alternative Hypothesis E[r]  10% Assuming the null is true, the sample mean is approximately normally distributed with  =.10  =0.0921

Normal Distribution

Hypothesis Testing For any normally distributed random variable, there is only a 5% probability of getting an outcome above  or below . Assuming the null is true, there is only a 5% chance of drawing a sample average above *(0.0921) = or below *(0.0921)= If the sample average is above or below , we therefore conclude that it’s too unlikley (<5%) that we would observe such an outcome, given the null is true. Hence, the null must not be true. Reject H 0.

Hypothesis Testing

The sample average we observe is 18.3% This is in the zone of acceptance. Do not reject the null hypothesis. “We cannot reject the hypothesis that the true mean is 10% with 95% Confidence.

Hypothesis Tests of the Mean Example Hypothesis:  =2% Alternative:   2% 100 years of stock market returns Sample Average = 16% Standard Deviation = 0.18 Hence, standard error is 0.18/10 = 0.018

Hypothesis Tests of the Mean c = 1.96*0.018 = Assuming null hypothesis is true, too unlikely you would observe the actual sample mean. Reject the null hypothesis.

T-statistic