Econometric analysis of CVM surveys. Estimation of WTP The information we have depends on the elicitation format. With the open- ended format it is relatively.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Probit The two most common error specifications yield the logit and probit models. The probit model results if the are distributed as normal variates,
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
Nguyen Ngoc Anh Nguyen Ha Trang
Models with Discrete Dependent Variables
Visual Recognition Tutorial
Binary Response Lecture 22 Lecture 22.
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Linear and generalised linear models
Copyright © Cengage Learning. All rights reserved. 6 Point Estimation.
Linear and generalised linear models
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Inferential Statistics
Bootstrapping applied to t-tests
Chapter 7 Estimation: Single Population
Confidence Interval Estimation
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Random Sampling, Point Estimation and Maximum Likelihood.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Lecture 3: Inference in Simple Linear Regression BMTRY 701 Biostatistical Methods II.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Estimating a Population Proportion
9-1 MGMG 522 : Session #9 Binary Regression (Ch. 13)
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Issues in Estimation Data Generating Process:
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Machine Learning 5. Parametric Methods.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Estimating standard error using bootstrap
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Copyright © Cengage Learning. All rights reserved.
ESTIMATION.
12. Principles of Parameter Estimation
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
THE LOGIT AND PROBIT MODELS
Chapter 6: Sampling Distributions
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
THE LOGIT AND PROBIT MODELS
CONCEPTS OF ESTIMATION
Continuous Random Variable
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Confidence Interval Estimation
10701 / Machine Learning Today: - Cross validation,
The Simple Linear Regression Model: Specification and Estimation
Chapter 10: Estimating with Confidence
Chapter 7: Sampling Distributions
Parametric Methods Berlin Chen, 2005 References:
12. Principles of Parameter Estimation
MGS 3100 Business Analysis Regression Feb 18, 2016
Business Statistics For Contemporary Decision Making 9th Edition
Presentation transcript:

Econometric analysis of CVM surveys

Estimation of WTP The information we have depends on the elicitation format. With the open- ended format it is relatively easy to obtain for example mean or median WTP, and with regression analysis we can estimate the determinants of the WTP. With other formats we get less information. For example with the single- bounded format we only know whether a respondents’ WTP is higher or lower than the bid. So the econometric analysis is going to depend on the elicitation format. We will look at: - Open-ended - Closed-ended

Open-ended responses

Open-ended analysis With an open-ended question we simply ask respondents what their WTP is. So for each individual we will have an observation of his or her WTP. What do we want to know? 1.Mean WTP? This is perhaps the most natural measure of WTP that we could use. For example, if we would like to conduct a welfare analysis, mean WTP would be a natural choice. 2. Median WTP? This would correspond to a relevant measure if we would want to know what would happen in a referendum. This tells us that 50% of the population have a WTP that is at least equal to the median. This measure is also much less sensitive to extreme values. We usually report both these measures.

Known problems with open-ended responses 1. Extreme values Often a few extreme responses that have a large impact on the mean WTP. Several solutions: - Remove the extreme values using some rule: Share of income, large impact on mean etc. - Report both mean and median, since the median is much more insensitive to outliers. 2. A large fraction of zeros Often between 10 to 50 percent of the respondents state a zero WTP. This might not be a problem for the method if their WTP actually is zero, although this could be discussed. However, it might have implications for the econometric analysis. We should distinguish between conditional and unconditional WTP…

Conditional and unconditional WTP We can define two types of respondents: 1. Zero WTP responses 2. Positive WTP responses So when we talk about mean WTP this could be for the whole population and for the population with positive WTP. Let P(Zero), be the probability that a respondent has a zero WTP, and let P(Positive) = 1-P(Zero) be the probability that a respondent has a positive WTP If the mean WTP for the conditional sample (positive WTP) is: E[WTP|WTP>0] Then the mean WTP for the unconditional sample is: E[WTP] = Pr(Zero)*0+ E(WTP|WTP>0)*Pr(Positive)

Closed-ended responses

Remember With closed-ended data, we only observe if a person says yes or no to a certain bid. This is very limited information. The question is how we can obtain estimates of mean and/or median WTP, and if we can estimate determinants of WTP?

Two different approaches: A. The random utility approach. Start from the utility function. Make assumptions about the functional form of the utility function and the error term of the utility function. (B. Parametric modeling of the WTP. Start from the WTP function. Make assumptions about the functional form of the WTP function and the error term of the WTP function). We will only do A

A. The Random Utility Model Basic idea From the investigator’s point of view there are random elements of the utility function that are not observable. Suppose that an individual is confronted with a CV scenario in which a discrete change in an environmental good from q0 to q1 is proposed. The indirect utility function is Where we have a random element in the utility function. In the CV scenario a certain bid or cost is proposed. The probability that the respondent will respond with a Yes given the bid t k can then be expressed as

Note We assume that 1. the individual understands the proposed change in the environmental good, 2. is capable of evaluating the effect of this change on his or her utility and 3. considers the proposed bid level. Furthermore, his or her response depends only on the maximization of the underlying utility function. All of these assumptions may or may not be correct.

The road ahead We are not even close to have an empirical model yet. We will have to make a number of assumptions now, including assumption about the functional for of the utility function and the distribution of the random term in the utility function.

Assumption 1. Additive error term Assume that the error term is additive The probability can then be written Define We can then write Where F is the CDF (cummulative density function) of the error term. For a symmetric distribution This is something we will use later on.

The PDF (probabilty density function) CDF and PDF gives a complete description of the probability distribution of a random variable. PDF The probability that a random variable X takes the value x, f(x) > 0 Some properties:

The CDF The probability that X will assume any value less than or equal to some specific x. This is equal to the area under the pdf. Some properties:

The probability of a Yes-response So the value of the CDF at the change in utility gives the probability of a yes responses. The probability is increasing in the change in utility. The probability must be between 0 and 1. The cdf is between 0 and 1.

Assumptions 2 and 3 In order to proceed we need to make assumptions about the functional form of the utility function and a specific assumption about the distribution of the error term. In theory we could consider any functional form of the utility function. For simplicity and identification a set of functional forms are more common than others. We will mainly work with the very simple case of a linear utility function. Mainly because it is simple to work with. Regarding error terms the two most common assumptions are - Normal distribution (Probit) - Logistic distribution (Logit).

Assumption 2. A linear utility function For individual k the indirect utility for a certain level of the public good is The utility levels for the two responses: No and Yes are Using our previous result we thus have Now we have to make an assumption about the error terms.

Assumptions 3a. Normal distribution (Probit) IID normally distributed We have to convert this into a standard normal distribution to be able to estimate the model The probability of a yes response is Two parameters are estimated. The parameters are divided by an unknown standard deviation, called the scale parameter. Only the ratios can be estimated. Often the model is presented without the scale parameter. The probability of yes response is then

Assumption 3b. Logistic distribution (Logit) IID logistically distributed with mean zero and variance Convert this into a standard logistic distribution The probability of a Yes response is then Note, we have an unidentified scale parameter and only the ratios can be estimated.

But the main purpouse was to get the WTP... From the assumed utility function we can derive a WTP function. WTP is the maximum amount of money an individual is willing for the new level of the public good. Solving yields individual k’s WTP Note: - This is the WTP for a certain individual. - WTP depends on the distribution There are 3 sources of variation (for now only look at the first one) - Preference uncertainty. Unobserved heterogeneity. -Variation across individuals. Observed heterogeneity. -Uncertainty from the randomness of parameters. The parameters are obtained from maximum likelihood.

1. Preference uncertainty Two common measures of central tendency with respect to preference uncertainty are the mean (expected value) and the median. The mean is in this case For both the normal and logistic distribution since they have mean zero.

Note something important The mean for a linear utility function is But as we have shown we estimate However, this does not matter, since the scale parameter is going to cancel from the expression. This is a very important fact, that we will come back to later on.

Estimating the model So the probit model said that the probability of a yes response depends on an intercept and the bid-level (Actually we write it with a minus sign infront of the bid level). In Limdep: Probit;lhs=yes;rhs=one,bid$ Calc;list;wtp=-b1/b2$ logit;lhs=yes;rhs=one,bid$ calc;list;wtp=-b1/b2$ OR with Wald-command: wald;start=b;var=varb;labels=4_b; fn1=-(b1/b2)$ This is all we need to write to get the probit/logit results and the WTP values.....

The results Remember what we said about the coefficients and the scale parameter. So the “size” of the coefficients here does not really say much since they are divided by the scale parameter. Note also the difference between probit and logit results, but that the mean WTP are almost identical. We are not primarily interested in the coefficient estimates, but WTP. But you should check, in this simple model, that the bid is significant.

Sources of variation There are 3 sources of variation: 1. Preference uncertainty, due to the random utility formulation of the utility function. 2. Variation across individuals. The WTP function can include covariates. 3. Uncertainty from the randomness of parameters. The parameters are obtained from maximum likelihood. Let us now look at 2 and 3.

2. Variation across individuals The utility function is often a function of socio-economic characteristics. These are included in order to capture individual heterogeneity. Doing the same derivations as before we have Note: We have a mean WTP due to the preference uncertainty, but we can also talk about the mean WTP of our sample. In the latter case the most common approach is to calculate mean WTP at sample mean or to calculate mean WTP for each individual and then take the mean of that.

3. Randomness of parameters We want to find the standard deviation of an expression that is a non-linear function of a number of parameters. Delta method A first-order Taylor series of the WTP expression WALD command in Limdep Krinsky-Robb Draw a number of times from the asymptotic normal distribution of the parameter estimates and calculate the welfare measure for each of these draws.

Randomness of parameters in Limdep We can use a command called Wald in Limdep to get the standard deviatins of a non-linear function using either the Delta-method or Krinsky-Robb (In the previous version of Limdep KR was not available). Delta method Probit;lhs=yes;rhs=one,nbid$ wald;labels=b1,b2;start=b;var=varb; fn1=b1/b2$ Krinsky-Robb Probit;lhs=yes;rhs=one,nbid$ wald;labels=b1,b2;start=b;var=varb; fn1=b1/b2; K&R;Pts=1000$

Summarising the RUM analysis - Specify a utility function. - Specify an empirical model by specifying the distribution of the error terms. - Estimate the parameters (confounded by the scale parameter) - Derive the WTP function from the utility function - Obtain mean/median WTP expressions - Using the estimated parameters calculate mean/median WTP.