Introduction to elementary quantitative concepts and methods Guest lecture Carl Henrik Knutsen, 14/5-2008.

Motivation
In the social sciences, as in science in general, we are mostly interested in two kinds of questions:
- “How” questions
- “Why” questions
Social scientists seek descriptions of empirical phenomena and try to come up with causal explanations. Both quantitative and qualitative methodologies try to answer such questions. The nature of the research question is important for the choice of methodology, even if in the real world of social science researchers often choose a method according to their knowledge and “taste”. Knowledge of different methodologies allows researchers and students to fit the methodology to the research question, which improves the analysis. Triangulation can often be a good idea: using different methodologies to illuminate a problem in a more comprehensive fashion. Knowledge of elementary quantitative methods also enables you to read different types of research.

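Causality and the control problem
The problem is independent of the choice of methodology; theory and clever design are needed. Three causal structures that might lead to a correlation between X and Y:
- X → Y (X causes Y)
- Y → X (Y causes X)
- X ← Z → Y (a third variable Z causes both X and Y)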

Generalization
This is the big advantage of quantitative methods: they provide stringent criteria for when we can be relatively certain that our generalizations hold true and are not driven by coincidence. Remember that in the social sciences we do not face deterministic relationships between factors. Quantitative methods take into account the stochastic structure of social life.

Data
A vast number of data sources exist, constructed by different agencies or researchers, so for many purposes you do not need to construct your own data. But know the data you use in order to avoid pitfalls. Sources on the web: World Development Indicators, Penn World Tables, World Governance Indicators, Polity, Freedom House, OECD, UNESCO, UNCTAD, etc.

Descriptive statistics
Descriptive vs. inferential statistics. Descriptive statistics draw out comprehensible information about the structure of your data: 1) central tendencies, 2) variation, 3) correlation.

Central tendency of a variable
- Mean
- Median
- Mode

Variation
- Range
- Variance: S^2 = Σ(X - M)^2 / (N - 1), where M is the mean
- Standard deviation: S, the square root of the variance
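As a rough illustration of the central-tendency and variation measures above, the following sketch computes them for a handful of invented income values. The data and variable names are assumptions made for the example, not part of the lecture.

```python
import statistics

# Hypothetical incomes, invented for illustration only
incomes = [100, 150, 150, 250, 300]

mean = statistics.mean(incomes)        # arithmetic mean M
median = statistics.median(incomes)    # middle value of the sorted data
mode = statistics.mode(incomes)        # most frequent value
value_range = max(incomes) - min(incomes)

# Sample variance written out by hand: S^2 = Σ(X - M)^2 / (N - 1)
n = len(incomes)
variance = sum((x - mean) ** 2 for x in incomes) / (n - 1)
std_dev = variance ** 0.5              # standard deviation S

print(mean, median, mode, value_range, variance, std_dev)
```

The hand-written variance should agree with `statistics.variance`, which also divides by N - 1.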

Correlation
- Covariance: cov(X,Y) = Σ((X - Xm)(Y - Ym)) / (N - 1), where Xm and Ym are the means of X and Y
- Correlation coefficients. Pearson’s r = cov(X,Y) / (S(X) * S(Y)), always between -1 and 1.
NB: r gives only the degree of linear relationship.
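A minimal sketch of the covariance and Pearson's r formulas, using made-up numbers. The second data set is a deliberately curved (U-shaped) relationship, chosen to illustrate the point that r captures only linear association; all names and values here are assumptions for the example.

```python
def mean(values):
    return sum(values) / len(values)

def covariance(xs, ys):
    """Sample covariance: Σ((X - Xm)(Y - Ym)) / (N - 1)."""
    xm, ym = mean(xs), mean(ys)
    return sum((x - xm) * (y - ym) for x, y in zip(xs, ys)) / (len(xs) - 1)

def std_dev(values):
    """Sample standard deviation (the variance is the covariance of a variable with itself)."""
    return covariance(values, values) ** 0.5

def pearson_r(xs, ys):
    """Pearson's r = cov(X,Y) / (S(X) * S(Y))."""
    return covariance(xs, ys) / (std_dev(xs) * std_dev(ys))

x = [1, 2, 3, 4, 5]
y_linear = [2, 4, 6, 8, 10]            # perfectly linear in x
y_curved = [(v - 3) ** 2 for v in x]   # fully determined by x, but U-shaped

r_linear = pearson_r(x, y_linear)
r_curved = pearson_r(x, y_curved)
print(r_linear, r_curved)
```

In the curved case Y depends entirely on X, yet r is zero, so a low r does not by itself rule out a relationship.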

Presentation of data
- Tables
- Histograms
- Bar and pie charts
- Scatter plots
It is important to think about the reader: be comprehensible and informative. Strike a balance in the amount of information presented in a chart. Label charts.

Table
                 Male                               Female
                 No higher education   University   No higher education   University
Mean income (N)  150 (2000)            300 (1000)   100 (2500)            250 (700)

Scatter plot

Inferential statistics
The aim is solid inference from an observed sample to a larger (unobserved) universe: generalization about populations or about effects. For effects: can we say that the relationships we observe are due to “real” effects, or are they likely only a product of chance?

Law of large numbers...
- Population and samples
- Estimates and the underlying mean
Random selection? Selection bias is ALWAYS a possibility. Sampling techniques:
- Experiment
- Random draws
- Stratification

Hypothesis test
Democracy and economic growth as an example:
- H0: Democracy has no effect on growth
- Halt: Democracy has an effect on growth
In general, H0 is often a hypothesis claiming that there is no effect. We often want to investigate whether we can claim with relative certainty that Halt is valid. The burden of proof is on the alternative hypothesis. This gives a conservative bias: we need relatively strong results to claim that a relationship is not due to pure chance. The central limit theorem is the underlying result. How do we know the distribution given H0? Use the given distribution, here the normal distribution, to find out what one is likely to arrive at by pure chance.

Central limit theorem
“The central limit theorem is one of the most remarkable results of the theory of probability. In its simplest form, the theorem states that the sum of a large number of independent observations from the same distribution has, under certain general conditions, an approximate normal distribution. Moreover, the approximation steadily improves as the number of observations increases. The theorem is considered the heart of probability theory, although a better name would be normal convergence theorem.” (Berrie Zielman)
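The theorem can be checked informally by simulation. The sketch below draws many samples from a skewed (exponential) distribution and looks at how the sample means are spread. The distribution, sample size, number of replications and seed are all arbitrary choices made for this illustration.

```python
import random

rng = random.Random(0)  # fixed seed so the run is reproducible

def sample_mean(n):
    """Mean of n independent draws from an exponential distribution with rate 1."""
    return sum(rng.expovariate(1.0) for _ in range(n)) / n

n = 50               # observations per sample
replications = 2000  # number of samples drawn
means = [sample_mean(n) for _ in range(replications)]

# An exponential(1) variable has mean 1 and standard deviation 1,
# so the standard error of the sample mean is 1 / sqrt(n).
standard_error = 1.0 / n ** 0.5

# If the sample means are roughly normal, about 95% of them should lie
# within 1.96 standard errors of the true mean.
share_within = sum(abs(m - 1.0) <= 1.96 * standard_error for m in means) / replications
print(share_within)
```

Although the individual observations are far from normally distributed, the share should come out close to 0.95, and closer still as n grows.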

Significance levels and p-values
Significance level: taking H0 as true, we set a critical level beyond which results are unlikely to be observed, for example 5%. There is then only a 5% probability of seeing a relationship this strong if H0 is true. It is important to have a large sample. P-value: the lowest significance level that leads to rejection of H0. In other words: if H0 is true, what is the probability of seeing a result at least this extreme?
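A small sketch of how a p-value can be computed when the test statistic is approximately standard normal under H0. The estimate and standard error are hypothetical numbers, and the two-sided normal test is an assumption made for the example; other tests use other distributions.

```python
import math

def two_sided_p_value(z):
    """Probability, under a standard normal H0 distribution, of a result at least as extreme as z."""
    return math.erfc(abs(z) / math.sqrt(2))

estimate = 1.2         # hypothetical estimated effect
standard_error = 0.5   # hypothetical standard error of that estimate

z = estimate / standard_error   # distance from H0 (no effect) in standard errors
p_value = two_sided_p_value(z)

significance_level = 0.05
reject_h0 = p_value < significance_level
print(z, p_value, reject_h0)
```

A z-value of 1.96 gives a p-value of almost exactly 0.05, which is why 1.96 is the usual critical value at the 5% level.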

Models
Stockburger: “A model is a representation containing the essential structure of some object or event in the real world.”
- 1. Models are necessarily incomplete.
- (2. The model may be changed or manipulated with relative ease.)

Regression analysis
How to fit a straight line through a scatterplot! Best fit: one criterion is to minimize the sum of squared residuals → Ordinary Least Squares (OLS). Bivariate regression equation: Y = a + bX + ε. Regression analysis recognizes that the world is not deterministic; this is the role of the error term ε. Large error terms in general imply large uncertainty.
- Interpretation of a: the mean value of Y when X is equal to zero. Often it has no substantive interpretation and is not so interesting.
- Interpretation of b: the increase in the mean of Y when X increases by one unit. The effect of X on Y?
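For the bivariate case, the OLS line has a well-known closed form: b = cov(X,Y) / var(X) and a = Ym - b * Xm. The sketch below applies it to a few invented data points; the numbers and names are assumptions for illustration only.

```python
def fit_line(xs, ys):
    """Return (a, b) for the line Y = a + bX that minimizes the sum of squared residuals."""
    n = len(xs)
    xm = sum(xs) / n
    ym = sum(ys) / n
    sxy = sum((x - xm) * (y - ym) for x, y in zip(xs, ys))
    sxx = sum((x - xm) ** 2 for x in xs)
    b = sxy / sxx        # slope: change in mean Y per one-unit increase in X
    a = ym - b * xm      # intercept: mean Y when X equals zero
    return a, b

# Made-up observations
x = [0, 1, 2, 3, 4]
y = [1.1, 2.9, 5.2, 6.8, 9.0]

a, b = fit_line(x, y)

# Residuals are the estimated error terms: observed Y minus fitted Y
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
sum_squared_residuals = sum(e ** 2 for e in residuals)
print(a, b, sum_squared_residuals)
```

With a constant term in the model, the OLS residuals sum to zero; their squared sum is the quantity being minimized.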

Assumptions about the distribution of the error term when using OLS:
- Homoskedastic
- No autocorrelation
- Normally distributed

Multivariate regression
Y = a + b1X1 + b2X2 + b3X3 + ε
New interpretation of b: the mean increase in Y when the relevant X increases by one unit, given that all other variables are held constant. The other X-variables act as “control variables”. R-squared: how much of the variation in the data is “explained by the model” (a very imprecise interpretation). It goes from 0 to 1. Extensions of regression analysis: Generalized Least Squares, systems of equations, Instrumental Variables, Logit and Probit models, and many more.
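A minimal sketch of multivariate OLS and R-squared in plain Python, with two X-variables instead of three to keep it short. It solves the normal equations with a small generic Gauss-Jordan routine; this is just one way to do it for illustration, and real analyses would normally rely on a statistics package. The data are made up.

```python
def solve(matrix, rhs):
    """Solve a small linear system by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def ols(rows, y):
    """Return [a, b1, b2, ...] minimizing the sum of squared residuals."""
    design = [[1.0] + list(row) for row in rows]   # leading 1.0 is the constant term
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)

# Made-up data: each row is (X1, X2), where X2 is a 0/1 variable
x_rows = [(0, 1), (1, 0), (2, 1), (3, 0), (4, 1), (5, 0)]
y = [-1.5, 3.5, 1.0, 6.0, 6.5, 11.5]

coefficients = ols(x_rows, y)   # [a, b1, b2]
fitted = [coefficients[0] + sum(b * x for b, x in zip(coefficients[1:], row))
          for row in x_rows]

# R-squared: 1 minus (residual sum of squares / total sum of squares)
y_mean = sum(y) / len(y)
ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
ss_tot = sum((yi - y_mean) ** 2 for yi in y)
r_squared = 1 - ss_res / ss_tot
print(coefficients, r_squared)
```

Here b1 is the mean change in Y for a one-unit increase in X1 with X2 held constant, which is the interpretation given above.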

Extensions
- Dummy variables
- Squared X
- Logarithmic specifications
- Splitting the sample
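These extensions mostly amount to transforming or partitioning the data before running the regression. The sketch below shows what that could look like; the country records and field names are hypothetical and chosen only to mirror the democracy example used earlier.

```python
import math

# Hypothetical country records, invented for illustration only
countries = [
    {"name": "A", "democracy": True, "gdp_per_capita": 1000.0},
    {"name": "B", "democracy": False, "gdp_per_capita": 10000.0},
]

def build_regressors(record):
    """Turn one raw record into transformed X-variables."""
    gdp = record["gdp_per_capita"]
    return {
        "democracy_dummy": 1 if record["democracy"] else 0,  # dummy variable
        "gdp_squared": gdp ** 2,                             # squared X, allows a curved relationship
        "log_gdp": math.log(gdp),                            # logarithmic specification
    }

regressors = [build_regressors(c) for c in countries]

# Splitting the sample: a separate regression could then be run on each subset
democracies = [c for c in countries if c["democracy"]]
non_democracies = [c for c in countries if not c["democracy"]]
print(regressors)
```

With a logarithmic X, equal steps in the regressor correspond to equal proportional changes in the original variable: a tenfold increase in GDP per capita always adds ln(10) to log GDP.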

Problems
1) “Simultaneity bias”: reverse causation. Exogeneity vs. endogeneity of X-variables.
2) “Omitted variable bias”
3) Measurement error
- Reliability: where do the data come from? Example: GDP in developing countries.
- Validity (example: TFP and technological change)
Operationalization of variables: they have to be observable, quantifiable and measurable.