A multilevel approach to geography of innovation Martin Srholec TIK Centre University of Oslo DIME International Workshop.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Multilevel modelling short course
Seven Deadly Sins of Dyadic Data Analysis David A. Kenny February 14, 2013.
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Lecture 11 (Chapter 9).
By Zach Andersen Jon Durrant Jayson Talakai
Lecture #11: Introduction to the New Empirical Industrial Organization (NEIO) - What is the old empirical IO? The old empirical IO refers to studies that.
Longitudinal and Multilevel Methods for Models with Discrete Outcomes with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David.
Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University of Surrey.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Objectives (BPS chapter 24)
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Chapter 13 Multiple Regression
The Simple Linear Regression Model: Specification and Estimation
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Chapter 3 Simple Regression. What is in this Chapter? This chapter starts with a linear regression model with one explanatory variable, and states the.
Chapter 12 Multiple Regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Clustered or Multilevel Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
An Introduction to Logistic Regression
Ch. 14: The Multiple Regression Model building
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
9. Binary Dependent Variables 9.1 Homogeneous models –Logit, probit models –Inference –Tax preparers 9.2 Random effects models 9.3 Fixed effects models.
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Correlation and Linear Regression
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Applications The General Linear Model. Transformations.
Chapter 12 Multiple Linear Regression Doing it with more variables! More is better. Chapter 12A.
Introduction Multilevel Analysis
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
CS 478 – Tools for Machine Learning and Data Mining Linear and Logistic Regression (Adapted from various sources) (e.g., Luiz Pessoa PY 206 class at Brown.
Managerial Economics Demand Estimation & Forecasting.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Chapter 16 Data Analysis: Testing for Associations.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
MERIT1 Does collaboration improve innovation outputs? Anthony Arundel & Catalina Bordoy MERIT, University of Maastricht Forthcoming in Caloghirou, Y.,
BPS - 5th Ed. Chapter 231 Inference for Regression.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Chapter 7 Demand Estimation and Forecasting
Chapter 14 Introduction to Multiple Regression
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Multiple Regression Analysis and Model Building
An introduction to basic multilevel modeling
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
CHAPTER 12 More About Regression
John Loucks St. Edward’s University . SLIDES . BY.
CHAPTER 29: Multiple Regression*
Simple Linear Regression
CHAPTER 12 More About Regression
Seminar in Economics Econ. 470
CHAPTER 12 More About Regression
Presentation transcript:

A multilevel approach to geography of innovation Martin Srholec TIK Centre University of Oslo DIME International Workshop on "Reconsidering the Regional Knowledge Economy: Theoretical, Empirical and Policy Insights from Diverse Research Approaches", KITE, Newcastle University, 4-5 September 2008, Newcastle, United Kingdom

A multilevel problem Multilevel modeling concerns hypothesis/problem/data with hierarchical structure A hierarchy refers to units clustered (grouped/related/similar, etc.) at different levels Acknowledged in many fields of social science or biology for a long time School performance is not only given by the amount of study time of a child, but also by higher-level factors such as characteristics of the class, school or national educational system Similarly the innovation process in firms is influenced by factors operating at different levels

Innovator and environment(s) Schumpeter focused on constraining factors in the environment: the forces of habit, resistance to innovation by the environment The idea about survival of firms propelled by innovation, but determined by the environment in the evolutionary economics National, regional and sectoral innovation systems: organizations, relations and institutions that enable innovation in firms A need to reach beyond the dichotomy between methodological individualism vs. collectivism in empirical research on innovation A need for multilevel analysis of interactions between various levels of the analysis in the innovation process

Multilevel modeling A multilevel model is a statistical model that relates a dependent variable to explanatory variables at more than one level Sociology, educational research, health science, demography, epidemiology or biology Not used in research on innovation (and rarely in economics)

Suppose a multilevel model has 2-level structure with firms at level-1 located in regions at level-2. A standard linear 2-level model with one explanatory variable at each level is the following: where y ij is the dependent variable, x ij is the level-1 predictor, w j is the level-2 predictor, e ij, u 0j and u 1j are random effects (normally distributed residual terms for each equation), i is the firm (i = 1…n) and j is the region (j = 1…m).

Why multilevel modeling? Single-level models assume that observations (and hence residuals) are independent from each other Single-level models may suffer from the fallacy of the wrong level Analyses that use exclusively micro data to account for the contextual effects often suffer from endogeneity A proper recognition of data hierarchies allows us to examine new lines of questions And at last but not least the concept of innovation systems implicitly predicts a nested structure of micro data

Why not just dummies? Using dummies might be a useful quick-fix solution, if we are interested exclusively in the level-1 relationships, but it is of a little help if we are interested in cross-level relations A dummy is a “catch-all” variable stripped of the context for which we can only speculate what it really represents If there are many higher-level units, “dummy” models will have many more parameters, decreasing the degrees of freedom If the higher-level dummies significantly improve predictive power of the model, which indeed is often the case in the literature; a multilevel model should be given priority

Dataset Micro data from CIS3 ( ) in the Czech Republic A representative sample of 5,829 enterprises with more than 10 employees was surveyed of which about 65% responded After omitting firms with incomplete records a dataset of 3,801 firms Firm-level variables: INNOV, SIZE, AGE and FOREIGN Factor analysis on 11 regional variables (77 regions at NUTS4)

NUTS4 regions (unemployment rate)

Results of the factor analysis (factor loadings after rotation)

A logit multilevel model where  ij is the log of the odds of success (to introduce an innovation, for instance). Although  ij is constrained to be in the interval (0,1), the logit transformation allows  ij to take any value and therefore can be substituted to the structural model. Note that the predicted log-odds can be converted to an odds by exp(  ij ) and to the predicted probability  ij by exp  ij  /(1+exp  ij  ). Since INNOV is binary, we need to specify a non-linear multilevel model. For this purpose, we distinguish between the sampling model (3.1), a link function (3.2) and a structural part of the multilevel model (3.3). Only the level-1 model differs from the linear case and can be written down as follows:

Basic model Although there are no level-2 predictors, we allow the level-1 intercept and slopes to vary across regions by including the random effects: where there are four fixed effects (  00 …  30 ) and four random effects (u 0j … u 3j )

Intercept-as-outcome model incorporates the regional variables for RIS and STR into the model as predictors of the level-1 intercept: where we assume that the level-2 predictors explain the different average propensity to innovate across regions, but let the level-1 effects of SIZE, AGE and FOREIGN remain “unconditional” at the regional level.

A typical domestic-owned firm (all of the level-1 effects of zero) located in the best region, which is the Prague agglomeration with the best RIS and modest STR score, is predicted to have 37.3% probability to innovate, whereas the same firm located in the Karviná region with the worst combination of conditions has only 24.4% probability to innovate.

Slopes-as-outcomes model in which we allow the regional variables to influence also slopes of the level-1 predictors: where the new fixed parameters (  11 …  31 and  12 …  32 ) indicate cross-level interactions between the regional and firm-level predictors.

Conclusions Quality of the regional innovation system directly determines firm’s likelihood to innovate and mediates the effect of some firm-level factors Also structural problems in the region influence innovation in firms Just a beginning to show that this approach is viable (another papers in progress) Main limits given by high demands on the quality (and quantity) of data and computational power of the computer (and the researcher…) Important implications for policy-making