1 ESSnet on Small Area Estimation Meeting no. 4 ESSnet on Small Area Estimation Meeting no. 4 Neuchatel, 7-8 July 2011 WP4: Software Tools.

Slides:



Advertisements
Similar presentations
GRA 5917 Public Opinion and Input Politics. Lecture September 16h 2010 Lars C. Monkerud, Department of Public Governance, BI Norwegian School of Management.
Advertisements

Apr-15H.S.1 Stata: Linear Regression Stata 3, linear regression Hein Stigum Presentation, data and programs at: courses.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 12, 2012.
Kin 304 Regression Linear Regression Least Sum of Squares
Multilevel Modeling in Health Research April 11, 2008.
Sampling: Final and Initial Sample Size Determination
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 1 The Where, Why, and How of Data Collection
Multiple regression analysis
Section 7.3 Table summarizes hypothesis tests to compare two means. These tests assume that each sample is taken from a normal distribution, but.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
ESSnet on Small Area Estimation Stefano Falorsi ISTAT International conference on Indicators and Survey Methodology, 2010, Wien
Statistical Methods Chichang Jou Tamkang University.
Small area Estimation of Italian poverty and social exclusion indicators Stefano Falorsi Michele D’Alò Loredana Di Consiglio Fabrizio Solari Matteo Mazziotta.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Clustered or Multilevel Data
FACTOR THE FOLLOWING: Opener. 2-5 Scatter Plots and Lines of Regression 1. Bivariate Data – data with two variables 2. Scatter Plot – graph of bivariate.
Finding Data for Quantitative Analysis Lecture 11.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Examples from Singer’s Using SAS Proc Mixed to Fit Multilevel Models… “ To use the paper effectively, … in particular the reader must understand: The difference.
Chapter 12 Section 1 Inference for Linear Regression.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Analysis of Clustered and Longitudinal Data
Arun Srivastava. Small Areas What is a small area? Sub - population Domain The Domain need not necessarily be geographical. Examples Geographical Subpopulations.
Lecture 12 Model Building BMTRY 701 Biostatistical Methods II.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Lecture 3: Inference in Simple Linear Regression BMTRY 701 Biostatistical Methods II.
Chapter 12: Linear Regression 1. Introduction Regression analysis and Analysis of variance are the two most widely used statistical procedures. Regression.
Mixed Linear Models An Introductory Tutorial. What we have covered!!! The Linear model: Mean Structure or Fixed Effects Errors.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Regression. Population Covariance and Correlation.
1 Enhancing Small Area Estimation Methods Applications to Istat’s Survey Data Ranalli M.G. ~ Università di Perugia D’Alo’ M., Di Consiglio L., Falorsi.
1 ESSnet on Small Area Estimation Stefano Falorsi Istat
ESSnet on Small Area Estimation
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Multilevel Modeling Software Wayne Osgood Crime, Law & Justice Program Department of Sociology.
NAME Evaluation Report Name of author(s) Name of institution Year.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Copyright © 2010 Pearson Addison-Wesley. All rights reserved. Chapter 12 Multiple Linear Regression and Certain Nonlinear Regression Models.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
One-Way Analysis of Covariance (ANCOVA)
Ch14: Linear Least Squares 14.1: INTRO: Fitting a pth-order polynomial will require finding (p+1) coefficients from the data. Thus, a straight line (p=1)
Analytical Example Using NHIS Data Files John R. Pleis.
Essnet STAND-PREP Rome, 6-7 June 2011 Rome, 6-7 June 2011 ESSnet “Preparation of standardisation” WP1 report: outline of the report, content, responsibility.
Analysis of Experiments
Lesson 4.5 – Conducting a Survey to Collect Two-Variable Data.
Joint Eurostat Unece Worksession on Statistical Data Confidentiality 2011, Tarragona Initial analyses on comparable dissemination from the Essnet project.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Scatter Plots & Lines of Best Fit To graph and interpret pts on a scatter plot To draw & write equations of best fit lines.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Conflict-of-interest disclosures: none NIH Working Group: Multi-level Interventions in Vulnerable Populations September 15, 2015 Henry Feldman, PhD Clinical.
Population (millions)
Lab 9 – Regression Diagnostics
STAT 6304 Final Project Fall, 2016.
Regression Analysis Week 4.
Estimation Point Estimates Industrial Engineering
Basic Econometrics Chapter 4: THE NORMALITY ASSUMPTION:
Review of Hypothesis Testing
Tue 8-10, Period III, Jan-Feb 2018
Regression diagnostics
Crime survey Neuchatel, 7-8 July 2011
Lecture 12 Model Building
Regression Forecasting and Model Building
Istat - Structural Business Statistics
Principal Component Analysis
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Presentation transcript:

1 ESSnet on Small Area Estimation Meeting no. 4 ESSnet on Small Area Estimation Meeting no. 4 Neuchatel, 7-8 July 2011 WP4: Software Tools

2 2 Deliverables  4.1 review of available SAE routines and software, focusing on open source software (final report: October 2011)  4.2: review on the software in use in the ESS (final report: October 2011)  4.3: recommendations for standardisation and certification of ESS tools for SAE (final report: October 2011)  4.4: collection of existing routines and/or development of new routines (final version by October 2011)

3 3 Task 4.1: Review of available SAE routines and software, focusing on open source software  R (CBS); SAS (GUS); MLwiN, SPSS, Stata, BUGS/WinBUGS (Istat): Contents: General description, Small area estimation, References CBS: OK GUS: Small area estimation, References Istat: References Deadline: 15 August 2011

4 4 Task 4.2: Review on the software in use in the ESS Each member: brief section summarizing the questionnaires (only CBS sent it) Deadline: 15 August 2011

5 5 Task 4.3: Recommendations for standardisation and certification of ESS tools for SAE  Artificial population: –From R. Valiant, A. H. Dorfman, R. M. Royall (2001) “Finite Population Sampling and Inference”, section 5.7.1, page 153 –Input dataset: Hospital: sample of short-stay hospitals with fewer than 1000 beds (Appendix B.2, page 424) –Variables: number of beds (x), number of discharged patients (y) –Artificial population : –Number of records: 1,000,000 –Number of areas: 1,000 –Number of dependent variables: 6

6 6 Task 4.3: Recommendations for standardisation and certification of ESS tools for SAE Scatter plots of y1, y2, y3,y4,y5, y6 vs x. The line is the regression line of y given x

7 7 Task 4.3: Recommendations for standardisation and certification of ESS tools for SAE  Artificial population:  Between area variance equals to 1/10 and 1/1000 of the within area.  Other covariates other than x. It could be an option to use transformation of x, both continuous and categorigal. Another option might be to add a higher regional level by grouping areas into classes. Some dependence of the target variables on these covariates may then be introduced. The regional variable could then be used as the level of estimation (so that we can try SAE with fewer areas) or as a categorical covariate (so that we can try a model with larger number of fixed effects ).  Another option might be to add a higher regional level by grouping areas into classes. Some dependence of the target variables on these covariates may then be introduced. The regional variable could then be used as the level of estimation (so that we can try SAE with fewer areas) or as a categorical covariate (so that we can try a model with larger number of fixed effects).  how many samples to draw from the populations? They might be simple random samples, e.g. one of size and one of size ? Or 200,000? How many areas?

8 8 Task 4.4: Collection of existing routines and/or development of new routines  Computation of the estimates and MSE (Istat):  Linear mixed model:  Unit level: Synthetic, EBLUP (uncorrelated random effects), EBLUP (correlated random effects)  Area level: Synthetic, EBLUP (uncorrelated random effects)  Logistic mixed model  Model choice (CBS):  Conditional AIC (unit and area level linear mixed model)  Cross validation (unit and area level linear mixed model)  Model diagnostic (Istat): ???  Bias diagnostic  Goodness of fit diagnostic  Coverage diagnostic  Calibration diagnostic Deadline: beta version 15 August, final version 30 September 2011

9 9 Task 4.4: Collection of existing routines and/or development of new routines  R-style documentation Deadline: 31 August 2011  Manual with small example Deadline: 30 September 2011  R-package?