Gridded Population of the World (GPW, v3) - Lessons Learned? W. Christopher Lenhardt CIESIN – Columbia University 27 May 2005 © 2005. The Trustees of Columbia.

Slides:



Advertisements
Similar presentations
Population Modeling Methods and Projects: Implications for Data Use Greg Yetman CIESIN, Columbia University GEO Meningitis.
Advertisements

European Research Policy: from coordination and cooperation to integration and the ERA Dr. Maria Nedeva MIoIR, MBS. The University of Manchester EULAKS.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY Budhendra Bhaduri Overview of Geospatial Computing at ORNL Geographic Information Science.
World Data Center for Human Interactions in the Environment 1 The Earth Institute at the Lamont Campus Director, Jeffrey Sachs Director, Mike Purdy Director.
GEOSS Architecture Implementation Pilot (AIP-5) Kickoff Workshop Sneha RaoRobert S. ChenSri Vinay SOCIO-ECONOMIC DATA AND APPLICATIONS CENTER (SEDAC),
Overview of SEDAC Data Products and Services and Terra Viva Data Viewer Dr. Malanding S. Jaiteh GIS Specialist Socioeconomic and Data Applications Center.
Gridded Population of the World Version 2: 1995 UN adjusted population density Gridded Population Workshop May 2-3, 2000.
GIS Overview. What is GIS? GIS is an information system that allows for capture, storage, retrieval, analysis and display of spatial data.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Global and continental population databases “Supply side view” What has been done Related developments Possible next steps.
Systems Oceanography: Observing System Design. Why not hard-wire the system? Efficiency of interface management –Hard-wire when component number small,
PAGE # 1 Presented by Stacey Hancock Advised by Scott Urquhart Colorado State University Developing Learning Materials for Surface Water Monitoring.
Joint Research Centre GLC2000 First Results Workshop Ispra 18 th to 22 nd March 2002 Project overview.
Data Quality Data quality Related terms:
Shuming Bao China Data Center University of Michigan Spatial Intelligence for Demographic and Economic Information of China.
ASSESSING THE LOCAL EARTHQUAKE RISK Justin Czarka, Lehman College, CUNY – May 2013 Agung Swastika/AFP/Getty Images.
Second High Level Forum on GGIM Seminar on Regional Cooperation in Geospatial Information Management Doha, Qatar, 7 February 2013 Overview on Geospatial.
Integration of Statistical and Spatial Information for Data Dissemination in Cape Verde United Nations Regional Workshop on Data Dissemination and Communication.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Using Geographic Information Systems (GIS) as.
2 TerraViva! SEDAC Software produced by ISciences LLC based in Ann Arbor MI Provides visualization of SEDAC and remote sensing/ biophysical datasets in.
Adaptive Kernel Density in Demographic Analysis Richard Lycan Institute on Aging Portland State University.
Igor Kuzma, Statistical Office of the Republic of Slovenia Tomaž Žagar, Geodetic Institute of Slovenia GIS Portal – dissemination of geostatistics
Geography and Environment 24/7 Population modelling for natural hazard assessment Alan Smith University of Southampton, UK Colloquium on Spatial Analysis,
Copyright 2010, The World Bank Group. All Rights Reserved. COVERAGE, FRAMES & GIS, Part 2 Quality assurance for census 1.
2011 CENSUS Coverage Assessment – What’s new? OWEN ABBOTT.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
1 POPULATION PROJECTIONS Session 8 - Projections for sub- national and sectoral populations Ben Jarabi Population Studies & Research Institute University.
5 Marzo 2007 Census mapping and Gis Part II: dissemination Fabio Crescenzi Istat, Central Directorate on General Censuses UNECE Training Workshop on Census.
Dataset Development within the Surface Processes Group David I. Berry and Elizabeth C. Kent.
1 Archiving Michael J. Levin Harvard Center for Population and Development Studies
New and easier ways of working with aggregate data and geographies from UK censuses Justin Hayes UK Data Service Census Support.
US BENEFITS. It Addresses Priorities The US and Canada have common scientific, economic and strategic interests in arctic observing: marine and air transportation.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
Gridding global male and female populations: New data from the Gridded Population of the World (GPW) Erin Doxsey-Whitfield Susana B. Adamo Kytt MacManus.
Integrating a gender perspective into environment statistics Workshop on Integrating a Gender Perspective into National Statistics, Kampala, Uganda 4 -
1 Land accounts in Europe – current state and outlook Land accounts 01/10/2015 Daniel Desaulty
Welcome to the PRECIS training workshop
A Compendium of Selected Educational Resources for You and Your Students, Part 2 CIESIN.
NSIDC—Enhancing NASA’s Contribution to Polar Science A response to the NRC Polar Research Board’s review of NASA’s polar geophysical data sets Mark Parsons.
Unit 6: Population Distribution & Growth World Geo 3200/3202 May 2011.
Methods of Statistical Analysis and Dissemination of Census Results in Guyana MORGAN CLITUS DIAS SENIOR CARTOGRAPHER BUREAU OF STATISTICS GEORGEOWN,GUYANA.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
Mr. Walsh- Geography. What is geography? 1. The spatial perspective: how human activities are organized in space and how they relate to the natural environment.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Global Data Integration CRED Workshop October 26, 2009 Greg Yetman World Data Center for Human Interactions in the Environment.
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
Lessons Learned from the production of Gridded Population of the World Version 4 (GPW4) Columbia University, CIESIN, USA EFGS October 2014.
The evolution of the England and Wales census in a European context Garnett Compton, ONS RSS Conference, 9 September 2015.
Geocoding and Georeferencing
SESSION 9 PLANNING AND IMPLEMENTING A CENSUS GEOSPATIAL PROGRAMME
SciDataCon 2016 – 13 – September
World Conference on Climate Change October 24-26, 2016 Valencia, Spain
Urbanization and Development: Is LAC Different from the Rest of the World? Mark Roberts (GSURR, World Bank), Brian Blankespoor (DEC-RG, World Bank),
Markus Erhard European Environment Agency (EEA) 1. Introduction:
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
SciDataCon September, 2016 Greg Yetman Kytt MacManus
W. Christopher Lenhardt
Working with your archive organization Broadening your user community
CyberGIS: Reston, VA, September 22, 2018
Latest work on regional statistics and analysis at OECD
Repository Platforms for Research Data Interest Group: Requirements, Gaps, Capabilities, and Progress Robert R. Downs1, 1 NASA.
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
Technical guidance for grid based provision of data for MSFD reporting
  1-A) How would Arctic science benefit from an improved GIS?
My name is VL, I work at the EEA, on EA, and particularly on developing a platform of exchange which aims at facilitating the planning and development.
Geo-enabling the SDG indicators – experiences from the UN Global Geospatial Management and the GEOSTAT 3 project Agenda item 12 Ekkehard PETRI – Eurostat,
Presentation transcript:

Gridded Population of the World (GPW, v3) - Lessons Learned? W. Christopher Lenhardt CIESIN – Columbia University 27 May 2005 © The Trustees of Columbia University in the City of New York

Overview Background –A few words about CIESIN and SEDAC –What is GPW –How is it created –History of GPW A Few Applications What Have We Learned –Challenges Responses –Implications

And now few words from our sponsor… What is the Center for International Earth Science Information Network? –Part of the Earth Institute at Columbia University –Interdisciplinary mission to support research on human interactions in the environment CIESIN’s Socieconomic Data and Applications Center (SEDAC) one of NASA’s Distributed Active Archive Centers (DAACs), part of NASA’s Earth Observing Data and Information System.

We have a pretty good idea how many people there are… “USA Today has come out with a new survey: apparently, three out of every four people make up 75% of the population.” -- David Letterman

However: We’d like to know where are the people The “Where’s Waldo?” problem –(And Waldo 在哪里是 ? and Wo ist Walter? and Où est Waldo ? and Где Waldo? and so on…

GPW: Inputs Inputs are relative simple –Population of administrative areas, usually in census years –Spatial boundaries of administrative areas Population and boundary data must match –Best available & ‘match-able’ data are used Matching the inputs to one another is not as easy as it might seem –Boundaries change often and come in different scales –Population data may not match boundaries We may have population values for different years at different levels (e.g., district-level one year, state-level another) –Population and boundary data may not match themselves

Source Data Characteristics Commercial, government and other institutional sources –over 150 sources –roughly 100 data suppliers Information is often missing –Projection of spatial data –Relationship between new and old administrative units –Basic metadata Implications: –Educated guesswork is sometimes the best we can do! –Limits on redistribution of input data We haven’t made redistribution of the inputs a priority In some instances, the data we have are propriety so that we cannot re-disseminate them Even where we could, our notes on assumptions used to clean the input shape files are often less clean than we would like for a public release project

What do the input data look like? Population data: –Paper tables –Numbers in digital reports (e.g., pdf) –Digital tables (e.g., xls) –In digital file attached to spatial data (e.g., shp file) Spatial data –Paper maps –Digital images –Digital maps

Data acquisition From known sources –Established data providers (without personal connection) Census Bureaus –Rely on a network on like-minded associates UN agencies, The World Bank, Regional institutions, In-country collaborators Alternative sources – requests and occasional phone calls to census offices and geographic units of governments in far away places –Tourist and assorted other maps occasionally valuable for island nations –Opportunistic

Data conditioning: a spatial example + =

Spatial data preparation Clean boundaries –E.g., remove slivers Make them consistent across borders and coasts –Use international standard—the Digital Chart of the World (DCW) — with exceptions Europe—most spatially data supplied by one agency (SABE – Seamless Administrative Boundary for Europe) and all international boundaries are internally consistent –Coastlines matched to DCW, except where much higher quality data are supplied E.g., Indonesia Data table needs to include the same variables, with the same variable names, formats, etc.

Data conditioning: A population example  Places highlighted in yellow are new municipios  Need to find where they came from & their pop size  Use on-line atlases or newer maps, when available  Add new pop to unit of origin or allocate old population to new unit proportionally.

After the data are conditioned That is, there is a clean, consistent, spatial data file with a table of data with the expected content in the expected form—that is, iso.shp –— it’s time to grid!

Gridding Algorithm Proportional allocation used to spread the population over grid cells Virtually all data work completed on vector data –Gridding is the last step National grids created, global grids assembled by adding national grids together –Country grids are created with collars so that they start and end on even degrees; therefore the assembly of the grids without interpolation is possible –Replacement of country-specific grids feasible

Area 16.1 km 2 Pop = * ,118.9 persons Area 2.6 km 2 Pop = * 2.6 1,634.1persons Area 0.05 km 2 Pop = * persons Cell by cell…

Version (pub)GPW v1 (1995)GPW v2 (2000)GPW v3 (2003) Estimates for , , 1995, 2000 Input units19,000127,000~ 350,000 globally 102,000 units in Africa GPW History: Ten Years of Progress

Lots of hard work, was it worth it? Mean resolution in km =

GPW limitations Population estimation is not time-varying –Census measure, e.g., usual residence –One point in time, not where do work how do you commute where might you spend significant time Resolution may be too coarse for some applications –E.g., estimates of coastal population within a 50 km buffer No formal delineation of urban areas or other features –except those that may be deduced from population density –No other demographic variables (age, gender, etc)

Applications of GPW How many people live near the coast How many people live next to volcanoes Relationships between population and ecosystems Incorporated into a model to estimate potential risks for space vehicle re-entry

Ecosystems and population density Red/pink = Coastal Green = Mountainous Beige/brown = Drylands –See Millennium Ecosystem Assessment for more details on system classifications

What have we learned and what does it mean for users Challenges Responses Implications for Users

Challenges/Responses Lots and lots of files both inputs and outputs -> How to manage? How to document? Granularity –At what level should be provide metadata: global, continental, national, sub-national? Continued need for access to the best available data –Humanitarian and other types of disasters highlight the need to provide better access to existing data sources and may serve to ‘shake loose’ previously unavailable data at least on a temporary basis

Challenges/Responses (cont.) Capture –Data analysis gap (!) –User Workshop results Provide an integrated information system for users to access –Data –Metadata and documentation –Graphics –Citations Need for ‘continuous process improvement’ in terms of data management and documentation –Work as a team, involve data managers, documentation specialists, archival specialists all along the way

Implications for Users Better get it right given the potential uses for things like hazard risk estimation and other policy questions Data quality review –Review process (alpha -> beta -> production) –Rely on our users to discover errors More and better data –Development of derivative products –Continued need to work on cross-national research to harmonize social science data (methods, variable operationalization, and so on) Need to think more how ‘cyberinfrastructure’ can further the social science and interdisciplinary research agendas (integrating IT and science for a new paradigm for scientific research) –Address issues of ontologies, epistemological differences, scale and resolution

Thank you!