Data Processing Hollerith 1921

Slides:



Advertisements
Similar presentations
19-20 September 2013, IBGE, Rio de Janeiro, Brazil
Advertisements

Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
March 2002 DEVELOPING SERVICES IN AN EVOLVING TECHNOLOGICAL AND POLITICAL ERA: CASE STUDY Fred Guy Director of ICT National Library of Scotland.
National Digital Information Infrastructure and Preservation Program (NDIIPP) Data-PASS/NDIIPP: A new effort to harvest our history A funder view May 25,
TRAC / TDR ICPSR Trustworthy Digital Repositories.
U.S. Department of the Interior U.S. Geological Survey National Geospatial Technical Operations Center Towards a More Consistent Framework for Disseminated.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
Archiving our Social Science Digital History ECURE 2005 March 1, 2005.
Kansas Enterprise Electronic Preservation (KEEP) System.
Richard MARCIANO Chien-Yi HOU School of Information and Library Science (SILS) Sustainable Archives & Leveraging Technologies Group (SALT) University of.
RATIONALE The storage in a smart phone would cost (in 2011 dollars) $7,571 in 2001 $212,040 in 1991 $3,796,800 in 1981 $56,168,800 in 1971 $1,233,179,000.
New Generation SDI and Cyber-Infrastructure Prof. Guoqing Li CEODE/CAS March 29, 2009, Newport Beach, USA Presented to 4th China-US Roundtable Meeting.
Digital Enterprise Research Institute Digital Enterprise Research Institute – Connecting with Industry Mr. Michael Turley, CEO
Grid Interest Group Conclusion Prof. Nataliia Kussul, Space Research Institute NASU-NSAU, Ukraine WGISS-28, September 29, 2009 Pretoria, South Africa.
Advances in Cyberinfrastructure with a Focus on Data: a U.S. National Science Foundation Overview Alliance for Permanent Access to Records of Science in.
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
Kansas Enterprise Electronic Preservation (KEEP) System.
Fall 2002 DLF Forum RLG Cultural Materials DLF Forum Ricky Erway Digital Resources Manager, RLG.
World Data Center for Human Interactions in the Environment Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as.
13 September 2012 The Libraries’ Role in Research Data Management: A Case Study from the University of Minnesota Meghan Lafferty, Chemistry, Chemical Engineering,
Corral: A Texas-scale repository for digital research data Chris Jordan Data Management and Collections Group Texas Advanced Computing Center.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
James H. Butler, Acting Director NOAA Strategic Planning Moving NOAA into the 21 st Century Third GOES-R User Conference May 2004, Boulder, Colorado.
The Minnesota Data Harmonization Projects Bill & Melinda Gates Foundation Seattle, Washington May 21, 2014 Elizabeth Boyle, Miriam King, Matthew Sobek.
The American Geographical Society Library UW-Milwaukee Patti Day Reference Librarian for Digital Spatial Data American Geographical Society Library
Data Projects at the Minnesota Population Center Resources for Comparative Population and Health Research Seattle, Washington May 22, 2014 Elizabeth Boyle,
Lima, 9 de octubre de However, only 2.3% of Scientific Research is produced in Latin America -9% of World´s population is concentrated in Latin.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
1 Attachment A USGS EROS Data Center U.S. Geological Survey Department of the Interior EROS Data Center.
Digital Preservation Ontario Consortium of University Libraries (OCUL) Caitlin Tillman OCUL IR Chair With notes from Kathy Scardellato, OCUL Executive.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
TerraPop Mission Enabling research, learning, and policy analysis by providing integrated spatiotemporal data describing people and their environment.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
California Water Plan Update Advisory Committee Meeting January 20, 2005.
Aligning Digital Preservation Policies with Community Standards Nancy McGovern Digital Preservation Officer.
1 US National Spatial Data Infrastructure: Common Standards and System Interoperability GITA-JAPAN 14 th Conference 5 November 2003 Alan R. Stevens, PhD.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
SUBCOMMITTEE ON SEDIMENTATION Proposal to become a subgroup under the ACWI September 9, 2003.
Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
Using Analysis and Tools to Inform Adaptation and Resilience Decisions -- the U.S. national experiences Jia Li Climate Change Division U.S. Environmental.
ICPSR Data Fair November 8, 2010 Katherine McNeill, MIT Libraries
USGS EROS LCMAP System Status Briefing for CEOS
Robert R. Downs1and Robert S. Chen2
Outreach NSF 12-month Review September 24-25, 2012
Priorities and coordination of capacity building in Azerbaijan
IU Digital Library Program
DataNet Collaboration
Collaboration and Outreach
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
Global Statistical Geospatial Framework – interoperability challenges
W. Christopher Lenhardt
CyberGIS: Reston, VA, September 22, 2018
Terra Populus Data Domains
Michael P. Finn, Barbara S. Poore, and Mark R. Feller
Implementing an Institutional Repository: Part III
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
Reinforcing Statistical Cooperation at the Regional Level to
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Virtual organizations: Team Science, Team Shakespeare
Space Climate Observatory
Institutional Repositories
Hans Dufourmont Eurostat Unit E4 – Structural Funds
Nancy Y. McGovern Digital Preservation Officer, ICPSR IASSIST 2007
Hans Dufourmont Eurostat Unit E4 – Structural Funds
Presentation transcript:

Data Processing 1890-1964 Hollerith 1921 Hollerith Electric Tabulator 1890 IBM (Hollerith) System 360 1964

The price of storage has fallen by a factor of 12 million! Byte Magazine, July 1980 $1170 per MB, 2010 dollars Amazon.Com, January 2010 0.0095 cents per MB, 2010 dollars

The price of storage has fallen by a factor of 35 million! Byte Magazine, July 1980 $1,207 per MB, 2011 dollars Amazon.Com, September 2011 $0.000035 per MB, 2011 dollars

The storage in my new phone would cost (in 2011 dollars) $7,571 in 2001 $212,040 in 1991 $3,796,800 in 1981 $56,168,800 in 1971 $1,233,179,000 in 1961

Sustainable Digital Data Preservation and Access Network Partners (DataNet) 2007 National Science Foundation Solicitation $100 Million set aside for first phase DataNet Goal: provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over a decades-long timeline.

TerraPop Goals A framework for global-scale spatiotemporal data Census microdata Government land-use statistics Land cover data from satellite imagery Historical climate records (temperature, precipitation, cloud cover)

TerraPop Goals Provide global-scale data on population, land use, land cover, and climate from the 19th century to the present Interoperable across time and space Accessible to researchers and the public

TerraPop Goals Create a sustainable organization that can guarantee preservation and access over multiple decades Organizational sustainability Financial sustainability Technological sustainability Organization will be a collaboration with major data archives and universities and will eventually assume responsibility for IPUMS

Terra Populus Terra Populus geology transportation paleontology criminology hazards Population Climate Terra Populus pollution Land Use Land Cover health economics politics biology hydrography/ water resources

Sustainability Organizational sustainability Financial sustainability Technological sustainability

Organizational Sustainability Hybrid approach: University based like a library International membership base ultimately responsible for governance

Financial Sustainability University endowment ($1 million) University will migrate technology “indefinitely, as long as it is needed” Three archival partners contribute to preservation Institutional member subscriptions Grants and contracts

Technological Sustainability Software and hardware continuously renewed Virtualization/private cloud/evolvable cyberinfrastructure Preservation standards (ICPSR, CIESIN, CESSDA/UKDA)

TerraPop Time Dimension

Khartoum, CBS-Sudan

Dhaka, Bangladesh Bureau of Statistics 18

Three Formats of Integrated Data Census microdata with attached characteristics describing land use, land cover, and climate for local areas Aggregate data and for administrative districts with tabulated population data and environmental characteristics Gridded data with characteristics of population and environment

Timeline September 2011: Funding begins February 2013: Functional prototype July 2015: Application for years 6-10

Reduced Scope 60% reduction in budget Not a research project: Infrastructure project

Reduced Scope

Additions Data inventory: Inventory and evaluate land-use, climate, and small-area census data for inclusion in the infrastructure. Data acquisition: Collect and preserve selected datasets relating to population and the environment. Boundary files: acquire or develop files, account for boundary changes

Organization Scientific Leadership Group Executive Committee Policy & Planning Committees Sustainability Preservation and Protection Data Acquisition Information Technology Committees User Interface Data Processing Scientific Web Communities Advisory Board

Scientific Leadership Group Steven Ruggles, Principal Investigator/Program Director. Victoria Interrante, Co-Principal Investigator Steven Manson, Co-Principal Investigator Jaideep Srivastava, Co-Principal Investigator Shashi Shekhar, Co-Principal Investigator Jonathon Foley, Director of the Institute on the Environment, University of Minnesota Wendy Pradt Lougee, University of Minnesota Librarian Robert Chen, Director, Center for International Earth Science Information Network, Columbia University George Alter, Director, Inter-University Consortium for Political and Social Research

Executive Committee Cathy Fitch, Executive Director Matt Sobek, Director of Microdata Integration Pete Clark, Director of Software Development Dave Van Riper, Director of Spatial Data TerraPop Project Manager

Sustainability Steve Ruggles Wendy Lougee George Alter Bob Chen Cathy Fitch

Preservation and Protection Steve Ruggles George Alter Nancy McGovern Bob Downs John Butler Matt Sobek Cathy Fitch

Data Acquisition Dave Van Riper Steve Manson Jon Foley Bob Chen Susana Adamo Bob Downs

User Interface Pete Clark Vicki Interrante Jaideep Srivastava Steve Manson Steve Ruggles

Data infrastructure Pete Clark Matt Sobek Steve Ruggles Jaideep Srivastava Shashi Shekhar

Scientific Web Communities Cathy Fitch Pete Clark John Butler Jaideep Srivastava Loren Terveen