Next Generation Archives: The NC Geospatial Data Archiving Project Jeff Essic Geospatial Data Services Librarian North Carolina State University Libraries.

Slides:



Advertisements
Similar presentations
GeoMAPP Business Planning: Developing Materials to Get Stakeholder Buy-in Alec Bethune, North Carolinas Center for Geographic Information and Analysis.
Advertisements

GeoSpatial MultiState Archive and Preservation Partnership State and Local Agency Geospatial Resources Content Transfer, Demonstration, and Learning Project.
NDIIPP Project Update NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries North Carolina Center for Geographic Information.
The Disappearing Data Problem: Preserving Today's Geospatial Data to Meet Tomorrow's Temporal Analysis Needs Steve Morris Head of Digital Library Initiatives.
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Preservation of Geospatial Data Kenny Ratliff, DGI and Glen McAninch, KDLA April 22, 2008 Digital Technology Summit Lexington, Kentucky.
Identification, Selection, and Appraisal within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital.
NATIONAL STATES GEOGRAPHIC INFORMATION COUNCIL 2105 Laurel Bush Rd. Suite 200 Bel Air, MD GIS Inventory powered by Ramona.
Archiving State and Local Agency Digital Geospatial Data: An Overview of the Problem Area Steven P. Morris Head of Digital Library Initiatives North Carolina.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
2006 ESRI International Users ConferenceAugust 8, 2006 Spatial Data Infrastructure and Data Preservation in North Carolina Jefferson F. Essic, Robert Farrell,
North Carolina Geospatial Data Archiving Project (NCGDAP) Project Overview Partnership –University library (NCSU) and state agency (NCCGIA) –$520,000 funding,
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
NCSU Libraries Ingest Workflow Issues: Metadata North Carolina Geospatial Data Archiving Project Steve Morris North Carolina State University Libraries.
Content and Practice: Background to the NC Geospatial Data Archiving Project Steve Morris NCSU Libraries.
Interoperability ERRA System.
Twenty Years of Spatial Vision, But What Does 1987 Look Like in Your GIS? – Emerging Issues, Hindsight and Insights from the NC Preservation Partnership.
Collection and Preservation of At-Risk Digital Geospatial Data: NDIIPP Project Update on the NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris.
State Presentation Multi-State Geospatial Partnership Kick-off Meeting Salt Lake City, Utah January 23, 2008.
Chasing Mayflies Archiving Geospatial Data Linda Zellmer Government Information & Data Services Librarian Western Illinois University
Copyright © 2008, Open Geospatial Consortium, Inc., All Rights Reserved. NDIIPP Partnership Update: North Carolina and Multi-state Demonstration Projects.
State and Local Agency Digital Geospatial Data Preservation The North Carolina Experience Steve Morris NCSU Libraries Earth Sciences Information Partners.
North Carolina Geospatial Data Archiving Project (NCGDAP) JISC/NDIIPP Joint Digital Preservation Workshop – May 2006 Presented by: Rob Farrell, Steve Morris,
Putting time into the GeoWeb: Data persistence in a web services environment Steve Morris NCSU Libraries July 23, 2008.
ESRI User Conference, August 8, 2006 Long-term archiving of geospatial data: the NGDA project Julie Sweetkind-Singer John Banning Stanford University.
Preservation of Digital Geospatial Data: Challenges and Opportunities Steve Morris Head of Digital Library Initaitives North Carolina State University.
The North Carolina Geospatial Data Archiving Project Steven P. Morris North Carolina State University Libraries Maintaining Long-Term Access to Geospatial.
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
Are Geodatabases a Suitable Long-Term Archival Format? Jeff Essic, Matt Sumner North Carolina State University Libraries 2009 ESRI International Users.
Collection Building Processes within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library.
OGC ® © 2006 Open Geospatial Consortium, Inc.1 Introduction to Archives and Geospatial Issues ( Continued ) Steve Morris Head, Digital Library Initiatives.
Metadata Handling in the North Carolina Geospatial Data Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives Rob Farrell Geospatial.
GeoMAPP Project Overview and Conclusions Alec Bethune- NC Center for Geographic Information and Analysis Matt Peters- Utah Automated Geographic Reference.
Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Steve Morris Head of Digital Library Initiatives NCSU Libraries.
Preserving State and Local Government Digital Geospatial Data Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head.
Long-Term Preservation of At- Risk Digital Geospatial Data: A Cooperative Agreement with Library of Congress Steve Morris NCSU Libraries Zsolt Nagy NC.
GeoMAPP: Using Metadata to Help Preserve Geospatial Content Matt Peters, Utah’s Automated Geographic Reference Center Glen McAninch, Kentucky Department.
Preserved Digital Content: Value to Public Policy Decision Making Now and in the Future NC Geospatial Data Archiving Project (NCGDAP) North Carolina State.
Preservation of Coastal Community Geospatial Content: What's Your Long Term Care Plan For Aging Data? Jeff Essic North Carolina State University Libraries.
North Carolina Geospatial Data Archiving Project : Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Partners: NCSU.
Collection and Preservation of At- Risk Digital Geospatial Data: the North Carolina NDIIPP Project Partners: NCSU Libraries Project Lead: Steve Morris.
NCPMA Fall MeetingOctober 11, 2006 GIS Data Preservation: Partnership with Library of Congress Steve Morris North Carolina State University Libraries.
NCSU Libraries 9 October 2006 EPA Meeting Preservation Partnership with Library of Congress: NDIIPP and the North Carolina Geospatial Data Archiving Project.
Long-term preservation of digital geospatial data: challenges for ensuring access and encouraging reuse Anne Robertson, EDINA & Steve Morris, NCSU Libraries.
Archiving Geospatial Data: Background to the Problem Area State Government Users Committee October 16, 2008 Steve Morris, NCSU Libraries.
ESRI International Users ConferenceJune 20, 2007 Data Snapshot Archiving: A Frequency of Capture Survey Steve Morris Jeff Essic North Carolina State University.
Preserving Geospatial Data: Challenges and Opportunities Steve Morris NCSU Libraries Indo-US Workshop on Trends in Digital Preservation March 24, 2009.
Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.
Geospatial Data Preservation Challenges at the Sub-National Level: The North Carolina Experience Steve Morris Head of Digital Library Initiatives North.
…..Kansas Department of Revenue – Property Valuation Division – Kansas GIS Policy Board - DASC ….. Statewide Tax Units Database A collaborative partnership.
NCSU Libraries 13 June 2006 JCDL 2006 NDIIPP Preservation Network: Progress, Problems, and Promise Jim Tuttle, Geospatial Data Librarian.
NDIIPP Project: North Carolina Geospatial Data Archiving Project Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic Information.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at- risk digital geospatial data Partners: NCSU Libraries Project.
GISC Seminar: Towards Uncharted GroundSeptember 29, 2006 North Carolina Partnership with Library of Congress on Long-term Preservation of Digital Geospatial.
NDIIPP Project: Collection and Preservation of At-Risk Digital Geospatial Data Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic.
City of Woodcreek, TX Project Proposal GEO 4427 Advanced GIS II.
The Disappearing Data Problem Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Models for Shared Responsibility: Collaboration and Engagement with the NCGDAP and GeoMAPP Partnerships Steve Morris North Carolina State Libraries Zsolt.
Mountain Region GIS Advisory Council Meeting September 15, 2006 Long-Term Preservation of Digital Geospatial Data: A Cooperative Project with Library of.
Library of Congress Partnerships for Managing Geospatial Data North Carolina Geographic Information Coordinating Council Raleigh, NC November 7, 2007 William.
Preservation Strategies in the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at-risk digital geospatial data Partners: NCSU Libraries NC Center.
Overview: GeoMAPP Appraisal Efforts NDSA Geospatial Working Group| 27 June 2012 |
Preservation of State and Local Government Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steven P. Morris, James Tuttle,
Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.
Long-Term Preservation of At-Risk Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steve Morris NCSU Libraries.
Update on Geospatial Data Preservation Efforts
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Preserved Digital Content: Collections, Value, and Stewardship NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries.
Presentation transcript:

Next Generation Archives: The NC Geospatial Data Archiving Project Jeff Essic Geospatial Data Services Librarian North Carolina State University Libraries NACIS 2008October 10, 2008

2 NC Geospatial Data Archiving Project (NCGDAP) Three year partnership between university library (NCSU) and state agency (NCCGIA), with Library of Congress under the National Digital Information Infrastructure and Preservation Program (NDIIPP) One of 8 initial NDIIPP collection building partnerships Focus on state and local geospatial content in North Carolina (state demonstration) Tied to NC OneMap initiative, which provides for seamless access to data, metadata, and inventories

3 NCGDAP Specifics Funding: $520,000 for $500,000 for 18 month extension Staff: 1.5 FTE at NCSU Approx. same at NCCGIA Website:

4 Selected Geospatial Data Archive Projects ProjectOrganizationsFunding Persistent Archives TestbedSan Diego Supercomputer Center, NARA NARA VanMapSan Diego Supercomputer Center Inter- PARES Geospatial Repository for Academic Deposit & Extraction EDINAJISC Geospatial Electronic RecordsCIESINNHPRC variousCarleton Universityvarious National Geospatial Digital Archive UC Santa BarbaraNDIIPP Maine GeoArchivesState of MaineNHPRC

5 Tracking data, map servers, and web services since 2000 Ranked 3 rd in traffic among entry points to entire library website Persistent identifiers usage tracking ID links used in other sites Community help in site maintenance Project Roots: NCSU Libraries Data Directory

6 100 Counties in North Carolina County Map and Data Services in NC

Carrboro, NC : Population 17,797 (2005 est.) 24 downloadable GIS data layers 4 WMS data layers 6 web mapping applications 9 downloadable PDF map layers

8 Value in Older Data: Cultural Heritage Future uses of data are difficult to anticipate (as with Sanborn Maps)

9 Downtown Raleigh Near State Capitol 1914 Sanborn Map

10 Downtown Raleigh Near State Capitol 1993 DOQQ

11 Downtown Raleigh Near State Capitol 1999 Wake County Ortho

12 Downtown Raleigh Near State Capitol 2005 Wake County Ortho

13 Downtown Raleigh Near State Capitol 2005 Wake County Ortho Imagery = Durable Static Simple structure Mostly open formats Vector data = Volatile Frequent update Complex structure Mostly proprietary formats Downtown Raleigh Near State Capitol 2005 Wake County Ortho Imagery = Durable Static Simple structure Mostly open formats Vector data = Volatile Frequent update Complex structure Mostly proprietary formats

14 Geospatial Data Types – Cartographic GIS Software –Software project file (.mxd,.apr, …) –Data layer file (.avl,.lyr, …) PDF, GeoPDF map exports Web Services-based representations

15 Geospatial Data Types – Spatial Databases Vector, raster, and tabular data Relationships Behaviors Annotation Data Models

16 Other Geospatial Data Types – Place-based Data Street Views Oblique Imagery 3D Images Present-day value in location- based services and mobile applications Future value for cultural heritage, descriptions of places Tax Dept. Photos

17 Other Geospatial Data Types: Web 2.0 Mashups

18 Geospatial Data: Compelling Issues Dynamic content Constantly updated information Data versioning Digital object complexity Spatially enabled databases Complicated, multi-component formats Proprietary formats

19 Digital Preservation Points of Failure Data is not saved, or … can’t be found, or … media is obsolete, or … media is corrupt, or … format is obsolete, or … file is corrupt, or … meaning is lost

20 Risks to Geospatial Data Producer focus on current data Data overwrite as common practice Future support of data formats in question No open, supported format for vector data Shift to web services-based access Data becoming more ephemeral Inadequate or nonexistent metadata Impedes discovery and use Increasing use of spatial databases for data management The whole is greater than the sum of the parts

21 Preservation Business Case Land use change analysis Site location analysis Real estate trends analysis Disaster response Resolution of legal challenges Impervious surface change mapping

22 Business Case: Identifying Land Use Changes Use case: Land use and impervious surface change analysis

23

24 Geospatial Data Preservation Challenges Data Capture Backups are common, but not long-term archives Producer focus is on current data Shift to web services-based access Inadequate or Nonexistent Metadata Consistent NC survey stats: Only 40% of data producers create and maintain metadata

25 Challenge: Vector Data Formats No widely-supported, open vector formats for geospatial data Spatial Data Transfer Standard (SDTS) not widely supported Geography Markup Language (GML) – diversity of application schemas and profiles a challenge for “permanent access” Spatial Databases The whole is more than the sum of the parts, and the whole is very difficult to preserve Can export individual data layers for curation, but relationships and context are lost

26 Challenge: Digital Object Complexity Files Multi-file dataset Georeferencing Metadata file Symbols file Additional documentation License Disclaimer More Metadata FGDC Acquisition metadata Transfer metadata Ingest metadata Archive rights Archive processes Collection metadata Series metadata Metadata Exchange Format (MEF) in GeoNetwork a form of content packaging

27 Challenge: Cartographic Representation Counterpart to the map is not just the dataset but also models, symbolization, classification, annotation, etc.

28 Other Challenges Rights management Data versioning Semantic issues Content Packaging Large scale content transfer Integrating older analog materials More …

29 Different Ways to Approach Preservation Technical solutions: How do we preserve acquired content over the long term? Cultural/Organizational solutions: How do we make the data more preservable—and more prone to be preserved—from point of production? Current use and data sharing requirements – not archiving needs – are most likely to drive improved preservability of content and improvement of metadata

30 Question: Frequency of Capture? Content Exchange – Getting Data in Motion Repository Development Repository of Temporal Data Snapshots

31 Frequency of Capture Issue: How frequently should county and municipal vector data layers be captured in archives? Parcels, centerlines, jurisdictions, zoning, … Parcel Boundary Changes , North Raleigh, NC

32 Frequency of Capture Surveys How often should continually changing vector datasets be captured? Tap into data custodian understanding of production patterns and uses Tap into local innovation Learn about local business drivers for data archiving 2006 and 2008 surveys of NC cities and counties 2008 survey of archival practice in state agencies in NC Planned survey of data users in NC

33 FOC 2006 Survey Results: Overview 58% response, two-thirds of whom create and retain periodic snapshots Long-term retention more common in counties with larger populations Storage environments vary, with servers and CD- ROMs most common Wide variation in frequencies of capture. Offsite storage (or both onsite and offsite) is used by nearly half of the respondents Popularity of historic images has resulted in scanning and geo-referencing of hardcopy aerial photos among one-third of the respondents

34 Content Exchange Infrastructure High volume of state/federal requests for local data Solving the present-day problems of data sharing is a pre-requisite to solving the problem of long-term access Leveraging more compelling business reasons to put the data in motion (disaster preparedness, business continuity, highway construction, census, …) Content exchange networks: Minimize need to make contact Add technical, administrative, descriptive metadata Establish rights and provenance

35 Content Exchange Infrastructure Nov. 2007: NC Geographic Information Coordinating Council (GICC): Ten Recommendations in Support of Geospatial Data Sharing released Recommendation: “Establish archive and long term data access strategies” Suggested best practices include: “Establish a policy and procedure for the provision of access to historic data, especially for framework data layers.” upportofGeospatialData/tabid/156/Default.aspx

36 Getting the Data in Motion Harvesting use cases for older data as part of outreach Survey of current archiving practice among NC counties and municipalities

37 Most costly part of archive development is identifying, negotiating acquisition, and then transferring data Important Objectives Minimize Direct Contact Document Data Clarify Rights Routinize Transfers Leverage other business uses that put data in motion: Continuity of operations Highway Planning Floodplain Mapping Getting the Data in Motion

38 Getting the Data in Motion Orthophoto Data Distribution System – “sneakernet” Transfer of large quantities of imagery Street Centerline Data Distribution System Efficient transfer of data from 100 counties, with metadata and clarified rights NC GIS Inventory Efficient data identification Adding preservation elements NC OneMap Data Download and Viewer Public access Data visualization

39 Repository Development Downloading or acquiring “low hanging fruit” Tapping into current data flows Developing our own metadata when necessary Converting and preserving vector data in shapefile format

40 Data Preservation Complex data representations can be made more preservable (yet less useful) through simplification. Conversion of various formats to shp Image outputs (web services, PDF maps, map image files) Very hard to preserve: Software project files Symbol sets What about symbology meanings? Layer definitions Web service or API interactions

41 Desiccated Data: PDF and GeoPDF Cartographic outputs – analogous to paper maps Combine Datasets Data models Classification Symbolization Annotation More data intelligence than in simple images

42 Desiccated Data: PDF and GeoPDF Explosion of geospatial PDF content in past few years Standards issues GeoPDF: TerraGo technology has withdrawn patent claim and is approaching OGC about open standards process PDF: open ISO standard with subset of geospatial functionality in ISO PDF standard part 2 Open PDF variants created through ISO standards process (PDF/E, PDF/X, PDF/A, …) PDF content retained in addition to, NOT instead of data

43 Cartographic Preservation Side Project: 1:500,000 – 1:2.5 M 1:31,680 – 1:430,000 1,200 – 24,000 Scanned, georeferenced, and compressed over 286 NC geologic maps, in cooperation with NC Geologic Survey

44 Repository Status Acquired 6+ TB of data with more on the way Disk space being used initially for “data staging” Inventorying In the process of ingesting content into DSpace Metadata generation

45 Engaging Spatial Data Infrastructure Cultural/Organizational solutions: How do we make the data more preservable—and more prone to be archived—from point of production? Engage and outreach to the data producer community and SDI Sell the problem to software vendors and standards development Find overlap with more compelling business problems: disaster preparedness, business continuity, road building, etc. Discuss roles at the local, state, and federal level

46 Data inventories support content identification Metadata standards support discoverability and use Content standards support data interoperability over time and help eliminate semantic confusion Data exchange networks: Minimize need to make contact Add technical, administrative, descriptive metadata Establish rights and provenance SDI Role in Data Preservation

47 NC Spatial Data Infrastructure: NC OneMap Next generation mechanism to coordinate and disseminate geographic information in North Carolina and interact with the NSDI. NC GICC Inventory for all geospatial data holdings – Develop content standards for key data themes One of the defined characteristics of NC OneMap is that “Historic and temporal data will be maintained and available”.

48 Archival and Long Term Access Working Group Initiated by NC Geographic Information Coordinating Council in 2008 to address growing concerns of state and local agencies about long-term access to data Federal, state, regional, and local agency representation Key focus Best practices for data snapshots and retention State Archives processes: appraisal, selection, retention schedules, etc. Valuable outcome of NCGDAP – multiple parties and levels discussing data archiving on their own.

49 Archival and Long Term Access Working Group Final Report to be presented to GICC in Nov. Best Practices for: Archiving Schedule Inventory Storage Medium Formats Naming Metadata Distribution Periodic Review Data Integrity Publicity

50 NDIIPP Multi-State Geospatial Project Lead organizations: North Carolina Center for Geographic Information & Analysis (NCCGIA) and State Archives of NC Partners: Leading state geospatial organizations of Kentucky and Utah State Archives of Kentucky and Utah NCSU Libraries in catalytic/advisory role State-to-state and geo-to-Archives collaboration 2 year project: Nov Dec Archives as part of Spatial Data Infrastructure

51 OGC Data Preservation Working Group Formed Dec Engage archival community Find points of intersection with other OGC activities: GML for archiving Content packaging Large scale data transfers Time in decision support

52 Cultural: Changing Industry Thinking Is the geospatial industry “temporally-impaired?” Lack of access to older data Lack for tool/model support for temporal analysis Metadata: poor support for changing data Education: building class projects around available data (i.e., not temporal) Increased interest now in temporal applications? Increased demand for temporal data? Improved tool support: ArcGIS 9.2 animation tools; Geodatabase History, etc. Emerging commercial market in older data

53 “Supporting temporal analysis requirements” gets more attention than “archiving and preservation” Leverage existing infrastructure Current data sharing needs drive infrastructure improvements that help archiving Leverage business needs that are more compelling than preservation (e.g., continuity of operations) Facilitate stakeholder ownership of the solutions Mine state and local archiving innovations Conclusions

54 Slide Presentation: Steve MorrisJeff Essic Head, Digital Library InitiativesGeospatial Data Services LibrarianNCSU Libraries ph: (919) ph: (919)