DA Task Report
Report by Rick Lawford and Toshio Koike
ADC Meeting, September 2008, Boulder
DA Task Report (I) (September 2008)

Short background and objective for DA-07-06:
It is expected that there will be a large increase in the volume of Earth Observation data in the next few years. In addition to distributed data archives and integration systems, data management facilities will be used for diverse and large-volume Earth Observation data from inhomogeneous information sources, in cooperation with existing data centres. This Task will establish alliances among existing data centres for effective management of large volumes and diverse types of Earth Observation data.
- The system development shall be coordinated with providers of satellite and in-situ data and model outputs, including their metadata.
- System functionalities from input through processing, archiving, and dissemination, including reprocessing, analysis and visualization, shall be coordinated with wide-ranging user communities.
- Case studies shall be designed as demonstration projects for at least 3 SBAs.
DA Task Report (I) (September 2008)

Primary activities:
The Task involves developing an inventory of all large data Centres, Systems, and Projects which could contribute to GEOSS alliances. This is being done in three phases:
- Phase 1: A test survey of active centres to determine what information is useful to characterize the centres and their potential contributions to alliances.
- Phase 2: A survey of active centres in one SBA (water) to characterize the centres and their potential contributions to alliances.
- Phase 3: A survey of active centres in all SBAs to characterize the centres and their potential contributions to alliances.
Results of these surveys and inventories of centres will be maintained at the University of Tokyo. The Task will also develop a prototype and evaluate how the concept can best be implemented.
DA Task Report (II) (September 2008)

Milestones achieved to date:
- The questionnaire for surveying centres was prepared and distributed to 11 centres.
- Responses have been received from 7 centres (with several more promised).
- A preliminary analysis of the responses has been undertaken.
- A schedule of regular teleconference calls has been established, with 8 calls held since January.
- A draft white paper on data integration has been prepared.

Expected milestones for 2008/2009:
- The Phase 2 survey of data centres will be completed.
- A prototype of a data integration system combined with user needs will be introduced.
- A workshop is being planned for establishing a small-scale alliance between Japan (DIAS) and Europe (FP-7).

Extent of participation / recommendations for additional participants:
- The number of large data centres participating in this project must be expanded to ensure it is a success. National, regional and global centres are being targeted for Phase 2.
Core Functions of Data Integration and Analysis System (DIAS)

[Diagram; labels recovered from the slide:]
- Application areas: Climate; Water cycle; Ecosystem
- Creation of success model: Insurance and sanitary data schema; Water cycle data schema; Land utilizing data schema
- Data Interoperability: Ontology; Metadata Schema
- Large capacity and diversification: Lifecycle Data Management; Storage Capacity over 1 PB
- Advanced User Interface
- Data mining
- Data Integration
[Diagram; labels recovered from the slide:]
- Inputs: Numerical Climate Prediction Model; Satellite; River Management data; Reference Site Data; GIS/Basin Info.
- Support of data interoperability: Ontology of dictionary; Ontology of Geographic information
- Processing: Satellite Data Assimilation; Distributed Hydrological Model; Operation Optimization
- Outputs: Water Resource Management; Flood Prediction; Heavy Rainfall Prediction; Flood/Inundations → Evacuation Instruction
GENESI-DR collaboration platform
(Slide from the General Assembly, DLR, Oberpfaffenhofen, 18-20 June 2008)

[Diagram; labels recovered from the slide: Repository; On Demand product]

GENESI-DR project goals:
- To provide a base for (establishing) a world-wide e-infrastructure for Earth Science repositories
- To provide reliable, easy, effective, and operational access to a variety of data sources (space and ground)
- To harmonise operations at key Earth Science data repositories
- To demonstrate effective curation and prepare the frame for long-term preservation
- To validate capabilities to access distributed repositories for involving new communities, including education…
- To integrate new scientific and technological derived paradigms
Centres, Systems and Projects that completed the Phase 1 Survey:
1. WDC for Glaciology and Geocryology (Lanzhou, China)
2. Data Integration and Analysis System (DIAS) (University of Tokyo, Japan)
3. GENESI-DR: Ground European Network for Earth Science Interoperations – Digital Repositories (ESA, Italy)
4. The Global Observing Systems Information Center (GOSIC) (NOAA, USA)
5. Global Runoff Data Centre (GRDC) (Germany)
6. World Data Center Climate (WDCC) (Germany)
7. World Data Center for Glaciology (CIRES, USA)
Very Preliminary Observations:
1. Most data centres already have a network of collaborators.
2. Some centres focus on acquiring data as a national archive while others focus on distributing data.
3. All centres appeared to be aware of GEO and the SBAs and, with one or two exceptions, they were engaged in GEO tasks.
4. Most centres provide both an automated and a human interface with users. Where data restrictions exist, centres are less likely to provide automated interfaces.
5. There are some common features among centres focused on research versus government centres focused on operational services.
6. Language may be an issue for accessing some sites.
DA Task Report (III) (September 2008)

Coordination points with other GEO work plan tasks:
- This task is being coordinated with SBA tasks that require a high level of data integration such as Task WA

Potential risks that may impede completion of task:
- None identified to date. However, GEO support will be critical to ensure that data centres provide the inputs to the survey and are favourably disposed to participating in the alliance(s).

Gaps that have been identified that need to be addressed independently:
- Resources are being provided by JAXA, RESTEC and the University of Tokyo. Support from GEO will be needed for the workshops.

Potential contributions to the GEO Portal:
- The data centre registry being developed for the University of Tokyo will be linked to the user community through the GEO Portal.
DA Task Report (IV) (September 2008)

Expected products:
- A registry of data centres.
- A pilot data centre alliance that covers satellite and in-situ data centres, NWP Centres, ICSU Data services and research data centres.
- A framework for future alliances.
- Reports from several workshops.

Expected date of completion of task:
- September (due to the late start on this Task, the deadline has been extended).

Expectations of task lead from ADC:
- None identified.