NERC DataGrid: Googling for Secure Data Introduce myself and Wendy and ask them to contact her. Bryan Lawrence on behalf of the NDG, BADC and BODC. Ray Cramer, Marta Gutierrez, Kerstin Kleese, Siva Kondapalli, Sue Latham, Roy Lowry, Kevin O’Neill, Ag Stephens, Andrew Woolf British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Outline NDG Aims and Metadata Taxonomy Demonstration of NDG in action NDG Authorisation – the security bit! Status British Atmospheric Data Centre http://badc.nerc.ac.uk
Timelines & Bottom Line 2002: E-science arrives at NERC: Legacy Systems with millions of files and terabytes of data and existing access and authorisation systems that cannot easily be replaced. Complex existing DISCOVERY metadata systems. Discovery (where it exists) based on Z39.50 Utilisation based on file retrieval. 2004: NERC DataGrid ready to move forward New metadata systems describe data as well as datasets. OAI based harvesting supports scalable FAST data discovery. Requirements capture for new authorisation systems complete, and coding underway for implementation. New communities involved, and international discovery very close to operational reality. 2005: Utilisation based on metadata, on demand server side behaviours, grid-based back end parallelisation etc British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Complexity + Volume + Remote Access = Grid Challenge British Atmospheric Data Centre Simulations Assimilation British Oceanographic Data Centre http://ndg.nerc.ac.uk British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre NDG Metadata Taxonomy British Atmospheric Data Centre http://badc.nerc.ac.uk
NDG Metadata Architecture Service based model: clear separation between discovery and use discovery service standards compliant and interoperable British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre (D) - Discovery Open Archives Initiative – Digital Library Protocol for harvesting metadata. NDG Supports Multiple Discovery Services – “build your own” OAI Multiple Protocol Support will be built into the “NDG Vanilla Discovery Service” OAI British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Discovery British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Can order responses by title or data centre (or default random) Choose to go to A service or B service. Flexible Information Return Look at DIFs in either HTML or XML British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Current Interface British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Background activity being parallelised with GODIVA/CCLRC e-science collaboration (spectral -> gridpoint + CDMS + visualisation tools) Download either plot or the data that went into the plot. British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre http://badc.nerc.ac.uk
International Dimension British Atmospheric Data Centre http://badc.nerc.ac.uk
Southampton Oceanography Centre British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Authorisation Role-based access: <dataset> <host> badc.nerc.ac.uk </host> <name>ukmo-obs </name> <access-requires> researcher <access-requires> <access-requires> ukmo-obs </access-requires> <processing-requires> nerc </processing-requires> </dataset> Key concept: Only hosts that trust each other share data, even within a larger virtual organisation: e.g. at BADC: <trusted> <bodc> <host>ndg.bodc.nerc.ac.uk</host> <attribute remotename=”nerc”> nerc </attribute> <attribute remotename=”ashoe”> ashoe </attribute> <attribute remotename=”staff”> nerc </attribute> <other> bodc </other> </bodc> </trusted> Signed “conditions of use” form exists for this dataset British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre NDG Security Certificate based, pass encrypted credentials between user and gatekeeper. British Atmospheric Data Centre http://badc.nerc.ac.uk
British Atmospheric Data Centre Where are we? Migration to web services underway for some components, new A services in design phase, implementation details not yet obvious (e.g. GT4 etc). Major effort on defining feature types for observation types so we can build an OGC/ISO compatible data extractor for observations and numerical data. Security Infrastructure Development Collaboration with CCLRC e-science, ECOGrid Ongoing work on metadata definition and population: Oceanographic data Atmospheric Chemistry data Major issues with (un)controlled vocabularies Numerical Modelling data DIF numerical definition (moving to ISO), BADC and UK Community Katherine Bouton’s work at NCAS/CGAM (“B” MODEL METADATA) Remote Sensing Data Collaboration with NEODC and PML Ongoing work on databases and interfaces, DIF to ISO and “B” British Atmospheric Data Centre http://badc.nerc.ac.uk