Japanese & UK N+N Data, Data everywhere and … Prof. Malcolm Atkinson Director www.nesc.ac.uk 3 rd October 2003.

Slides:



Advertisements
Similar presentations
Delivery of Industrial Strength Middleware Federated Strengths Agility & Coordination Prof. Malcolm Atkinson Director 21 st January 2004.
Advertisements

Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
Chinese Delegation visit Malcolm Atkinson Director 18 th November 2004.
UK Role in Open Grid Services Architecture Towards an Architectural Road Map A Report to the Technical Advisory Group from The Architecture Task Force.
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
UK e-Science Report on OGSA, OGSI & OGSA-DAI Malcolm Atkinson Director of National e-Science Centre 28 th October 2002 Meeting of the UK.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
E-Science Update Steve Gough, ITS 19 Feb e-Science large scale science increasingly carried out through distributed global collaborations enabled.
Jose Jimenez Director. International Programmes Telefónica Digital.
Grid-Enabling Data: Sticking Plaster, Sellotape, & Chewing Gum? Colin C. Venters National Centre for e-Social Science University.
High Performance Computing Course Notes Grid Computing.
GENI: Global Environment for Networking Innovations Larry Landweber Senior Advisor NSF:CISE Joint Techs Madison, WI July 17, 2006.
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
EGEE is a project funded by the European Union under contract IST Grid Summer School Vico Equense, 27 July An Introduction.
EGEE is a project funded by the European Union under contract IST Grid Summer School Vico Equense, 27 July An Introduction.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
1 GGF International Summer School on Grid Computing Vico Equense (Naples), Italy Introduction to OGSA-DAI Prof. Malcolm Atkinson Director
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Grid Computing Net 535.
Designing and Building Grid Services GGF9 Chicago October 8, 2003 Organizers: Ian Foster, Marty Humphrey, Kate Keahey, Norman Paton, David Snelling.
Welcome e-Science in the UK Building Collaborative eResearch Environments Prof. Malcolm Atkinson Director 23 rd February 2004.
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
INFSO-SSA International Collaboration to Extend and Advance Grid Education ICEAGE Forum Meeting at EGEE Conference, Geneva Malcolm Atkinson & David.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
User requirements for and concerns about a European e-Infrastructure Steven Newhouse, Director.
Ian Foster Argonne National Lab University of Chicago Globus Project The Grid and Meteorology Meteorology and HPN Workshop, APAN.
Extensible Framework for Data Access & Integration Malcolm Atkinson Director 10 th November 2004.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 HPDC12 Seattle Structured Data and the Grid Access and Integration Prof. Malcolm Atkinson Director 23 rd June 2003.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Futures Lab: Biology Greenhouse gasses. Carbon-neutral fuels. Cleaning Waste Sites. All of these problems have possible solutions originating in the biology.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
1 ARGONNE  CHICAGO Grid Introduction and Overview Ian Foster Argonne National Lab University of Chicago Globus Project
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
IBM & HSBC visit Malcolm Atkinson Director & e-Science Envoy UK National e-Science Centre & e-Science Institute 30 th March 2006.
European Network Policy Group Malcolm Atkinson Director 28 th October 2004.
1 The Challenge of Data Integration Data + Grid = Discovery? Prof. Malcolm Atkinson Director 22 nd January 2003.
Remarks on OGSA and OGSI e-Science All Hands Meeting September Geoffrey Fox, Indiana University.
Utility Computing: Security & Trust Issues Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
7. Grid Computing Systems and Resource Management
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director 7 th April 2004.
National e-Science Institute and National e-Science Centre The Way Ahead Prof. Malcolm Atkinson Director 30 th September 2003.
1 OGSA Transition ATF Migration Strategy Prof. Malcolm Atkinson Director 28 th April 2003.
The OGSA-DAI Project Databases and the Grid Neil Chue Hong Project Manager EPCC, Edinburgh
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Data and storage services on the NGS.
Chinese Delegation Visit High Performance Computer Mission UK e-Science & The National e-Science Centre Prof. Malcolm Atkinson Director
The National Grid Service Mike Mineter.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
UK Role in Open Grid Services Architecture Towards an Architectural Road Map A Report to the Technical Advisory Group from The Architecture Task Force.
Welcome & The National e-Science Centre Prof. Malcolm Atkinson Director 6 th November 2003.
Research at the National e-Science Centre Dr. Dave Berry Research Manager 6 th November 2003.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Welcome to National e-Science Centre Official Opening
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
University of Technology
The National Grid Service
Grid Introduction and Overview
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Review of grid computing
Grid Systems: What do we need from web service standards?
Presentation transcript:

Japanese & UK N+N Data, Data everywhere and … Prof. Malcolm Atkinson Director 3 rd October 2003

Discovery is a wonderful thing

Web Hits - Domain

Our job: Make the Party a Success every time Computing Science Systems, Notations & Formal Foundation → Process & Trust Theory Models & Simulations → Shared Data Experiment & Advanced Data Collection → Shared Data Multi-national, Multi-discipline, Computer-enabled Consortia, Cultures & Societies Requires Much Engineering, Much Innovation Changes Culture, New Mores, New Behaviours

Integration is our Focus Supporting Collaboration Bring together disciplines Bring together people engaged in shared challenge Inject initial energy Invent methods that work Supporting Collaborative Research Integrate compute, storage and communications Deliver and sustain integrated software stack Operate dependable infrastructure service Integrate multiple data sources Integrate data and computation Integrate experiment with simulation Integrate visualisation and analysis High-level tools and automation essential Fundamental research as a foundation

Derived from Ian Foster’s slide at ssdbM July 03 It’s Easy to Forget How Different 2003 is From 1993 Enormous quantities of data: Petabytes For an increasing number of communities Gating step is not collection but analysis Ubiquitous Internet: >100 million hosts Collaboration & resource sharing the norm Security and Trust are crucial issues Ultra-high-speed networks: >10 Gb/s Global optical networks Bottlenecks: last kilometre & firewalls Huge quantities of computing: >100 Top/s Moore’s law gives us all supercomputers Ubiquitous computing (Moore’s law) 2 everywhere Instruments, detectors, sensors, scanners, …

Tera → Peta Bytes RAM time to move 15 minutes 1Gb WAN move time 10 hours ($1000) Disk Cost 7 disks = $5000 (SCSI) Disk Power 100 Watts Disk Weight 5.6 Kg Disk Footprint Inside machine RAM time to move 2 months 1Gb WAN move time 14 months ($1 million) Disk Cost 6800 Disks units + 32 racks = $7 million Disk Power 100 Kilowatts Disk Weight 33 Tonnes Disk Footprint 60 m 2 May 2003 Approximately Correct See also Distributed Computing Economics Jim Gray, Microsoft Research, MSR-TR

Dynamically Move computation to the data Assumption: code size << data size Develop the database philosophy for this? Queries are dynamically re-organised & bound Develop the storage architecture for this? Compute closer to disk?  System on a Chip using free space in the on-disk controller Data Cutter a step in this direction Develop the sensor & simulation architectures for this? Safe hosting of arbitrary computation Proof-carrying code for data and compute intensive tasks + robust hosting environments Provision combined storage & compute resources Decomposition of applications To ship behaviour-bounded sub-computations to data Co-scheduling & co-optimisation Data & Code (movement), Code execution Recovery and compensation Dave Patterson Seattle SIGMOD 98

OGSA Infrastructure Architecture OGSI: Interface to Grid Infrastructure Data Intensive Applications for Science X Compute, Data & Storage Resources Distributed Simulation, Analysis & Integration Technology for Science X Data Intensive X Scientists Virtual Integration Architecture Generic Virtual Data Access and Integration Layer Structured Data Integration Structured Data Access Structured Data Relational XML Semi-structured- Transformation Registry Job Submission Data TransportResource Usage Banking BrokeringWorkflow Authorisation

1a. Request to Registry for sources of data about “x” 1b. Registry responds with Factory handle 2a. Request to Factory for access to database 2c. Factory returns handle of GDS to client 3a. Client queries GDS with XPath, SQL, etc 3b. GDS interacts with database 3c. Results of query returned to client as XML SOAP/HTTP service creation API interactions RegistryFactory 2b. Factory creates GridDataService to manage access Grid Data Service Client XML / Relationa l database Data Access & Integration Services

GDTS 2 GDS 3 2 GDTS 1 S x S y 1a. Request to Registry for sources of data about “x” & “y” 1b. Registry responds with Factory handle 2a. Request to Factory for access and integration from resources Sx and Sy 2b. Factory creates GridDataServices network 2c. Factory returns handle of GDS to client 3a. Client submits sequence of scripts each has a set of queries to GDS with XPath, SQL, etc 3c. Sequences of result sets returned to analyst as formatted binary described in a standard XML notation SOAP/HTTP service creation API interactions Data Registry Data Access & Integration master Client Analyst XML database Relational database GDS GDTS 3b. Client tells analyst GDS 1 Future DAI Services “scientific” Application coding scientific insights Problem Solving Environment Semantic Meta data Application Code

A New World What Architecture will Enable Data & Computation Integration? Common Conceptual Models Common Planning & Optimisation Common Enactment of Workflows Common Debugging … What Fundamental CS is needed? Trustworthy code & Trustworthy evaluators Decomposition and Recomposition of Applications … Is there an evolutionary path?

Take Home Message Information Grids Support for collaboration Support for computation and data grids Structured data fundamental  Relations, XML, semi-structured, files, … Integrated strategies & technologies needed OGSA-DAI is here now A first step Try it Tell us what is needed to make it better Join in making better DAI services & standards

Cambridge Newcastle Edinburgh Oxford Glasgow Manchester Cardiff Southampton London Belfast Daresbury Lab RAL Hinxton NeSC in the UK National e- Science Centre HPC(x) Directors’ Forum Helped build a community Engineering Task Force Grid Support Centre Architecture Task Force UK Adoption of OGSA OGSA Grid Market Workflow Management Database Task Force OGSA-DAI GGF DAIS-WG GridNet e-Storm Globus Alliance