Semantic Web and the Grid Brian Matthews. 2 euroCRIS seminar 2004 2 Contents A Changing Environment for Research The Semantic Web The Grid The Semantic.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

What does LOFAR have to do with the Virtual Observatory (VO)? LOFAR Science Day 16 December 2003 Melbourne David Barnes The University of Melbourne.
A centre of expertise in digital information management IMS Digital Repositories Interoperability Andy Powell UKOLN,
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
Environmental Information Data Centre: enabling the discovery of CEH-held data John Watkins Deputy Director EIDC.
Semantic Web Agents: Hope or Hype Nicholas Gibbins School of Electronics and Computer Science University of Southampton.
High Performance Computing Course Notes Grid Computing.
1 Technical Developments Related to Quality Issues Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Future Software Architectures Combining the Web 2.0 with the Semantic Web to realize future Web Communities Maarten Visser
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Chinese-European Workshop on Digital Preservation Beijing (China), July.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Requirements for Epidemic Information Management Farrukh Najmi XML Standards Architect Sun Microsystems
Practical RDF Chapter 1. RDF: An Introduction
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Deploying Trust Policies on the Semantic Web Brian Matthews and Theo Dimitrakos.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Brian Matthews, CRIS 2002, 30/08/02 ERIS Workshop, CRIS2002 Architecture Brian Matthews, Business & Information Technology Dept, CLRC
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Brian Matthews, DeFINE, Pisa 26/11/02 Trust and the Semantic Web Brian Matthews, Business & Information Technology Dept, CLRC
Research and Educational Networking and Cyberinfrastructure Russ Hobby, Internet2 Dan Updegrove, NLR University of Kentucky CI Days 22 February 2010.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
Joint Information Systems Committee Supporting Higher and Further Education Rachel Bruce Programme Manager, JISC Executive Collection.
Infrastructures for Social Simulation Rob Procter National e-Infrastructure for Social Simulation ISGC 2010 Social Simulation Tutorial.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Data Integration in Bioinformatics Using OGSA-DAI The BioDA Project Shirley Crompton, Brian Matthews (CCLRC) Alex Gray, Andrew Jones, Richard White (Cardiff.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Flanders Marine Institute (VLIZ)
OGSA Data Architecture Scenarios
VI-SEEM Data Repository
University of Technology
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Presentation transcript:

Semantic Web and the Grid Brian Matthews

2 euroCRIS seminar Contents A Changing Environment for Research The Semantic Web The Grid The Semantic Grid What does that mean for CRIS and OA? Conclusion

Brian Matthews 3 euroCRIS seminar A Future Environment for Research OA and CRIS as drivers for the management and access to information Need for shared metadata and exchange mechanisms Central control impossible/undesirable –a loosely coupled federated approach –based on common interchange and access standards –W3C, GGF, IETF, OASIS, EuroCRIS, WfMC etc Changes in technology –resource discovery –enables access Two leading technology opportunities –Semantic Web and the GRID

Brian Matthews 4 euroCRIS seminar The Semantic Web Adding machine readable information about the web, to the web. The Web is chaotic - why are resources are linked? –Imagine a library where all the books have the same text on the cover, and the only catalogues are compiled by photocopying the books, cutting up the copies, and arranging the words in the order of frequency. Johan Hjelm Google is great at returning all the pages on the web that mention "Tim Berners-Lee“ –But what about returning those pages written by Tim Berners-Lee? The Semantic Web adds well-defined meaning to describe the Web (Metadata). The Semantic Web is an extension of the current web in which the information is given well-defined meaning, better enabling computers and people to work in cooperation –Tim Berners-Lee, James Hendler and Ora Lassila The Semantic Web, Scientific American, May 2001

Brian Matthews 5 euroCRIS seminar Add Meaning to Resources

Brian Matthews 6 euroCRIS seminar Semantic Web: A Layered Architecture Basic Syntax of the Web Language of triples for describing resources Formalism for defining and sharing vocabularies Reasoning over statements about resources “The Web of Trust”

Brian Matthews 7 euroCRIS seminar Machine Readable Meaning Meaning becomes machine readable - so software agents can use it for: –Improving searches (indexing, cataloguing) –Convey information on the usage of the resource (access control, IPR). –Convey information on the actors involved (user preferences, device profiles, privacy preferences) –Give third party opinions on the content of another site (rating services, brokering). Essentially, Metadata of all kinds

Brian Matthews 8 euroCRIS seminar Progress so far A lot more than you might think! Base standards are now mature: –RDF, RDF Schema, OWL –many others reaching maturity: Many shared vocabularies emerging –DC, DMoz, Prism, FOAF, VCard, SKOS, RSS…. Lots of RDF out there! –Mozilla, Adobe, RSS, Still a lot of work to do –reasoning, trust, provenance, tools, But we are getting there!

Brian Matthews 9 euroCRIS seminar Example: SKOS Community effort led by CCLRC/W3C A vocabulary to represent Thesauruses Heavily used in the library community –but traditionally locked up in institutional databases Allow people to share controlled vocabularies for cataloguing resources Examples –GEMET – environmental data –GCL – e-Government –English Heritage –W3C glossary CRIS 2 CRIS 1 CRIS portal Query distributor and collator Users Thesaurus Service  

Brian Matthews 10 euroCRIS seminar Example: Simile Project of MIT + HP Labs + W3C Publishing digital library information onto the semantic web. Make semantic interoperability of metadata a reality for digital libraries by: –providing reusable software for browsing, searching and mapping heterogeneous metadata –using semantic web technologies –identifying issues, gaps and best practices allow libraries to share information Provide semantic web browser, and RDF based datasets –for art history information –combined from different sources Using SKOS as the thesaurus format. OA within the Semantic Web

Brian Matthews 11 euroCRIS seminar Semantic Web and OA Semantic web provides an underlying mechanism to support OA: –common metadata –data exchange mechanism –searching and browsing across web –query language and logic –interoperability –lose coupling. Can also support CRIS this way too. –CERIF in OWL (Lopatenko) And also Data Sets –CCLRC Metadata format – also in RDF Schema But that is not the only main technology change

Brian Matthews 12 euroCRIS seminar The Grid The Grid provides an environment that enable software applications to integrate instruments, displays, computational and information resources that are managed by diverse organisations in widespread locations. Provide access to a global distributed computing environment –via authentication, authorisation, negotiation, security Identify and allocate appropriate resources –interrogate information services -> resource discovery –enquire current status/loading via monitoring tools –decide strategy - eg move data or move application –(co-)allocate resources -> process flow Schedule tasks and analyse results –ensure required application code is available on remote machine –transfer or replicate data and update catalogues –monitor execution and resolve problems as they occur –retrieve and analyse results - eg using local visualization So far typically in large-scale science and engineering.

Brian Matthews 13 euroCRIS seminar To make this happen you need... agreed protocols (cf WWW -> W3C) defined application programming interfaces (APIs) existence of directories for both system and application distributed data management availability of current status of resources monitoring tools accepted authentication procedures and policies network traffic management provided by Grid-based toolkits and services

Brian Matthews 14 euroCRIS seminar 2004 GRID History mid 90s – Globus The GRID Bible Based on “traditional” protocols (IETF) Taken up by e- Science Standardised via GGF Now converging with Web –Web Services - WSRF

Brian Matthews 15 euroCRIS seminar Computer simulations real-time collection Multi-source Data Analysis desktop & VR clients with shared controls Unitary Plan Wind Tunnel Example: NASA IPG archival storage

Brian Matthews 16 euroCRIS seminar Example: DataGrid LHC will produce several PBs of data per year for at least 10 years from Data analysis will be carried out by farms of 1000’s of commodity processors (the “computing fabric”) in each of about 10 regional Tier1 centres - RAL is UK Tier1 Each Tier1 centre will need to hold several PBs of raw data and results of physics analysis Strong focus on middleware and testbeds - open source

Brian Matthews 17 euroCRIS seminar What Next? The Semantic Grid Semantic Grid distributed computation GRID WEB Semantic Web machine readable semantics thanks to Dave de Roure

Brian Matthews 18 euroCRIS seminar What Next? The Semantic Grid Current GRID is “hand-crafted” –users have to know a lot about the available resources –users have to “write scripts” to use the GRID Add machine readable semantics (metadata) –The Semantic GRID Semantic Grid distributed computation GRID WEB Semantic Web machine readable semantics thanks to Dave de Roure “the GRID is an application of the Semantic Web” de Roure, Goble

Brian Matthews 19 euroCRIS seminar But what does that mean? more automation more negotiation more autonomy more self-monitoring and control use of autonomous agents Will make the Grid much more like the electricity Grid –You don’t need to know where the stuff comes from.

Brian Matthews 20 euroCRIS seminar Major UK e-Science project –Bio-informatics –In-silico experimentation – Based on a GRID architecture Uses Semantic Web Tools for –Workflow and service discovery Prior to and during enactment Semantic registration –Workflow assembly Semantic service typing of inputs and outputs –Provenance of workflows and other entities –Experimental metadata glue –Use of RDF, RDFS, DAML+OIL/OWL Instance store, ontology server, reasoner Materialised vs at point of delivery reasoning. –myGrid Information Model About to join them to work on workflow Semantic Grid Example

Brian Matthews 21 euroCRIS seminar What does this mean for CRIS & OA? Portal with knowledge-assisted user interface Digital Curation Facility SCIENTIFIC DATASETS metadata PUBLICATIONS metadata CRIS metadata publish validate GRIDs Ambient, Pervasive Access The Semantic Grid is what makes this work!

Brian Matthews 22 euroCRIS seminar Example: Validation Validate results from paper –need to access paper (OA) –need to link to data (and metadata) –need to access analysis and visualisation tools –need common metadata and access to resources across Grid. Grid middleware Local data Local metadata DA 1 Data PortalPub Portal Local data Local metadata DA 2 Local data Local metadata IR 1 Local data Local metadata IR 2

Brian Matthews 23 euroCRIS seminar Example: Science as a process Within a Grid environment Submit proposal Prepare experiment Generate results Analyse results Write report Provenance metadata + access conditions data description +++ data location Related material Collecting the metadata can then become part of the experimental support environment CRIS DAIR

Brian Matthews 24 euroCRIS seminar Example: the Nature of a Publication Traditional publication as continuous text, with static graphs and images Change the notion of the content of the publication –hypertext –include active components – links to simulations, visualisations a much more dynamic document –a multimedia presentation How will publishers cope? How will publication archives cope?

Brian Matthews 25 euroCRIS seminar So how to achieve this? Resource discovery –good metadata –common formats –standards Resource negotiation –for data and services Quality of service guarantees Policies and contracts Security and trust Provenance Monitoring and payment Work flow Reasoning tools Autonomous agents Autonomic systems Links to legacy –especially database systems –querying systems Collaborative working environments Design methods

Brian Matthews 26 euroCRIS seminar Progress Moving quite fast on this from many different directions –e-Science –Next Generation Grid Report –FP6/7 –Semantic Grid at GGF –OA initiatives –Digital Curation a major concern Real exciting opportunity to pull it all together

Brian Matthews 27 euroCRIS seminar Conclusions Semantic Grid and Open Access –enables –enabling CRIS as an information coordinator Archiving and curation –need to archive much more –data, programs, visualisation and analysis tools, formats, calibrations, versions, OS …… Workflow a key component Metadata collection and maintenance is a big problem.