Environment from the Molecular Level: An e-science project for modelling the atomistic processes involved in environmental issues (funded by NERC)



Molecular Environmental Issues
–Radioactive waste disposal
–Crystal growth and scale inhibition
–Pollution: molecules and atoms on mineral surfaces
–Crystal dissolution and weathering

Rocks and Mineral Structures
–Radioactive waste disposal
–Crystal growth and scale inhibition
–Pollution: molecules and atoms on mineral surfaces
–Crystal dissolution and weathering

The “Grand Challenge”
[Diagram with three axes]
–Level of theory: Quantum Monte Carlo; large empirical models; linear-scaling quantum mechanics
–Contaminant: organic molecules; halogens; metallic elements
–Adsorbing surface: clays, micas; aluminosilicates; natural organic matter; phosphates; carbonates; oxides/hydroxides; sulphides
Requires scientists to work together in teams - a Virtual Organisation

Design
Approach taken:
–Over approximately 3 years we have engaged in many workshops, tutorials and prototyping sessions with developers and users.
  Teaching users what e-Science can “do for them”, including security.
  Cooperation between CCLRC and NIEeS in Cambridge.
–Planned to integrate tools which had already been developed or prototyped at CCLRC, UCL and Reading.
  A service-oriented approach is used for certain aspects: Grid, data management, user interfaces, metadata management.
  Workflow was found to be important to users, e.g. for combinatorial studies.
  Several iterations of software have enabled some usability issues to be addressed.
–Originally envisaged an “Integrated Portal Architecture” linking HPCPortal, DataPortal and visualisation services.
  We thought we knew what users would like, but they actually preferred a simpler incremental approach; workflow scripting was preferred to a single portal.
  There are now several separate tools in use.

E-Minerals Portal

Technical Strategy
Technology considerations:
–Considered: Globus GT2, SRB, Harness, CCF, Portal, Web services, visualisation tools.
  Various tool sets were tried and the users “voted with their feet”.
–Used: Globus, Condor, SRB, AG, MAST, RCommands, Metadata Editor, Workflow scripts, Web services, XML/RDF/OWL for data interoperability.
Infrastructure:
–E-Minerals “mini-Grid” was a great success, based on earlier work at Daresbury and Manchester on Grid evaluation.
  Mini-Grid focuses resources of the e-Minerals VO and includes large campus Condor pools and parallel computers.
  Using Globus, Condor and GSI. Data managed using SRB.
Collaboration tools:
–Access Grid, MAST, Wiki

Integrated Portal Architecture
Generic portal design using Globus and Web Services.
[Diagram labels: Visualisation; DataPortal; HPCPortal; Web Services; GridFTP; GSI; Globus; HPC Systems; Data Systems]
Working with GGF Grid Computing Environments Research Group.

Development Issues
Constraints and other issues:
–Project divided from the outset into: development team; application team; science team.
–All teams work together and collaborate on papers.
–Tools written in C to integrate with existing “heritage” applications, e.g. from the Collaborative Computational Projects (CCPs).
–Other interoperability issues addressed using Web services, e.g. gSOAP (client) + AXIS (server), XML-based data models and Semantic Grid technologies RDF+OWL.
–Constraints: short-term goals, no prior experience of e-Science, new technology must not disrupt current work.
–High requirements on computing resources for simulation studies.
This led to a focus on workflows for repeated calculations, data management for storing and retrieving results, and semantic Web technologies for data interoperability between codes.

Evaluation
Papers presented at All Hands 2005 included:
–E-Science Usability: the e-Minerals Experience (paper 425)
–The e-Minerals Project: Developing the Concept of the Virtual Organisation to support Collaborative Work on Molecular-scale Environmental Simulations (paper 518)
User engagement and evaluation:
–Looked at the Usability Task Force metrics.
–Our approach did not readily map onto them, but there are overlaps.
–Key: understand the science users, their needs, and their natural ways of working.
–Good and bad points summarised on next slides.

Lessons Learnt
What was usable?
–Keep it simple: use effective lightweight tools for the job.
–Condor and Globus: Condor job scripts were accepted readily. Condor-G and DAGMan are now used. RSL is also embedded in scripts.
–SRB: required little training and was found to be useful; SCommands are used in scripts.
–Resource management: a Globus-based resource-monitoring tool was developed (in the Portal). A meta-scheduler is being developed.
–Security: GSI proved “easy for users to work with”. The Portal uses MyProxy to ensure pervasive access. Certificates were not a problem; we offered training from Day 1.
–Collaboration tools: desktop use of AG enables ad hoc meetings, together with MAST (Multi-cast Application Sharing Tool). Wiki and Instant Messaging are also used.
–Semantic technologies: CML was initially used with XSLT and SVG. This is now extended in the AgentX toolkit.
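The DAGMan-based workflows mentioned above, applied to a combinatorial study, can be sketched roughly as follows. This is an illustrative Python sketch and not the project's own workflow scripts: the function name `build_sweep_dag`, the submit-file names `simulate.sub` and `collect.sub`, and the parameter names are all assumptions. It only generates the text of a DAGMan input file using the standard `JOB`, `VARS` and `PARENT ... CHILD` keywords.

```python
from itertools import product


def build_sweep_dag(parameters, submit_file="simulate.sub", collect_file="collect.sub"):
    """Return DAGMan input text: one job per parameter combination,
    followed by a single collection job that runs after all of them.

    The submit-file names are hypothetical placeholders.
    """
    names = sorted(parameters)
    lines = []
    job_names = []
    # One DAG node per point in the Cartesian product of parameter values.
    for index, values in enumerate(product(*(parameters[n] for n in names))):
        job = f"run{index}"
        job_names.append(job)
        lines.append(f"JOB {job} {submit_file}")
        # VARS defines macros that the submit file can reference as $(name).
        macros = " ".join(f'{n}="{v}"' for n, v in zip(names, values))
        lines.append(f"VARS {job} {macros}")
    # The collection job depends on every simulation job.
    lines.append(f"JOB collect {collect_file}")
    lines.append(f"PARENT {' '.join(job_names)} CHILD collect")
    return "\n".join(lines) + "\n"


# Hypothetical sweep: 2 temperatures x 3 pressures = 6 simulation jobs.
dag_text = build_sweep_dag({"temperature": [300, 600], "pressure": [0, 5, 10]})
print(dag_text)
```

Writing the returned text to a file and passing it to `condor_submit_dag` would be the usual next step; the submit files themselves are left out because their contents depend on the simulation code and the pool.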

Lessons Learnt
What was not usable?
–Client tools *: installation has caused difficulties, e.g. Globus. Initially used “submit machines”. Solutions investigated include:
  Portal: hides the complexity behind a Web interface, so the user doesn’t install anything;
  Web service interfaces: for Condor (Chapman et al.), GROWL for Globus and SRB (Allan et al.);
  BPEL interface: work at UCL/OMII, plug-in for Eclipse.
–Firewall issues: for both users and infrastructure, changes to rules lead to instability. Portal and Web services solve this problem for users.
–Meta-data: tools are available, but automatic harvesting is required to avoid mistakes. RCommands were developed to improve this and can be linked into the workflow scripts.
* A recent workshop “Lightweight Grid Computing” was held 2-3/5/06 at Losehill Hall. Attendees from GROWL, RealityGrid, Imperial College, e-Minerals, e-CCP… A transcript of discussions on usability issues is available, giving more detailed information.

Future Plans
Current and future development plans:
–New tools are being developed; for instance, the meta-data editor and RCommands were recently added to the suite.
–AgentX data-interoperability tools have been added from e-CCP, extending the use of CML. Such work is now timely and illustrates how existing large codes, e.g. Siesta and GULP from CCP5, can be integrated easily with visualisation tools.
–Development staff also work on other projects and with other developers. E-Minerals tools are now being evaluated in other areas, e.g. Integrative Biology and e-CCP. There are key synergies and critical mass, sharing of experiences and code/services.
–Full integration via a portal interface was not initially wanted, and also could not be achieved at the start of the project as the technology was not adequate (we tried PHP, now have JSR-168). This is now being re-visited as it provides a good solution to many of the problems highlighted.
–Portlet-based tools from the NGS Portal can be re-used; this has already been done for Integrative Biology and other projects. They can be combined with Wiki etc.
Some following slides show more details of some of the tools.
Blatant advert: Portals and Portlets

AgentX Framework - Overview
–Specify how to locate data (XML, CML, XLink) with a particular meaning
–Applications can use tools (AgentX library) that work with the specification to obtain information
–Classes and properties of entities are specified in an ontology (OWL, RDF/XML)
–Mappings (RDF/XML) associate classes and properties with fragment identifiers (XPointer)
–Fragment identifiers can be used to locate logical collections (classes) and data items (properties)
[Diagram labels: Ontology; Mappings; Data; MOLECULE “Mol_frag_id”; ATOM “Atom_frag_id”; xCoordinate “xCoor_frag_id”; locator; O H H]
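The idea of mapping ontology classes and properties onto locators in a data document can be illustrated with a small Python sketch. This is not the AgentX API (the core library is written in C and its calls are not shown on the slides); the dictionaries `CLASS_LOCATORS` and `PROPERTY_LOCATORS`, the functions `select` and `value`, and the simplified CML-like document are all assumptions made for illustration, with ElementTree paths standing in for XPointer fragment identifiers.

```python
import xml.etree.ElementTree as ET

# A simplified, CML-like data document (assumed layout, water molecule).
DOCUMENT = """
<molecule id="water">
  <atomArray>
    <atom id="a1" elementType="O" x3="0.000"/>
    <atom id="a2" elementType="H" x3="0.757"/>
    <atom id="a3" elementType="H" x3="-0.757"/>
  </atomArray>
</molecule>
"""

# "Mappings": each ontology class is associated with a locator for its instances.
CLASS_LOCATORS = {
    "Molecule": ".",
    "Atom": "./atomArray/atom",
}

# Each (class, property) pair is associated with the attribute holding the data item.
PROPERTY_LOCATORS = {
    ("Atom", "xCoordinate"): "x3",
    ("Atom", "element"): "elementType",
}


def select(root, concept):
    """Locate the logical collection of nodes representing one class."""
    return root.findall(CLASS_LOCATORS[concept])


def value(node, concept, prop):
    """Locate one data item (a property of a class instance)."""
    return node.get(PROPERTY_LOCATORS[(concept, prop)])


root = ET.fromstring(DOCUMENT)
atoms = select(root, "Atom")
elements = [value(a, "Atom", "element") for a in atoms]
x_coordinates = [float(value(a, "Atom", "xCoordinate")) for a in atoms]
print(elements, x_coordinates)
```

The application code asks only for "Atom" and "xCoordinate"; swapping in a different document format would mean changing the two mapping tables and leaving the calling code alone, which is the separation the slide describes.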

AgentX Framework - Example
DL_POLY3 (CCP5) integrated with CCP1 GUI
[Diagram labels: CONTROL; CONFIG.xml; Mappings; DL_POLY3; AgentX core; Fortran wrapper; Standard Ontology; Standard Mappings; AgentX core; Python wrapper; REVCON.xml; Mappings; CCP1 GUI]
AgentX
- Core library written in C
- Wrappers for Python, Perl and Fortran
- Hides the complexities of dealing with XML
- Simple API
- Enables straightforward exchange of information

RCommands
–RCommands are shell tools and associated Web services for meta-data manipulation
–The primary use case for RCommands is within the e-Minerals workflow, i.e. to allow automatic insertion of meta-data as a post-processing action

Function Domain             RCommand
Authentication / Session    Rinit, Rexit, Rpasswd
Entity Operations           Rls, Rcreate, Rrm
Parameter Operations        Rannotate, Rsearch
Permissions                 Rchmod
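The post-processing use case, where meta-data is harvested automatically from a finished run and attached to a dataset, can be sketched as below. This is a toy in-memory stand-in written in Python, not the RCommands tools or their Web services: the class `MetadataStore`, its methods, the `harvest` function and the "name = value" output format are all hypothetical, loosely mirroring the entity and parameter operations in the table.

```python
class MetadataStore:
    """Toy in-memory stand-in for a meta-data service (hypothetical)."""

    def __init__(self):
        self._entities = {}

    def create(self, name):
        # Entity operation: register a dataset if it is not already known.
        self._entities.setdefault(name, {})

    def annotate(self, name, parameter, value):
        # Parameter operation: attach a name/value pair to a dataset.
        self._entities[name][parameter] = value

    def search(self, parameter, value):
        # Parameter operation: find datasets carrying a given name/value pair.
        return sorted(
            name for name, params in self._entities.items()
            if params.get(parameter) == value
        )


def harvest(output_text):
    """Collect 'name = value' lines from simulation output (assumed format)."""
    pairs = {}
    for line in output_text.splitlines():
        if "=" in line:
            key, _, val = line.partition("=")
            pairs[key.strip()] = val.strip()
    return pairs


def post_process(store, dataset, output_text):
    """Workflow post-processing step: create the entity, then annotate it."""
    store.create(dataset)
    for parameter, value in harvest(output_text).items():
        store.annotate(dataset, parameter, value)


store = MetadataStore()
post_process(store, "run0", "code = GULP\ntemperature = 300\nfinished ok")
post_process(store, "run1", "code = GULP\ntemperature = 600")
matches = store.search("temperature", "300")
print(matches)
```

Because the annotation step runs inside the workflow, nobody types the meta-data by hand, which is the source of mistakes that the automatic harvesting is meant to avoid.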

RCommands Service-based Arch
[Diagram labels]
–Client Side: RCommands; gSOAP
–Server Side: Axis; RCommand Server Code; JDBC; Relational Database
–SOAP
–BPEL Engine: link into workflows

Subset of Schema
[Diagram labels: Name Value Pairs; Title; Description; Notes; Start / End Dates; Originator; Name; Description; Name; URI]

Royal Institution University of Reading