Comb-e-Chem Jeremy Frey Sept 2004 Drug Design & Delivery: The role of e-Science Jeremy Frey School of Chemistry University of Southampton, UK X-ray single.

Slides:



Advertisements
Similar presentations
IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
Advertisements

Comb-e-Chem Jeremy Frey Sept 2003 From e-Science to Jeremy Frey School of Chemistry University of Southampton, UK X-ray single Mol STM.
CoAKTing IFD Dave in Hawaii. AKT Workshop January CoAKTing IFD n Objective is to advance the state of the art in collaborative mediated spaces.
CoAKTing IFD Dave in Hawaii. 2 CoAKTing IFD n Objective is to advance the state of the art in collaborative mediated spaces for distributed e- Science.
National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Experiences in deploying a useable Grid-enabled service for the National Crystallography Service Simon J. Coles.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Crystal Structure EPrints: Source Through the Open Archive Initiative S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge.
Less is More Lightweight Ontologies and User Interfaces for Smart Labs J. G. Frey, G. V. Hughes, H. R. Mills, m. c. schraefel, G. M. Smith, David De Roure.
National Crystallography Grid Service Comb-e-Chem
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
Crystallographic Metadata Simon Coles CrystalGrid Collaboratory Foundation Meeting September 2004.
Peter Berrisford RAL – Data Management Group SRB Services.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
RCUK, Octiber Archiving research data and research publications. Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton Dr Simon.
Streaming Video: Overcoming Barriers for Teaching and Learning JISC/NSF Digital Libraries Initiative All Projects Meeting, Edinburgh.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Integrating research data into the publication workflow: eBank UK experience Rachel Heery, UKOLN, University of Bath
Terminologies: An e-Science perspective Nicholas Gibbins Intelligence, Agents, Multimedia University of Southampton.
Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
UKOLN is supported by: Enhancing access to research data: the challenge of crystallography Rachel Heery, Monica Duke, Michael Day UKOLN, University of.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
EBank UK CCLRC Workshop February eBank and CCLRC Workshop February 2005 University of Bath.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
The Data Lifecycle and the Curation of Laboratory Experimental Data Tony Hey Corporate VP for Technical Computing Microsoft Corporation.
The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles School of Chemistry, University of Southampton, U.K.
28 October 2005Jeremy Frey, University of Southampton1 “The CombeChem Experience” CICC Workshop 28 October 2005 Bloomington Indiana.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
© S.J. Coles 2005 ACS 2005, San Diego Furthering Chemoinformatics through ‘Crystalloinformatics’ Simon J. Coles EPSRC National Crystallography Service.
© S.J. Coles 2005 eChemInfo2005 Open Archives as a Route for Capture, Dissemination and Access to Chemical Data and Information Simon Coles School of Chemistry,
21 Nov 2006 Jeremy G. Frey University of Southampton DCC Conference Glasgow The curation of laboratory experimental data as part of the overall data lifecycle.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Integrated e-Infrastructure for Scientific Facilities Kerstin Kleese van Dam STFC- e-Science Centre Daresbury Laboratory
Research Information System for Materials - Database, Simulation and Knowledge Toshihiro Ashino Toyo University
Meeting Capture and Structural Replay Compendium in Meeting Replay Web Interface BuddySpace I-X Process System Mars Exploration Mission
CSED Computational Science & Engineering Department CHEMICAL DATABASE SERVICE The Current Service is Well Regarded The CDS has a long and distinguished.
From GEANT to Grid empowered Research Infrastructures ANTONELLA KARLSON DG INFSO Research Infrastructures Grids Information Day 25 March 2003 From GEANT.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Smart Lab, Smart Tea H. R. Mills, G. V. Hughes, m. c. schraefel, J. G. Frey, G. M. Smith, David De Roure CombeChem Project Electronics and Computer Science.
Perspectives on Cyberinfrastructure Daniel E. Atkins Professor, University of Michigan School of Information & Dept. of EECS October 2002.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
11 Curation of Chemistry Data from the Laboratory to Publication Jeremy Frey & Simon Coles School of Chemistry University of Southampton Jeremy Frey &
The Grid in a Combinatorial Laboratory Jeremy Frey Department of Chemistry University of Southampton.
Grid-enabling the Australian Synchrotron 24 February 2005.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
CombiChem IBM Structure-Property Mapping Combinatorial Chemistry and the Grid J Frey Department of Chemistry University of Southampton.
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
An Introduction to UK e-Science Anne E Trefethen Deputy Director UK e-Science Core Programme.
CombeDay Making Data Openly Available Simon Coles.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Oct 2004 Jeremy Frey Informatics1 Automation and Semantics: The CombeChem Experience Jeremy Frey CombeDay Feb 2005.
David De Roure Workflows in Support of Large-Scale Science Provenance, a.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
Ian Bruno, Suzanna Ward The Cambridge Crystallographic Data Centre
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Scientific Computing Department
Design and Manufacturing in a Distributed Computer Environment
Welcome to National e-Science Centre Official Opening
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Grid Portal Services IeSE (the Integrated e-Science Environment)
JISC Joint Programmes Meeting 2005
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Presentation transcript:

Comb-e-Chem Jeremy Frey Sept 2004 Drug Design & Delivery: The role of e-Science Jeremy Frey School of Chemistry University of Southampton, UK X-ray single Mol STM Raman Ocean Monolayer

Jeremy G. Frey e-Science e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it. e-Science will change the dynamic of the way science is undertaken. John Taylor, DG of UK OST [The Grid] intends to make access to computing power, scientific data repositories and experimental facilities as easy as the Web makes access to information. Tony Blair, 2002

Jeremy G. Frey The UK e-Science Challenge £120M over a 3 Year Programme to create the next generation IT infrastructure to support e-Science and Business £120M over a 3 Year Programme to create the next generation IT infrastructure to support e-Science and Business Essential that UK plays a leading role in Global Grid development with the USA and EU Essential that UK plays a leading role in Global Grid development with the USA and EU Phase 1: Started roll out of plan for Grid Research, Development and Support of e-Science Pilot Projects Phase 1: Started roll out of plan for Grid Research, Development and Support of e-Science Pilot Projects

Jeremy G. Frey Cambridge Newcastle Edinburgh Oxford Glasgow Manchester Cardiff Southampton London Belfast DL RAL Hinxton UK e-Science Grid

Jeremy G. Frey National e-Science Centre (NeSC) NeSC is in Edinburgh NeSC is in Edinburgh Provides Courses & Meetings Provides Courses & Meetings Also has some funding for fellowships to visit NeSC Also has some funding for fellowships to visit NeSC

Jeremy G. Frey The Collaboratory Concept In 1989, William Wulf, then with the U.S. National Science Foundation, defined a collaboratory as In 1989, William Wulf, then with the U.S. National Science Foundation, defined a collaboratory as "a center without walls, in which the nation's researchers can perform their research without regard to geographical location, interacting with colleagues, accessing instrumentation, sharing data and computational resources, and accessing information in digital libraries."

Jeremy G. Frey HPC Analysis Storage Analysis Experiment Computing HPC Scientist The Current Client – Server ad hock model

Jeremy G. Frey The Future The Grid Model - Information Utilities MIDLEWAREMIDLEWARE Experiment Computing Storage Analysis Scientist

Jeremy G. Frey Access Grid Full multi-site video conferencing over the IP network Full multi-site video conferencing over the IP network Many sites now in the UK all running the same system Many sites now in the UK all running the same system System originated in the USA so also sites there. System originated in the USA so also sites there.

Jeremy G. Frey Access Grid nodes Access Grid

Jeremy G. Frey The Grid Grid is needed because Grid is needed because – Volume of data (real time data, images, video) – Scale of computation (analysis, simulation) – Complexity of process (automation) – Variable demands on computation – Provenance (audit trials, timestamps, process)

Jeremy G. Frey Bristol Chemistry ECS Stats Chemistry Combi Centre Southampton NCS IUPAC RSC IBM CCDC Pfizer IT Innovation Comb-e-Chem Partners GSK AZ

Jeremy G. Frey CombeChem People & Places IBM GSK Pfizer AZ

Jeremy G. Frey People Chemistry (Southampton & Bristol) – –Mike Hursthouse, Chris Frampton, Jon Essex, Jeremy Frey, Guy Orpen, Stephan Christensen, Thomas Gelbrich, Sam Peppe, Hongchen Fu, Graham Tizard, Suzanna Ward, Lefteris Danos National Crystallography Service (NCS) – –Simon Coles, Mark Light, Ann Bingham Electronics and Computer Science (Southampton) – –Dave De Roure, Luck Moreau, Mike Luck, Hugo Mills, Graham Smith, Simon Miles, Nicky Harding, Gareth Hughes, monica Schraefel, Terry Payne It-Innovation (Southampton) – –Mike Surridge, Ken Meacham, Steve Taylor, Daren Marvin Statistics (Southampton) – –Alan Welsh, Sue Lewis, Ralph Manson, Dave Woods Rutherford Appleton Laboratory

Jeremy G. Frey Synthesis Structure Analysis & Correlation Modelling Dissemination Prediction Design PlanGoal Properties All steps must be Grid Aware I will illustrate the application of e-Science to some of these stages using examples from the Comb-e-Chem Project

Jeremy G. Frey Synthesis Structure Analysis & Correlation Modelling Dissemination Prediction Design PlanGoal Properties All steps must be Grid Aware Salt Selection Smart Lab Crystallography Structural Similarities Non-linear optical effects Simulations Combinatorial Chemistry Semantic Grid Descriptors With examples…….

Jeremy G. Frey The Comb- e -Chem Project The exponential world of Combinatorial Synthesis and High throughput analysis meets the exponentially growing power of computing Funding EPSRC, IBM, GSK, AZ, Southampton

Jeremy G. Frey The Comb-e- Chem Vision Structures DB Properties DB Structure + PropertiesKnowledge + Prediction Automation & Remote interaction Co-Laboratory Interaction between users & Dark Labs Simulation and calculation

Jeremy G. Frey Design Automation Analysis Structures Models Properties Experiment

Jeremy G. Frey All about Automation Experiments Information & Knowledge Design Design Synthesis Synthesis Measurement Measurement Analysis Analysis Databases Databases Agents Agents

Jeremy G. Frey Plan & COSHH Digital Model Information Integration Report Knowledge Goal Literature Synthesis Smart Laboratory Analysis

Jeremy G. Frey Plan & COSHH Digital Model Information Integration Report Knowledge Goal Literature Synthesis not just one laboratory but many co-laboratories working together Analysis Smart Laboratory

Jeremy G. Frey Making best use of the Plan COSHH

Jeremy G. Frey Smart Lab

Jeremy G. Frey Smart Help

Jeremy G. Frey Laboratory Context COSHHPlanRecord Annotation Guide Experimenters Digital Context

Jeremy G. Frey Chemistry Starts in the Lab Lab NCS Structure Raw data DatabasePublication

Jeremy G. Frey Chemistry Starts in the Lab Lab NCS Structure Raw data DatabasePublication URI

Jeremy G. Frey Semantic Grid Project Inference based on the semantics Importance of Ontology But problem of contradictions even within a domain This is not an avoidable issue

Jeremy G. Frey XML Gaussian ab initio program Gaussian ab initio program XML wrapper Simulation program XML wrapper Interface Personal Agent But need more general descriptions for services RDF – resource description framework DAML-S (for describing services)

Jeremy G. Frey Databases Database will become the key method of handling all data Database will become the key method of handling all data Metadata must be generated at inception and added as data traverses the workflow Metadata must be generated at inception and added as data traverses the workflow Version control, audit and backup handled at the database level. Version control, audit and backup handled at the database level.

Jeremy G. Frey Talk The UK e-Science Programme The Comb-e-Chem Project Smart Lab NCS Grid Service Structure Analysis Services Dissemination & Publication

Jeremy G. Frey Users ExperimentExpert Data & control links Access Grid links Experiment Remote (Dark) Laboratory Centralised remote equipment, multiple users, few experts Model for National crystallographic Service NCSModel for National crystallographic Service NCS

Jeremy G. Frey Expert Manufacturer Support Service Users Experiment Users Experiment Users Experiment Local link External link Access grid & control links Expert is the central resource in short supply Model for Combinatorial Raman ProjectModel for Combinatorial Raman Project

Jeremy G. Frey Sample Raw images Processed diffraction pattern Structure CIF Database Validation Journal Synthesis Smart LabsNCSArchive CCDC metadata Automated structure determination

Jeremy G. Frey Archiving of Data RAW DATA: Automatic archiving and retrieval with Atlas Datastore (RAL) Development of schema for retrieval of crystallographic metadata from relational databases (ISIS Data analysis group) Storage Resource Broker (SRB): Uniform access interface to different types of storage devices RESULTS DATA: Automatic deposition of CIF data with CCDC GRID- enabled pre-deposition database

Jeremy G. Frey Data Trail Drill down through the analysis path Drill down through the analysis path Look at increasingly raw data Look at increasingly raw data Often large expansion in quantity and variety at each stage Often large expansion in quantity and variety at each stage

Jeremy G. Frey Must be able to track back to the original data Must be able to track back to the original data Primary reason is to allow new analysis in the future by other researchers. Primary reason is to allow new analysis in the future by other researchers. In a university environment this may be viewed as a public responsibility in business environment ensuring maximum value from investment. In a university environment this may be viewed as a public responsibility in business environment ensuring maximum value from investment. Does have implications for provenance and even fraud! Does have implications for provenance and even fraud!

Jeremy G. Frey Journals: source Journal MaterialsDatabaseMultimediaLaboratory DataPaper Full record

Jeremy G. Frey Context Most important provenance provides context Most important provenance provides context Needed to provide the Semantics Needed to provide the Semantics Allows other programs to understand the information (i.e. not just informed human) Allows other programs to understand the information (i.e. not just informed human) Allows inference Allows inference Also useful in synthetic laboratory Also useful in synthetic laboratory

Jeremy G. Frey Publication Chain Institution Laboratory Student Journal Bibliography Professional Body Archive

Jeremy G. Frey e-Bank Project Link comb-e-chem and other semantic grid science projects to the e-print system at Southampton Link comb-e-chem and other semantic grid science projects to the e-print system at Southampton Provide dissemination and provenance Provide dissemination and provenance

Jeremy G. Frey Changing the way we work Data Provenance Quantum Mechanical Analysis Properties Prediction Data Mining, QSAR, etc Design of Experiment E-Lab: Combinatorial Synthesis E-Lab: Properties Measurement E-Lab: X-Ray Crystallography Laboratory Processes Laboratory Processes Structures DB Properties DB Data Streaming Authorship/ Submission Visualisation Agent Assistant Laboratory Processes Samples