Comb-e-Chem Jeremy Frey Sept 2003 From e-Science to Jeremy Frey School of Chemistry University of Southampton, UK X-ray single Mol STM.

Slides:



Advertisements
Similar presentations
Common Instrument Middleware Architecture and Federation of Instrument Resources for X-ray Crystallography Rick McMullen Indiana University.
Advertisements

IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
CoAKTing IFD Dave in Hawaii. 2 CoAKTing IFD n Objective is to advance the state of the art in collaborative mediated spaces for distributed e- Science.
DRIVER Step One towards a Pan-European Digital Repository Infrastructure Norbert Lossau Bielefeld University, Germany Scientific coordinator of the Project.
Geo-spatial and Visualisation L&T materials - the e-MapScholar project Moira Massey ALT-C 2002 University of Sunderland.
120K293K 395K S.J. Coles a, P.N. Horton a, M.B. Hursthouse a, W. Clegg b & R.W. Harrington b. a School of Chemistry, University of Southampton, UK.; b.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Experiences in deploying a useable Grid-enabled service for the National Crystallography Service Simon J. Coles.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Crystal Structure EPrints: Source Through the Open Archive Initiative S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge.
National Crystallography Grid Service Comb-e-Chem
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
Crystallographic Metadata Simon Coles CrystalGrid Collaboratory Foundation Meeting September 2004.
A distributed architecture for crystallography data, metadata, and applications John C. Bollinger Indiana University Molecular Structure Center, Bloomington,
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
Linking Data and Publications: the Chemistry Way Simon Coles School of Chemistry, University of Southampton, U.K. CLADDIER workshop.
Data and metadata in the Reciprocal Net John C. Bollinger Indiana University Molecular Structure Center, Bloomington, IN.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
RCUK, Octiber Archiving research data and research publications. Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton Dr Simon.
Comb-e-Chem Jeremy Frey Sept 2004 Drug Design & Delivery: The role of e-Science Jeremy Frey School of Chemistry University of Southampton, UK X-ray single.
Integrating research data into the publication workflow: eBank UK experience Rachel Heery, UKOLN, University of Bath
Terminologies: An e-Science perspective Nicholas Gibbins Intelligence, Agents, Multimedia University of Southampton.
Data Curation in Crystallography: Publisher Perspectives JISC Data Cluster Consultation Workshop CCLRC, Didcot, Oxon 10 October 2006.
UKOLN is supported by: eBank UK : linking research data, scholarly communications and learning. Dr Liz Lyon, UKOLN, University of Bath, UK JISC CNI Conference.
Federation The eCrystals Federation Dr Simon Coles, University of Southampton, UK Dr Liz Lyon, UKOLN, University of Bath, UK Open Repositories 2008, University.
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
UKOLN is supported by: Enhancing access to research data: the challenge of crystallography Rachel Heery, Monica Duke, Michael Day UKOLN, University of.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
EBank UK CCLRC Workshop February eBank and CCLRC Workshop February 2005 University of Bath.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
From AgentLink II to AgentLink III Co-ordinators: Peter McBurney, University of Liverpool, UK Terry Payne, University of Southampton, UK.
PaN-data WP7 - Integration Brian Matthews STFC-e-Science.
The Data Lifecycle and the Curation of Laboratory Experimental Data Tony Hey Corporate VP for Technical Computing Microsoft Corporation.
28 October 2005Jeremy Frey, University of Southampton1 “The CombeChem Experience” CICC Workshop 28 October 2005 Bloomington Indiana.
What is e-Science? e-Science refers to large scale science that will increasingly be carried out through distributed global collaborations enabled by the.
University of Southampton, U.K.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
Disseminating crystallography results the Indiana way John C. Huffman and John C. Bollinger Indiana University Molecular Structure Center, Bloomington,
Crystallographic Data Publication at Source International Union of Crystallography Peter R. Strickland and Brian McMahon IUCr 5 Abbey Square Chester CH1.
"Keeping alert: issues to know today for long-term digital preservation with repositories" Neil Beagrie Fedora Users Group Open Repositories Southampton.
© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K.
© S.J. Coles 2005 eChemInfo2005 Open Archives as a Route for Capture, Dissemination and Access to Chemical Data and Information Simon Coles School of Chemistry,
21 Nov 2006 Jeremy G. Frey University of Southampton DCC Conference Glasgow The curation of laboratory experimental data as part of the overall data lifecycle.
Knowledge Environments for Science and Engineering: Overview of Past, Present and Future Michael Pazzani, Information and Intelligent Systems Division,
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
Meeting Capture and Structural Replay Compendium in Meeting Replay Web Interface BuddySpace I-X Process System Mars Exploration Mission
CSED Computational Science & Engineering Department CHEMICAL DATABASE SERVICE The Current Service is Well Regarded The CDS has a long and distinguished.
From GEANT to Grid empowered Research Infrastructures ANTONELLA KARLSON DG INFSO Research Infrastructures Grids Information Day 25 March 2003 From GEANT.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Perspectives on Cyberinfrastructure Daniel E. Atkins Professor, University of Michigan School of Information & Dept. of EECS October 2002.
11 Curation of Chemistry Data from the Laboratory to Publication Jeremy Frey & Simon Coles School of Chemistry University of Southampton Jeremy Frey &
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
The Grid in a Combinatorial Laboratory Jeremy Frey Department of Chemistry University of Southampton.
CombiChem IBM Structure-Property Mapping Combinatorial Chemistry and the Grid J Frey Department of Chemistry University of Southampton.
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
CombeDay Making Data Openly Available Simon Coles.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
Oct 2004 Jeremy Frey Informatics1 Automation and Semantics: The CombeChem Experience Jeremy Frey CombeDay Feb 2005.
Project number: ENVRI and the Grid Wouter Los 20/02/20161.
David De Roure Workflows in Support of Large-Scale Science Provenance, a.
Afternoon session: The archival problem and infrastructure for solutions Prof John R Helliwell Interactive Publications.
Clouds , Grids and Clusters
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
CICC Combines Grid Computing with Chemical Informatics
Presentation transcript:

Comb-e-Chem Jeremy Frey Sept 2003 From e-Science to Jeremy Frey School of Chemistry University of Southampton, UK X-ray single Mol STM Raman Ocean Monolayer

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse e-Science e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it. e-Science will change the dynamic of the way science is undertaken. John Taylor, DG of UK OST [The Grid] intends to make access to computing power, scientific data repositories and experimental facilities as easy as the Web makes access to information. Tony Blair, 2002

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse The Collaboratory Concept In 1989, William Wulf, then with the U.S. National Science Foundation, defined a collaboratory as In 1989, William Wulf, then with the U.S. National Science Foundation, defined a collaboratory as "a center without walls, in which the nation's researchers can perform their research without regard to geographical location, interacting with colleagues, accessing instrumentation, sharing data and computational resources, and accessing information in digital libraries."

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse The Comb- e -Chem Project The exponential world of Combinatorial Synthesis and High throughput analysis meets the exponentially growing power of computing

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Bristol Chemistry ECS Stats Chemistry Combi Centre Southampton NCS IUPAC RSC IBM CCDC Pfizer IT Innovation Comb-e-Chem Partners GSK AZ

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse The Comb-e- Chem Vision Structures DB Properties DB Structure + PropertiesKnowledge + Prediction Automation & Remote interaction Co-Laboratory Interaction between users & Dark Labs Simulation and calculation

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Comb-e-Chem Project - Automation X-Ray e-Lab Analysis Properties Properties e-Lab Simulation Video Diffractometer Grid Structures Database

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse HPC Analysis Storage Analysis Experiment Computing HPC Scientist Scientist at the Centre of an Information Web By access variable and difficult

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse The Future The Grid Model - Information Utilities Uniform access MIDLEWAREMIDLEWARE Experiment Computing Storage Analysis Scientist Remember that you contribute to other peoples information web

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse End - to - end connectivity Provide the smooth connection between the sources of data & information Provide the smooth connection between the sources of data & information From literature to the laboratory bench and back via all stages of analysis and discussion From literature to the laboratory bench and back via all stages of analysis and discussion Thus the need for a Data Grid or Grids Thus the need for a Data Grid or Grids Al steps need to be Grid aware Al steps need to be Grid aware

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Plan & COSHH Digital Model Information Integration Report Knowledge Goal Literature Synthesis Smart Laboratory Analysis Generate information within & for the grid context

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Variety of data

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse The Grid Grid is needed because Grid is needed because – Complexity of data – Volume of data (real time data, images, video) – Scale of computation (analysis, simulation) – Complexity of process (automation) – Variable demands on computation – Provenance (audit trials, timestamps, process)

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Dissemination & Publication A different approach is required to provide data to the community The grid provides the necessary medium What & How do we want to make available

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Journals: source Journal MaterialsDatabaseMultimediaLaboratory DataPaper Full record

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Data Trail Drill down through the analysis path Drill down through the analysis path Look at increasingly raw data Look at increasingly raw data Often large expansion in quantity and variety at each stage Often large expansion in quantity and variety at each stage Need URIs for everything Need URIs for everything

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Must be able to track back to the original data Must be able to track back to the original data Primary reason is to allow new analysis in the future by other researchers. Primary reason is to allow new analysis in the future by other researchers. In a university environment this may be viewed as a public responsibility in business environment ensuring maximum value from investment. In a university environment this may be viewed as a public responsibility in business environment ensuring maximum value from investment. Does have implications for provenance and even fraud! Does have implications for provenance and even fraud!

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Publication Chain Institution Laboratory Student Journal Bibliography Professional Body Archive

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Sample Raw images Processed diffraction pattern Structure CIF Database Validation Journal Synthesis Smart LabsNCSArchive CCDC metadata Automated structure determination

Comb-e-Chem Jeremy Frey Sept 2003 Chemical Crystallography: A Suitable Case for OA Therapy Chemical Crystallography: A Suitable Case for OA Therapy Mike Hursthouse Department of Chemistry and Combinatorial Centre of Excellence, EPSRC National Service for Crystallography University of Southampton, UK

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Characterisation technique for Chemical Structure. Characterisation technique for Chemical Structure. Use XRD. Use XRD. Provides high level of chem knowledge Provides high level of chem knowledge Structure – molecular or crystal Structure – molecular or crystal Previously focussed on molecular structure – chemical props Previously focussed on molecular structure – chemical props Now focus on crystal structure – physical props Now focus on crystal structure – physical props Change in interest facilitated by availability of database archive. Change in interest facilitated by availability of database archive. However, woefully incomplete However, woefully incomplete ChemCryst

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Database Archive – ca entries – all Database Archive – ca entries – all published structures published structures >10M chemical compounds known >10M chemical compounds known Probably 1.5M structures known Probably 1.5M structures known Why shortfall? Archaic publishing methods. Why shortfall? Archaic publishing methods. Solution? Solution? ChemCryst

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse ChemCryst results New dissemination strategy ChemCryst results New dissemination strategy E-Prints of Structure Reports E-Prints of Structure Reports Can be created automatically. Can be created automatically. Work can be validated automatically. Work can be validated automatically. All data (raw, processed, meta…) included. All data (raw, processed, meta…) included. Hence bypass Journal sponsored refereeing Hence bypass Journal sponsored refereeing Still need to decide on publication of science Still need to decide on publication of science ChemCryst

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse e-Bank Project JISC project with UKOLN Link comb-e-chem and other semantic grid science projects to the e-print system at Southampton Link comb-e-chem and other semantic grid science projects to the e-print system at Southampton Provide dissemination and provenance Provide dissemination and provenance

19 Feb 2004 OAI Meeting Jeremy G. Frey & Mike Hursthouse Changing the way we work Data Provenance Quantum Mechanical Analysis Properties Prediction Data Mining, QSAR, etc Design of Experiment E-Lab: Combinatorial Synthesis E-Lab: Properties Measurement E-Lab: X-Ray Crystallography Laboratory Processes Laboratory Processes Structures DB Properties DB Data Streaming Authorship/ Submission Visualisation Agent Assistant Laboratory Processes Samples