“The CombeChem Experience”. Jeremy Frey, University of Southampton. CICC Workshop, 28 October 2005, Bloomington, Indiana.

Presentation transcript:

Slide 1: “The CombeChem Experience”
CICC Workshop, 28 October 2005, Bloomington, Indiana

Slide 2: Chemical Data & Chemical Grids
Chemical data, information & knowledge
– Experiments, Simulation & Computation
Exponential growth in generation of data
– Need automatic capture of metadata
Start in the laboratory: pervasive physical grid
Computational chemistry is a very significant source
Software is to be used by chemists, so it must be simple to support & maintain
– autonomic

Slide 3: Chemical Semantic Grid
RDF (Resource Description Framework)
– From the semantic web world
– Best system for the description of chemical data and processes
– Achieves the same as XML + unique identifiers + linking up, in a simpler manner
Large-scale triple stores (so far up to 50 million triples of molecular structures and properties)
Need for scalable software solutions
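To illustrate the "XML + unique identifiers + linking" point on this slide, here is a minimal sketch of statements held as subject-predicate-object triples and queried by pattern. It is a toy in-memory store in plain Python, not the triple store used by the project; the class name `TinyTripleStore` and every `ex:` identifier are hypothetical.

```python
from typing import Iterator, Optional, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


class TinyTripleStore:
    """Toy in-memory triple store (illustrative only)."""

    def __init__(self) -> None:
        self._triples: Set[Triple] = set()

    def add(self, s: str, p: str, o: str) -> None:
        self._triples.add((s, p, o))

    def match(self, s: Optional[str] = None, p: Optional[str] = None,
              o: Optional[str] = None) -> Iterator[Triple]:
        """Yield triples matching the pattern; None acts as a wildcard."""
        for t in sorted(self._triples):
            if ((s is None or t[0] == s)
                    and (p is None or t[1] == p)
                    and (o is None or t[2] == o)):
                yield t


store = TinyTripleStore()
# Hypothetical identifiers; a real store would use full URIs.
store.add("ex:water", "ex:hasFormula", "H2O")
store.add("ex:methane", "ex:hasFormula", "CH4")
# The object of one triple can be the subject of others: that is the linking.
store.add("ex:sample42", "ex:isSampleOf", "ex:water")

# All formulas, keyed by compound identifier.
formulas = {s: o for s, _, o in store.match(p="ex:hasFormula")}
```

A new kind of statement only needs a new predicate name, with no change to the store itself.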

Slide 4: He is charged with expressing contempt for meta-data

Slide 5: Permanent, documented and primary record of laboratory observations

Slide 6: Observations are never collected on note pads, filter paper or other temporary paper for later transfer into a notebook.
If you are caught using the “scrap of paper” technique, your improperly recorded data may be confiscated by your TA.

Slide 7: Digital record at source
Don’t try to add metadata after the fact

Slide 8: Record the chemical processes as well as the data in RDF
Physical World, RDF
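One way to picture recording the process alongside the data is sketched below: each laboratory step becomes a node with a type, an input and an output, so the history of any result can be walked back to the starting material. This is a sketch under stated assumptions, not the project's schema: all `ex:` names are hypothetical, and each step is assumed to have exactly one input and one output.

```python
# Hypothetical process record: three steps leading to one spectrum.
triples = [
    ("ex:step1", "ex:type", "ex:Weigh"),
    ("ex:step1", "ex:hasInput", "ex:reagentA"),
    ("ex:step1", "ex:hasOutput", "ex:portionA"),
    ("ex:step2", "ex:type", "ex:Dissolve"),
    ("ex:step2", "ex:hasInput", "ex:portionA"),
    ("ex:step2", "ex:hasOutput", "ex:solutionA"),
    ("ex:step3", "ex:type", "ex:Measure"),
    ("ex:step3", "ex:hasInput", "ex:solutionA"),
    ("ex:step3", "ex:hasOutput", "ex:spectrum1"),
]

# Index the triples by role (assumes one input and one output per step).
produced_by = {o: s for s, p, o in triples if p == "ex:hasOutput"}
consumed = {s: o for s, p, o in triples if p == "ex:hasInput"}
kinds = {s: o for s, p, o in triples if p == "ex:type"}


def history(item):
    """Return the process types that led to `item`, earliest first."""
    chain = []
    while item in produced_by:
        step = produced_by[item]
        chain.append(kinds[step])
        item = consumed.get(step)  # None ends the walk
    return list(reversed(chain))


spectrum_history = history("ex:spectrum1")
```

Here `spectrum_history` lists the weigh, dissolve and measure steps in the order they were carried out, while a starting material such as `ex:reagentA` has an empty history.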

Slide 9: Old technology does not scale
Problems with relational databases
– information too variable and rapidly changing types
– multimedia, images are the output of current experiments

Slide 10: Create large semantically rich database of structures and properties
URI - InChI
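The "URI - InChI" pairing can be sketched as follows: because an InChI string is derived from the structure itself, percent-encoding it onto a base address gives an identifier that anyone can compute independently. The base `http://example.org/compound/` and both function names are assumptions made for illustration; the slides do not say how the project actually formed its URIs.

```python
from urllib.parse import quote, unquote

# Hypothetical namespace; not taken from the presentation.
BASE = "http://example.org/compound/"


def inchi_to_uri(inchi: str) -> str:
    """Percent-encode the whole InChI so '/' and '=' cannot be read as URI syntax."""
    return BASE + quote(inchi, safe="")


def uri_to_inchi(uri: str) -> str:
    """Recover the InChI from a URI built by inchi_to_uri."""
    if not uri.startswith(BASE):
        raise ValueError("not a compound URI in the assumed namespace")
    return unquote(uri[len(BASE):])


water = "InChI=1S/H2O/h1H2"  # standard InChI for water
water_uri = inchi_to_uri(water)
```

Two laboratories describing the same compound would arrive at the same `water_uri` without consulting a central registry, which is what makes the identifier useful as the subject of RDF statements.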

Slide 11: Property in RDF
1.12Beta/C12H13NO2/c (14)12-10(9) /h1H3,2H2,3-7H,8H2,14H C12H13NO
Currently testing on 200,000 compounds, but about to go up by an order of magnitude
3Store is a scalable solution

Slide 12: You see that dark spooky image on the screen? That’s your credit history coming back to haunt you?
Provenance
Record experiments
Make data available (e-crystals, e-Bank)

Slide 13: Security and trust for experiments and data
Experiments on the Grid
National Crystallography Service

Slide 14: Chemistry Data
Private or public, open or controlled access

Slide 15: Subversive and furtive exploitation of data
Data, CAS, PubMed, CML, RDF

Slide 16: E-Bank, E-crystals, R4L

Slide 17: Standards? Interoperable? Convertible? Useable?

Slide 18: Linking Chemistry to the Life Sciences and the Environment
Need to link up small and large molecule chemistry
– Bio-Informatics
– Medical informatics
Need to link in place and time
– Environmental Informatics
– Spatial-Temporal issues at a cellular and organism level
Statistical Modelling

Slide 19: Making sure Chemistry will not suffer from a data crunch
All I’m saying is that now is the time to develop the technology to deflect an asteroid