ICAT + Information Model
Brian Matthews, Scientific Information Group, E-Science Centre, STFC Rutherford Appleton Laboratory


Facilities Process
- Proposal: scientist submits application for beamtime
- Approval: facility committee approves application
- Scheduling: facility registers, trains, and schedules the scientist's visit
- Experiment: scientist visits; facility runs the experiment
- Data storage: raw data filtered and stored
- Data analysis: tools for processing made available
- Record publication: subsequent publication registered with facility

Core Scientific Metadata Model (CSMD)
The Core Metadata model forms the information model for ICAT. Designed to describe facilities-based experiments in Structural Science.
Entities: Investigation, Publication, Keyword, Topic, Sample, Sample Parameter, Dataset, Dataset Parameter, Datafile, Datafile Parameter, Investigator, Related Datafile, Parameter, Authorisation
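The relationships among a few of these entities can be sketched in Python. This is only an illustration, assuming that an Investigation contains Datasets, that each Dataset contains Datafiles, and that Parameters attach to datasets and datafiles. All class names, field names, and example values are hypothetical and do not reproduce the ICAT schema or API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Parameter:
    # Hypothetical name/value/units triple describing a dataset or datafile.
    name: str
    value: float
    units: str = ""


@dataclass
class Datafile:
    name: str
    location: str
    parameters: List[Parameter] = field(default_factory=list)


@dataclass
class Dataset:
    name: str
    datafiles: List[Datafile] = field(default_factory=list)
    parameters: List[Parameter] = field(default_factory=list)


@dataclass
class Investigation:
    title: str
    investigators: List[str] = field(default_factory=list)
    keywords: List[str] = field(default_factory=list)
    datasets: List[Dataset] = field(default_factory=list)

    def all_datafiles(self) -> List[Datafile]:
        # Flatten the assumed Investigation -> Dataset -> Datafile containment.
        return [f for ds in self.datasets for f in ds.datafiles]


def build_example() -> Investigation:
    # Invented example values, for illustration only.
    temperature = Parameter("temperature", 290.0, "K")
    raw_file = Datafile("run_0001.raw", "/data/run_0001.raw", [temperature])
    raw = Dataset(name="raw", datafiles=[raw_file])
    return Investigation(
        title="Example investigation",
        investigators=["A. Scientist"],
        keywords=["diffraction"],
        datasets=[raw],
    )
```

Calling build_example().all_datafiles() walks the containment hierarchy and returns every datafile recorded under the investigation.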

ICAT Deployment
Diagram labels:
- RDBMS
- Web Services API
- ICAT API
- Command Line Tools
- Glassfish / JBOSS
- Java, C++, Fortran
- Data Storage / Delivery System
- Single Sign On
- User Database System
- Proposal System
- Publication System
- e-Science Services
- Software Repository

I2S2: Methodology
Mapping across organisational infrastructures
- Proposals: Once you are awarded beamtime at ISIS, an entry is created in ICAT that describes your proposed experiment.
- Experiment: Data collected from your experiment will be indexed by ICAT (with additional experimental conditions) and made available to your experimental team.
- Analysed Data: You will be able to upload any analysed data and associate it with your experiments.
- Publication: Using ICAT you will also be able to associate publications with your experiment and even reference data from your publications.
Diagram labels: Home Institution; Central Facility; Example ISIS Proposal; B-lactoglobulin protein interfacial structure; GEM: High intensity, high resolution neutron diffractometer; H2-(zeolite) vibrational frequencies vs polarising potential of cations

Earth Sciences: typical workflow Martin Dove & Erica Yang

CSMD: an established starting point
CSMD: Core Scientific MetaData model
- Designed to describe facilities-based experiments in Structural Science
- Forms the information model for ICAT, a production data management infrastructure employed by STFC
- Forms the basis for extensions:
  - To derived data
  - To laboratory based science
  - To secondary analysis data
  - To preservation information
  - To publication data
Entities: Investigation, Publication, Keyword, Topic, Sample, Sample Parameter, Dataset, Dataset Parameter, Datafile, Datafile Parameter, Investigator, Related Datafile, Parameter, Authorisation

CSMD-Core
A Core CSMD
- Removes much of the facility-specific detail
- A simple model of datasets

I2S2-IM: Core Layer
- Entities taken from the CSMD, extended with a software execution entity so that relationships between data sets can be recorded
- Working with ORE-CHEM
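One way to picture how a software execution relates data sets is as a record naming its input and output data sets, from which the lineage of any data set can be traced. The sketch below is a minimal illustration under that assumption. SoftwareExecution, upstream, and the data set names are hypothetical and are not part of I2S2-IM, CSMD, or ICAT.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class SoftwareExecution:
    # Hypothetical record: one run of a program linking input and output data sets.
    software: str
    inputs: Tuple[str, ...]
    outputs: Tuple[str, ...]


def upstream(dataset: str, executions: List[SoftwareExecution]) -> Set[str]:
    """Return every data set that `dataset` was derived from, directly or indirectly."""
    # Index each output data set by the execution that produced it.
    produced_by: Dict[str, SoftwareExecution] = {}
    for execution in executions:
        for out in execution.outputs:
            produced_by[out] = execution

    seen: Set[str] = set()
    stack = [dataset]
    while stack:
        current = stack.pop()
        execution = produced_by.get(current)
        if execution is None:
            # No recorded execution produced this data set (e.g. raw data).
            continue
        for source in execution.inputs:
            if source not in seen:
                seen.add(source)
                stack.append(source)
    return seen


# Invented example: raw data is reduced, then the reduced data is fitted.
EXAMPLE_EXECUTIONS = [
    SoftwareExecution("reduce", ("raw",), ("reduced",)),
    SoftwareExecution("fit", ("reduced",), ("model",)),
]
```

With these example records, upstream("model", EXAMPLE_EXECUTIONS) follows the chain back through "reduced" to "raw", while a raw data set has no upstream sources.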

Software execution

Model with Data Derivation
- Extension to the model to add an alternative Investigation activity type
  - Very straightforward, natural extension to the model
- ICAT can be used almost without modification to record data derivation
  - Just another data generation activity
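The idea that derivation is "just another data generation activity" can be illustrated with a short Python sketch in which the same record type and the same registration call serve both experiments and derivations, differing only in an activity type tag. ActivityType, Activity, and record are hypothetical names, and the in-memory list merely stands in for a catalogue; none of this is the ICAT API.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class ActivityType(Enum):
    # Hypothetical activity types: the original experiment plus the derivation extension.
    EXPERIMENT = "experiment"
    DERIVATION = "derivation"


@dataclass
class Activity:
    title: str
    kind: ActivityType
    datasets: List[str] = field(default_factory=list)


def record(catalogue: List[Activity], title: str, kind: ActivityType,
           datasets: List[str]) -> Activity:
    # One code path for every activity type: only the tag differs.
    activity = Activity(title, kind, list(datasets))
    catalogue.append(activity)
    return activity


def demo() -> List[Activity]:
    catalogue: List[Activity] = []
    record(catalogue, "Beamline run", ActivityType.EXPERIMENT, ["raw"])
    record(catalogue, "Data reduction", ActivityType.DERIVATION, ["reduced"])
    return catalogue
```

Because both activity types share one record shape, nothing in the registration logic has to change when derivations are added, which is the sense in which the catalogue is used "almost without modification".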