Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,

Slides:



Advertisements
Similar presentations
EPrints - Introducing EPrints 3 Software William J Nixon Digital Library Development Manager, University of Glasgow With many thanks to Les Carr and the.
Advertisements

IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
AHM, Nottingham, September eBank UK : linking research data, scholarly communication and learning. Dr Liz Lyon, UKOLN, University of Bath Dr Simon.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
Linking Data and Publications: the Chemistry Way Simon Coles School of Chemistry, University of Southampton, U.K. CLADDIER workshop.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
RCUK, Octiber Archiving research data and research publications. Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton Dr Simon.
© S.J. Coles 2006 eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data Simon Coles School of Chemistry, University of Southampton,
UKOLN is supported by: Enhanced support for eScience: the role of Digital Libraries Digital Libraries Go eScience, ECDL, Alicante September 2006 Rachel.
A centre of expertise in digital information management UKOLN is supported by: Digital libraries and digital scholarship: changing roles.
A centre of expertise in digital information management UKOLN is supported by: Adding Value to Data and Information: Moving towards a Science.
A centre of expertise in digital information management UKOLN is supported by: Dealing with Data: Perspectives on Progress to Date Dr Liz.
A centre of expertise in digital information management UKOLN is supported by: Dealing with Data: One Year On Dr Liz Lyon, Director, UKOLN,
UKOLN is supported by: From research data to new knowledge: a lifecycle approach. Dr Liz Lyon, Director UKOLN, University of Bath, UK JISC/SURF/CNI Conference.
A centre of expertise in digital information management UKOLN is supported by: Curating the Scientific Record: The Challenges Ahead Dr.
Digital | Curation | Centre Adding value to open access research data: reflections on the process of data curation Dr Liz Lyon, DCC Associate Director.
A centre of expertise in digital information management UKOLN is supported by: Dealing with Data: Roles, Rights, Responsibilities & Relationships.
Integrating research data into the publication workflow: eBank UK experience Rachel Heery, UKOLN, University of Bath
Data Curation in Crystallography: Publisher Perspectives JISC Data Cluster Consultation Workshop CCLRC, Didcot, Oxon 10 October 2006.
UKOLN is supported by: Digital Libraries and e-Research: new horizons, new challenges? Dr Liz Lyon, Director UKOLN, University of Bath, UK 8 th International.
UKOLN is supported by: Digital Libraries and e-Research: a UK perspective on a changing landscape. Dr Liz Lyon, Director UKOLN, University of Bath, UK.
UKOLN is supported by: eBank UK : linking research data, scholarly communications and learning. Dr Liz Lyon, UKOLN, University of Bath, UK JISC CNI Conference.
UKOLN is supported by: Data, information and knowledge repositories: developing infrastructure to support the e-Research landscape. Dr Liz Lyon, Director.
JISC Joint Programmes Meeting eBank UK : linking research data, learning and scholarly communications. Dr Liz Lyon, UKOLN, University of Bath Dr.
A centre of expertise in digital information management UKOLN is supported by: Digital repositories as research infrastructure: a UK perspective.
Digital | Curation | Centre An Introduction to the UK Digital Curation Centre Dr Liz Lyon, DCC Associate Director Outreach Director, UKOLN, University.
UKOLN is supported by: Adding value to open access research data: the eBank UK Project. Dr Liz Lyon, Director UKOLN, University of Bath, UK OAI4, CERN.
A centre of expertise in digital information management UKOLN is supported by: British Academy e-Resources Policy Review: UKOLN Report.
UKOLN is supported by: Emergent technologies & digitisation: the institutional impact. Liz Lyon & Kevin Edge VCs Retreat, October a.
A centre of expertise in digital information management UKOLN is supported by: Data Informatics Top Ten : (for Libraries) Dr Liz Lyon,
A centre of expertise in data curation and preservation 2 nd International Digital Curation Conference, November 2006 Reflections on open scholarship:
Federation The eCrystals Federation Dr Simon Coles, University of Southampton, UK Dr Liz Lyon, UKOLN, University of Bath, UK Open Repositories 2008, University.
Federation eCrystals Federation: Open Repositories for Open Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
A centre of expertise in digital information management UKOLN is supported by: C21st Scholarship: Data as an Agent for Change Dr Liz Lyon,
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
A centre of expertise in digital information management UKOLN is supported by: Research Data & Institutions Roles & Responsibilities? Dr.
A centre of expertise in digital information management UKOLN is supported by: Digital Futures for MLAs? A snapshot in real time. Dr Liz.
A centre of expertise in digital information management UKOLN is supported by: UKOLN Update on Selected Activities Dr Liz Lyon, Director,
A centre of expertise in digital information management UKOLN is supported by: Memory institutions and the social fabric of the Web Dr.
UKOLN is supported by: Enhancing access to research data: the challenge of crystallography Rachel Heery, Monica Duke, Michael Day UKOLN, University of.
© S.J. Coles 2006 Institutional Data Repositories for Chemistry Simon Coles School of Chemistry, University of Southampton, U.K.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
I2S2 - Infrastructure for Integration in Structural Sciences Information Model Development Workshop RAL 11 th February 2010
A centre of expertise in digital information management UKOLN is supported by: Dealing with the Data Cloud Dr Liz Lyon, Director, UKOLN,
A centre of expertise in digital information management UKOLN is supported by: Open Science and the Research Library: Roles, Challenges.
Digital | Curation | Centre Supporting Digital Curation to safeguard research data: adding value today and ensuring long-term access Dr Liz Lyon, DCC Associate.
EBank UK CCLRC Workshop February eBank and CCLRC Workshop February 2005 University of Bath.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
University of Southampton, U.K.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K.
© S.J. Coles 2005 eChemInfo2005 Open Archives as a Route for Capture, Dissemination and Access to Chemical Data and Information Simon Coles School of Chemistry,
Implementing DOIs for Data DataPool supporting institutional service development Simon Coles, Andrew Milsted, Wendy White Jisc Managing Research Data Programme.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
CombeDay Making Data Openly Available Simon Coles.
UKOLN is supported by: Library futures in the new research landscape. Dr Liz Lyon, UKOLN, University of Bath, UK CURL Members Meeting October 2004, London.
eCrystals Federation: Open Repositories for global Open Science
eCrystals Federation: Open Repositories for Open Science
JISC Joint Programmes Meeting 2005
Bird of Feather Session
Developing Institutional Data Repositories
eCrystals Federation: Open Repositories for global Open Science
Presentation transcript:

Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton, UK Chemical Informatics Workshop, Manchester, March 2008 This work is licensed under a Creative Commons Licence Attribution-ShareAlike 3.0

Themes 1.Context: Institutional data repositories crystallography exemplar 2.Scale: repository federations 3.Longevity: Digital curation and preservation 4.Integration: Semantic challenges

eBank Project – building the eCrystals Data Repository ePrints Southampton Institutional Repository exemplar Embedded in workflow Started Sept 2003 Scholarly knowledge cycle context UKOLN-led interdisciplinary team

Scaling Up Report Phase 3 findings: Data policy should reflect lab practice & institutional model Diverse lab practice LIMS proprietary formats Data quality criteria/validation Prior publication problem We need automated assignment of terms for data discovery No discipline preservation model

nλ = 2 d sinθ The

eCrystals Repository ePrints.org v3.0

Repository Foundations Using simple Dublin Core Crystal structure Title (Systematic IUPAC Name) Authors Affiliation Creation Date Additional chemical information through Qualified Dublin Core Empirical formula International Chemical Identifier (InChI) Compound Class & Keywords Specifies which datasets are present in an entry Application Profile DOI links Rights & Citation Learned society + subject repository support

Federation interoperability & linking services Roll-out in 2 phases led by University of Southampton Establish Federation policies, application profile, mappings Bi-directional links with derived articles in publisher repositories, IUCr, Royal Society of Chemistry (RSC), Chemistry Central: scholarly knowledge cycle StOReLink project - Test linking options: StORe middleware and CLADDIER OAI-ORE Testbed eChemistry project

Laboratory practice & workflow Community standard CIF Mixed lab practice – central service facility versus single staff crystallographer in department Achieve end-to-end workflow Challenge of instrument manufacturers with proprietary formats Repository Lite for smaller lab operations? X-ray diffractometers

eBank-UK Phase 3 Curation & Preservation Study: Sustainability issues uk/curation/ Examined four main areas 1.Audit and certification (TRAC, DRAMBORA, NESTOR, ISO International repository audit and certification BOF Group) 2.The Open Archival Information System (OAIS) and Representation Information (RI) 3.eBank-UK application profile and preservation metadata 4.ePrints.org repository platform Recommendations: Self-assessment using DRAMBORA Consider Representation Information in wider context Develop preservation strategy Capture preservation metadata - PREMIS

Crystallographic schema underpins CIF (Crystallographic Information Framework), but is limited to data parameters e.g. cell_length_a Semantic issues

IUCr Acta Cryst 1992 Limited set of keywords describing methods, properties & applications, compounds, attributes No established crystallography dictionary or controlled vocabulary to give chemistry context

What do we want to do? Support depositors keyword/term assignment Facilitate and improve automated indexing Support advanced search / browse Allow metadata validation & enhancement Apply across a heterogeneous Federation Cross search, cross browse functionality Link data to all associated digital objects Develop domain semantics / vocabulary Use domain-specific authority files Mine to discover rather than find Achieve full inter-disciplinary integration

Some (semantic) issues….. How are terms assigned? Informal tags and/or structured KOS? How is a vocabulary curated and maintained? Can a vocabulary be transformed into a (Semantic Web related understanding) ontology? Disambiguation, acronyms, IUPAC names Persistent identification for data citation Granularity of data citation Data (and metadata) quality, provenance, validation Embedding within complex workflows Use collaborative social approaches? Community adoption: becomes part of the culture

Federation Questions? Slides will be available at : This work is licensed under a Creative Commons Licence Attribution-ShareAlike 3.0