Digital | Curation | Centre Supporting Digital Curation to safeguard research data: adding value today and ensuring long-term access Dr Liz Lyon, DCC Associate Director Outreach Director, UKOLN, University of Bath, UK Funded by: This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 JISC Conference March 2006
Digital | Curation | Centre 2 Overview Digital curation and the e-Research cycle UK Digital Curation Centre –Development activity –Research agenda –Advisory services –Outreach programme Chemistry exemplar projects maintaining and adding value to a trusted body of digital information for current and future use
Digital | Curation | Centre 3 UK Digital Curation Centre Development activities Research agenda Delivering services Outreach Programme
Digital | Curation | Centre 4 DCC people (some of them…) Management & Co-ordination –Director Chris Rusbridge (University of Edinburgh) Community Support & Outreach –Led by Dr Liz Lyon (UKOLN, University of Bath) Service Definition & Delivery –Led by Professor Seamus Ross (HATII, University of Glasgow) Development –Led by Dr David Giaretta (Astronomical Software & Services, CCLRC) Research –Led by Professor Peter Buneman (University of Edinburgh)
Digital | Curation | Centre 5 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection & capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage & validation: description, deposit, self-archiving, preservation, certification Data processing This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0
Digital | Curation | Centre 6 Data capture & integration into research workflows R4L Repository for the Laboratory Project (JISC-funded) automated data capture from instrumentation, deposit of results (chemistry) SMART TEA electronic Laboratory notebook + annotations
Digital | Curation | Centre 7 Disciplinary data-centres
Digital | Curation | Centre 8 eBank UK Project Two key themes: –Open access to datasets –Linking research data to publications and to learning UKOLN, University of Southampton, University of Manchester e-Science application Combechem : Grid-enabled combinatorial chemistry + National Crystallography Service Resource Discovery Network / PSIgate physical sciences portal
Digital | Curation | Centre 9 A data repository entry
Digital | Curation | Centre 10 Access to the underlying data: complex objects ecrystals.chem.soton.ac.uk
Digital | Curation | Centre 11 Data descriptions Validation, publication & discovery of data models & schema Metadata packaging standards –METS –MPEG 21 DIDL Semantic descriptions –Formal controlled vocabularies –High-level and domain ontologies –Inter-disciplinary discovery Informal approaches Web 2.0 folksonomies
Digital | Curation | Centre 12 Audit & certification: trusted digital repositories DCC Development & Services teams Draft Audit Checklist for Certification August 2005 Research Libraries Group RLG-NARA Pilot audits planned –Koninklijke Bibliotheek (KB) –British Atmospheric Data Centre (BADC) –JISC Digital Repository projects? –Institutional repositories? Revised Checklist based on feedback and pilot audit outcomes
Digital | Curation | Centre 13 Development: Representation Information Registry DCC Approach to Digital Curation based on the Reference Model for an Open Archival Information System (OAIS); ISO standard, 14721: Development of a Representation Information (RI) registry/repository (DCC-RR) Prototype demonstrator: based on 2 key concepts to facilitate sharing of the curation effort –Curation persistent ID –Descriptive label (structural, semantic, other metadata) Development of tools and interfaces for creating, using and re-using representation information for details of Wiki and list
Digital | Curation | Centre 14 Persistent identifiers for data citation Warwick Workshop research issue Schemes: DOI, Handle, ARK, PURL Global identification: express as http URIs eBank data citation policy (human and machine-actionable) Domain identifiers: e.g. International Chemical Identifier (INChI) codes
Digital | Curation | Centre 15 Discovering data: Coles, S.J., Day, N.E., Murray-Rust, P., Rzepa, H.S., Zhang, Y., Org. Biomol. Chem., 2005, (10), DOI: /b502828k Domain identifier: International Chemical Identifier (INChI) code Google molecule using INChI Slide from Simon Coles
Digital | Curation | Centre 16 Adding value: eBank linking data to publications
Digital | Curation | Centre 17 Linking research to learning - embedding eBank aggregator service in a science portal for student learners
Digital | Curation | Centre 18 Adding value through annotation DCC Research at the University of Edinburgh Scientific databases: Annotation scoping report AstroDAS: distributed annotation servers in astronomy New annotation model + prototype MONDRIAN: top- ranked demonstration at recent DB conference
Digital | Curation | Centre 19 Supporting the community: Services legal - technical guidance Curation Manual 45 chapters planned, Briefing Papers Case studies
Digital | Curation | Centre 20 DCC Case Study published: Wide Field Astronomy Unit
Digital | Curation | Centre 21 Supporting the community: Outreach & Services Workshops: LOCKSS 6 April Warwick Archiving 24 April, Newcastle Associates Network 17 May, NeSC, Edinburgh Digital Curation Policies, June, Oxford tbc Data dictionary for Preservation Metadata (PREMIS), July tbc Information Days 2006 Nottingham, Birmingham, Manchester 2 nd International Conference November Glasgow Keynotes: Hans F. Hoffmann, CERN, Clifford Lynch, CNI Call for papers deadline 3 rd April
Digital | Curation | Centre 22 Associates Network Goals: Develop understanding, share best practice, advance research, promote recognition, develop consensus 376 Members and growing……. Benefits: Early access to R&D outputs, advisory services, training, input to definition and design, community participation Discussion Forum Topics: formats at risk, Creative Commons, digital archives and digital libraries Meeting 17 NeSC, Edinburgh Please join us!
Digital | Curation | Centre Thank you. Questions? Join the DCC Associates Network at