Download presentation
Presentation is loading. Please wait.
Published byAnthony Fitzgerald Modified over 10 years ago
1
AHM, Nottingham, September 20041 eBank UK : linking research data, scholarly communication and learning. Dr Liz Lyon, UKOLN, University of Bath Dr Simon Coles, School of Chemistry, University of Southampton
2
AHM, Nottingham, September 20042 Overview In context: scholarly communications –Open Access –Data, information, workflows and provenance The data publication bottleneck –e-Science and crystallography –Comb-e-chem Project eBank UK –Information architecture and data flow –Interoperability issues Challenges for the future
3
Scholarly communications
4
AHM, Nottingham, September 20044 Current chemistry publishing protocols Ideas and interpretations Results & derived data Hooks into the literature Raw data!
5
AHM, Nottingham, September 20045
6
AHM, Nottingham, September 20046
7
AHM, Nottingham, September 20047 It is envisaged that the sharing of primary data would prevent unnecessary repetition of experiments and enable scientists to build directly on each others work, creating greater efficiencies and productivity in the research process. The government line
8
AHM, Nottingham, September 20048 Research & e-Science workflows Aggregator services: national, commercial Repositories : institutional, e-prints, subject, data, learning objects Data curation: databases & databanks Validation Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Searching, harvesting, embedding Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding Linking The scholarly knowledge cycle. Liz Lyon, eBankUK article. Ariadne, July 2003.
9
AHM, Nottingham, September 20049 Learning & Teaching workflows Aggregator services: national, commercial Repositories : institutional, e-prints, subject, data, learning objects Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Harvesting metadata Resource discovery, linking, embedding Peer-reviewed publications: journals, conference proceedings Validation Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals
10
AHM, Nottingham, September 200410 Learning & Teaching workflows Research & e-Science workflows Aggregator services: national, commercial Repositories : institutional, e-prints, subject, data, learning objects Data curation: databases & databanks Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Validation Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Resource discovery, linking, embedding Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding Linking
11
AHM, Nottingham, September 200411 Learning & Teaching workflows Research & e-Science workflows Aggregator services: eBank UK Repositories : institutional, e-prints, subject, data, learning objects Data curation: databases & databanks Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Validation Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Resource discovery, linking, embedding Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding Linking
12
The Data Publication Bottleneck
13
AHM, Nottingham, September 200413 Data Overload! How do we disseminate? EPSRC National Crystallography Service The data deluge
14
AHM, Nottingham, September 200414 CombeChem: An EPSRC pilot project X-Ray e-Lab Analysis Properties Properties e-Lab Simulation Video Diffractometer Grid Middleware Structures Database
15
AHM, Nottingham, September 200415 Grid E-Scientists Entire E-Science Cycle Encompassing experimentation, analysis, publication, research, learning 5 Institutional Archive Local Web Publisher Holdings Digital Library E-Scientists Graduate Students Undergraduate Students Virtual Learning Environment E-Experimentation E-Scientists Technical Reports Reprints Peer- Reviewed Journal & Conference Papers Preprints & Metadata Certified Experimental Results & Analyses Data, Metadata & Ontologies
16
The eBank UK Project
17
AHM, Nottingham, September 200417 eBank UK project JISC-funded for 1 year from September 2003 UKOLN at the University of Bath (lead), University of Southampton, University of Manchester Building the links between research data, scholarly communication and learning Exemplar: e-Science testbed Combechem –Grid-enabled combinatorial chemistry –Crystallography, laser and surface chemistry examples –Development of an e-Lab using pervasive computing technology –National Crystallography Service Resource Discovery Network / PSIgate physical sciences portal http://www.ukoln.ac.uk/projects/ebank-uk/
18
AHM, Nottingham, September 200418 The project team UKOLN Michael Day Monica Duke Rachel Heery Liz Lyon + Andy Powell Southampton Les Carr Simon Coles Jeremy Frey Chris Gutteridge Mike Hursthouse Manchester John Blunden-Ellis
19
AHM, Nottingham, September 200419 First steps: establishing common ground… Understand the data creation process Terminology and definitions –Data –Metadata –Datafile –Dataset –Data holding Different views –Digital library researchers, computer scientists, chemists –Generic vs specific –Modeller vs practitioner Aim for a common ontology Modelling the domain Creating a metadata schema
20
AHM, Nottingham, September 200420 Progress update Version 2.0 eBank metadata schema Enhanced ePrints.org software Pilot institutional e-data repository for harvesting (raw, derived, results data) Exports records as ebank_dc and oai_dc Validation of schema Pilot eBank UK aggregator service Developing search interface Version 1.0 Testing with PSIgate physical sciences portal – embedding eBank UK
21
AHM, Nottingham, September 200421 Crystallography workflow Initialisation: mount new sample on diffractometer & set up data collection Collection: collect data Processing: process and correct images Solution: solve structures Refinement: refine structure CIF: produce CIF (Crystallographic Information File format) Report: generate Crystal Structure Report RAW DATADERIVED DATARESULTS DATA
22
AHM, Nottingham, September 200422 Deposition into the archive
23
AHM, Nottingham, September 200423 An Archive entry ecrystals.chem.soton.ac.uk For a demo come to the JISC booth! Today @ 13:00 & during tea
24
AHM, Nottingham, September 200424 All the way back to the underlying data…
25
AHM, Nottingham, September 200425 Some metadata issues Using simple and qualified Dublin Core Additional chemical information in schema for harvesting e.g. empirical formula Schema contains International Chemical Identifier (InChI) Links to all datasets associated with an experiment Links to individual datasets within an experiment Links to eprints (and other published literature) derived from the data Using vocabularies specific to crystallography Engaging the broader scientific community to ensure different schemas are compliant and standards can emerge
26
AHM, Nottingham, September 200426 ebank_dc record (XML) Crystal structure (data holding) Crystal structure report (HTML) Dataset Institutional repository Deposit Dataset dc:identifier dcterms:references Linking dc:type=CrystalStructure and/or Collection Model input Andy Powell, UKOLN. Eprint oai_dc record (XML) dcterms:isReferencedBy dc:type=Eprint and/or Text Data flow in eBank Eprint jump-off page (HTML) dc:identifier Eprint manifestation (e.g. PDF) Linking
27
AHM, Nottingham, September 200427 ebank_dc record (XML) Crystal structure (data holding) Crystal structure report (HTML) Dataset Institutional repository eBank UK aggregator service ePrint UK aggregator service Subject service Deposit Harvesting OAI-PMH ebank_dc Harvesting OAI-PMH oai_dc Dataset dc:identifier dcterms:references Linking dc:type=CrystalStructure and/or Collection Model input Andy Powell, UKOLN. Eprint oai_dc record (XML) dcterms:isReferencedBy dc:type=Eprint and/or Text Data flow in eBank Eprint jump-off page (HTML) dc:identifier Eprint manifestation (e.g. PDF) Linking
28
AHM, Nottingham, September 200428 ebank_dc record (XML) Crystal structure (data holding) Crystal structure report (HTML) Dataset Institutional repository eBank UK aggregator service ePrint UK aggregator service Subject service Deposit Harvesting OAI-PMH ebank_dc Harvesting OAI-PMH oai_dc Searching, linking and embedding Dataset dc:identifier dcterms:references Linking dc:type=CrystalStructure and/or Collection Model input Andy Powell, UKOLN. PSIgate portal Eprint oai_dc record (XML) dcterms:isReferencedBy dc:type=Eprint and/or Text Data flow in eBank Eprint jump-off page (HTML) dc:identifier Eprint manifestation (e.g. PDF) Linking
29
AHM, Nottingham, September 200429 Harvesting: OAIster
30
AHM, Nottingham, September 200430 Linking and aggregating: Search & discover For a demo come to the JISC booth! Today @ 13:00 & during tea or the buffet
31
AHM, Nottingham, September 200431 Linking and aggregating: Hit browsing
32
AHM, Nottingham, September 200432 And finally… eBank embedded in a science portal
33
AHM, Nottingham, September 200433 Currently we are…… Assessing outcomes of a Consultation Workshop held in August e.g. –Cost-benefit issues for researchers? –RAE / assessment impact? –Disciplinary differences? Presenting a demonstrator Completing supporting studies on (1) Provenance and (2) Data models and schema Promoting Open Access and Open eData Archives to international crystallographic organisations, publishers, learned societies Phase 2 proposal funding sought for further 12 months
34
Challenges for the future
35
AHM, Nottingham, September 200435 Phase 2 plan…….(1) Continue to progress towards generic metadata schemas Validation against other schema –CLRC Scientific Metadata Model Modify Eprints.org software to allow for more generic scientific data and schemas Metadata enhancement: subject keyword additions based on knowledge of keywords in related publications Investigate identifiers e.g. International Chemical Identifier (InChI code) Explore context sensitive linking: find me –Datasets by this person; Journal articles by this person; Datasets related to this subject; Journal articles on this subject; Learning objects by this person; Learning objects on this subject
36
AHM, Nottingham, September 200436 Phase 2…….(2) Full embedding into the crystallographic research and publishing communities Chemistry workflow embedding –SMART TEA e synthesis Lab –Other analytical techniques in chemistry e-Learning embedding and pedagogic evaluation –Undergraduate chemical informatics courses –Introduction to visiting schools Expand into other physical, mathematical, geological and engineering sciences Feasibility study in related domains – bio and medical sciences Feasibility study in unrelated domains – arts and humanities
37
Thank you. Questions?…..
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.