David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK SPIDER Project Launch Meeting.

Slides:



Advertisements
Similar presentations
David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK The Dryad-UK vision © David Shotton,
Advertisements

Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
OUP in support of digital libraries Main objectives Historical Context Why Xml ? Librarian Resource Centre Oxford Index Marzena Giers Fidler 5 th June.
The Research Workflow Revolution: The Impact of Web 2.0 And Emerging Networking Tools On Research Workflow Bill Russell Communications Director 4 th April.
UKSG: April African Journals OnLine (AJOL) Supporting indigenous published research Pippa Smart INASP
Trends in Scientific Publishing Guenther Eichhorn DirectorAbstracting & Indexing Cambridge, MA April 2010.
Queensland University of Technology CRICOS No J How can a Repository Contribute to University Success? APSR - The Successful Repository June 29,
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2005.
DEVELOPMENT OF A EUROPEAN NETWORK OF LIBRARIES Hans Geleijnse Director of Library and IT Services & CIO Tilburg University, The Netherlands.
Figures for ADMIRAL Project grant application These figures are copyright © David Shotton, University of Oxford, They are made available for reuse.
Image BioInformatics Research Group Department of Zoology University of Oxford, UK Semantic Web Applications and Tools for Life.
Highlights from the Open Access Timeline (1) 1971, Project Gutenberg launched on the Internet (originally as an FTP site). There are now 18,000 free books.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Build VIVO in the Cloud NIH Workshop on Value Added Services for VIVO Brand Niemann Semantic Community March 25-26,
Community of Science The Leading Internet Site for Researchers Worldwide
Wellcome Trust - Funding the best science
New services and Springer Guenther Eichhorn Director, Abstracting & Indexing Cambridge, MA (April 2010)
DEVA Data Management Workshop Devil’s Hole Pupfish Project Data Management Workshop Devil’s Hole Pupfish Program Death Valley National Park Introduction.
David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK CITATION NETWORK ANALYSIS © David.
David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK SPIDER Project Launch Meeting.
American Medical Association Journals include: JAMA (journal of the American Medical Association.), Archives of surgery, Archives of ophthalmology and.
GOVERNANCE ELECTRONIC. ” “ E-Governance is the application of Information and Communication Technology (ICT) for delivering government services, exchange.
New Crossroads Transitions & Transformations Science Librarians in the 21st Century Mary M. Case University of Illinois at Chicago.
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
BY OBAJE, ALFRED MICHAEL (DBA., B.SC., M.L.S.) MEDICAL SCIENCES LIBRARIAN 1 Library E-resources Sensitization and Demonstrations for 400 Level Medical.
Management, marketing and population of repositories Morag Greig, University of Glasgow.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Data Infrastructures Opportunities for the European Scientific Information Space Carlos Morais Pires European Commission Paris, 5 March 2012 "The views.
Open access in action: Imperial College’s Digital Repository Deborah Shorley Director of Library Services, Imperial College London LIRGroup Annual.
Impact Measurement & CERIF Brussels September; Horizon 2020 and Beyond Gerry Lawson, Natural Environment Research Council,
SCIENCE, RESEARCH DATA, AND PUBLISHING Stewart Wills Editorial Director, Web & New Media, Science 26 February 2013.
THE LEGACY OF FACETED CLASSIFICATION Brian Vickery and the Classification Research Group.
Digital/Open Access repositories Paul Sheehan Director of Library Services DCU HEAnet National Networking Conference Athlone 11 th November 2005.
Building An Academic Career
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute The Digital Enterprise Research.
Committed to making the world’s scientific and medical literature a public resource.
BMC Open Access Colloquium, 8 February Morgan: "Open Access Repositories"
DATABASES Southern Region CEO Wednesday 13 th October 2010.
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
Shruthi(s) II M.Sc(CS) msccomputerscience.com. Introduction Digital Libraries have become the source of information sharing across the globe for education,
Accessing journals by via PubMed Note the link to find articles through HINARI/PubMed. Using this option will be covered in later in the Short Course.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Research publication & enabling.
Image BioInformatics Research Group Department of Zoology University of Oxford, UK CERIF Data Surgery University of Bath 9 February.
HEFCE/Higher Education Academy/JISC cc-by-sa (uk2.5) Image source – flickr (cc-by) OER and the Open Agenda Malcolm Read, Executive Secretary, JISC.
Semantic Web: The Future Starts Today “Industrial Ontologies” Group InBCT Project, Agora Center, University of Jyväskylä, 29 April 2003.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Partner Publishers’ Websites From the Partner publisher services dropdown menu, click on the Elsevier Science - Science Direct website. Note that this.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
COLLABORATION : THE KEY TO UNLOCK OPEN ACCESS PUBLICATION Frederick J. Friend Senior Consultant, Information Program, Open Society Institute
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
The DEER Distributed European Electronic Resource Dr Suzanne Keene Francesca Monti University College London.
Realising MRC’s Vision in Health and Bioinformatics MRC Open Council Meeting July 2014 Janet Valentine Head of Population Health and Informatics.
The Impact of the Social Sciences Jane
Ukpmc.ac.uk As a result of the mandates Research in the open How mandates work in practice 29 th May, 2009 Paul Davey, UK PubMed Central Engagement Manager,
Deep Indexing in ProQuest Health and Medical Databases.
LAMS: The Learning Activity Management System James Dalziel Professor of Learning Technology & Director, Macquarie E-Learning Centre Of Excellence (MELCOE)
CitEc as a source for research assessment and evaluation José Manuel Barrueco Universitat de València (SPAIN) May, й Международной научно-практической.
LAMS: The Learning Activity Management System
Next Generation Preprint Service
Elsevier Activity Range
OECD, Paris 8/9 December 2008 Joint Information Systems Committee
….part of the OSU Libraries' suite of digital library tools…
DELNET – Developing Library Network
Bird of Feather Session
Presentation transcript:

David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK SPIDER Project Launch Meeting Wolfson College, Oxford Friday 27 June 2008 SPIDER – Semantic Publishing for Infectious Disease Epidemiology Research © David Shotton, 2008 Published under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Licence zoo.ox.ac.uk

The SPIDER Project Objective:  To change the world in terms of research publications relating to infectious disease epidemiology – both papers and datasets – and thereby improve epidemiologists’ access to disease data, by assisting authors, data producers, publishers and information consumers to use the Web to its full potential The SPIDER Project partnership will undertake pre-competitive multi- disciplinary R&D that individual research groups and companies have neither the capacity nor the skills to undertake individually It will seek to demonstrate enhanced “end-to-end” data management for epidemiological research Early adopters of semantic publishing will benefit by increasing the desirability of their journals, and by developing added value services Biomedical researchers will benefit by better, faster, cheaper access to data for epidemic prediction and countermeasures

The anticipated SPIDER Project partnership Biological and veterinary researchers, and others who create or consume infectious disease incidence data (DEFRA and HPA) Computer scientists with technologies that can assist in data management My Image Bioinformatics Research Group (IBRG), which employs semantic and Web technologies to enhance the usefulness of biological data, particularly image data Leading scientific publishers publishing journals relevant to disease The partnership should as far as possible be geographically local, to enhance collaborative potential

Infectious diseases are a continuing threat UK Government’s 2006 Foresight Report  Future threats from infectious diseases are at least as great as in the past century, with an undiminished rate of emergence of new diseases and increasing resistance being shown by many disease agents to antimicrobial drugs Malaria and TB are major global health problems HIV-AIDS, SARS, West Nile fever, bluetongue and avian influenza present new hazards Bioterrorism (e.g. anthrax, smallpox) presents potential threats Lives depend upon the timely availability of reliable disease incidence data that permit predictions of the severity and spread of epidemics

Biological and veterinary researchers Angela McLean, Oxford University – epidemiologist David Rogers, Oxford University – uses remote sensing satellite data to predict disease spread Joe Brownlie, Royal Veterinary College, London – one of the prime movers behind the Foresight Report: Infectious Diseases: Preparing for the Future – is in Turin today and sends his apologies Jane Gibbens, DEFRA: “Apologies for not responding earlier - we are currently dealing with an outbreak of avian flu, and unlikely to be able to field a participant” Health Protection Agency

The potential of the Web Tim Berners-Lee, 2001: “The semantic web will profoundly change the very nature of how scientific knowledge is produced and shared, in ways that we can now barely imagine” Semantic publishing will enable new knowledge to be more effectively linked into the existing knowledge network It will make explicit the rhetorical claims of each paper, and assist users in accessing data, both within the paper and in the underpinning datasets It will enable this by imparting publications with machine-readable metadata, allowing computers to assist in the task of information discovery and integration

Computer science and ICT input Andy Seaborne Hewlett Packard Research Labs, Bristol “I’m sorry I can not make June 27, but the area of semantic publishing that SPIDER is addressing is of interest to us and we support this work. “We are interested in developing a platform around scalable RDF storage, query and inference, and to create and deploy a live service for a specific use case could form the basis for joint work” Nigel CollierBioCaster, presently in Tokyo “I am very keen to be part of the SPIDER Project” Ian HorrocksInformation Systems Group, Oxford Inigo Surguy Chief technologist, 67 Bricks, Oxford Siegfried HandschuhDigital Enterprise Research Institute (DERI), GalwayDERI

Nigel Collier’s BioCaster Global Health Monitor Biocaster automatically scours the Internet and aggregates news of infectious diseases, employing text mining tools to analyse it Nigel has developed an extensive ontology of infectious diseases which we will use for the SPIDER Project

Integrating heterogeneous data from distributed sources IBRG is using Semantic Web tools and standards to develop data webs  All identifiers are mapped to URIs  RDF is used as the standard format for sharable metadata  The RDF query language SPARQL is used for data web queries

Scholarly publishing in the UK UK researchers contribute 9% of the world’s research publications and 12% of the world’s citations Many academic publishers are based in the UK, particularly in Cambridge, Oxford and London The English-language scientific, technical and medical (STM) journals market was estimated to be worth around $5 billion globally in 2004 Scholarly journals are the primary dissemination channel and public record of new research and development effort The original research article is the key commodity which drives this marketplace The new UK Research Assessment Framework will use metrics based on research articles published

The information overload

... but publishing has changed very little We still have a linear narrative, with references The norm is that the online version of a journal article is a PDF file  PDF is an electronic embodiment of a static printed page  This is antithetical to the spirit of the Web, and ignores its great potential Rather, we need lively journal content  Semantic mark-up of text  Interactive figures  Actionable numerical data  Links between papers and datasets

The Brussels Declaration on STM Publishing Published by the international scientific, technical and medical (STM) publishing community ( “STM publishers are committed to change and innovation that will make science more effective, and believe that raw research data and datasets submitted with a paper to a journal should, wherever possible, be made freely accessible to other scholars”

Publishers invited to participate in the SPIDER Project Anita de Waard, Elsevier: “SPIDER is a most interesting endeavor - we will provide support, input and access to content” Timo Hannay, Nature: “Unfortunately I cannot attend and haven’t been able to find a good substitute to attend from Nature. Keep us informed with any resulting documentation and further meeting announcements” Mark Patterson, PLoS: “I’m very interested in the SPIDER Project, and if there are some specific things that you need from PLoS, I’ll do my best to see what we can provide” Nick Royle, The Cochrane Collaboration: “We are trying to find someone - snag is that the most appropriate people are in Australia and Canada” Matthew Cockerill, BioMed Central: “Very interested – cannot come but will send a colleague” (Matt Hodgkinson) Philippa Scoones, Wiley-Blackwell: “We have a lot of relevant content and are very keen to be participants” (Will Wilcox) Richard O’Beirne, Oxford Journals

Other stakeholders in academic publishing Stephane GoldsteinResearch Information Network:  “This certainly looks like an interesting project, clearly relevant to the work that we've been doing of late on data stewardship and publication. We would certainly wish to be kept abreast of developments” Ed Pentz Executive Director, CrossRef, Oxford On seeing the Royal Society of Chemistry’s efforts in semantic publishing: “This is fantastic. I’ve just seen the future of the journal” Robert Kiley Head of e-Strategy, Wellcome Trust Library Paul Davey UK PubMed Central (STM), British Library

Aspects of semantic publishing Primary enhancements  Enhancements of the authoring process  Enhancement of the online paper  Enhancement of data publication Subsequent enhancements  Better integration between papers, and between papers and datasets  Creation of added-value services on top of these publications

Enhancements of the authoring process Anita de Waard of the Elsevier Advanced Technology Group is currently developing a system, in conjunction with the editors and authors of Cell, whereby the authors are enabled to mark-up while writing the paper Microsoft have a plug-in for MS-Word 2007 ( for the creation of structural mark-up that supports the National Library of Medicine’s document format Organizing committee included  Siegfried Handschuh and Nigel Collier (co-chairs) and Anita de Waard

Enhancement of the online paper Some journals, particularly those of the Royal Society of Chemistry, have taken the lead in this area Enhancement of data publication Few examples yet in the scholarly journals, but some excellent examples from other data sources  e.g. SourceOECD, the Online Library of Statistical Databases, Books and Periodicals of the Organisation for Economic Co-operation and Development (OECD)

The Elsevier Grand Challenge The Elsevier Grand Challenge: Knowledge Enhancement in the Life Sciences  “To improve the interpretation and identification of meaning in online journals and text databases relating to the life sciences” First prize of US$35,000, second prize of US$15,000

The paper we have chosen to semantically ‘enliven’

The morning agenda 09:30 Coffee 10:00David Shotton Welcome, introductions, SPIDER Project vision 10:30Angela McLeanEpidemiology researchers and data access 10:45David RogersRemote sensing geospatial data and disease 11:00 Coffee 11:30Ian Horrocks Can computer science help research publishing? 11:45 Siegfried Handschuh DERI and semantic publishing 12:00Katie Portwin, Alistair Miles, Jun Zhao, Graham Klyne The IBRG semantic publishing vision 12:45 Discussion of biological and computer science issues

The afternoon agenda 14:00Richard O’BeirneThe challenge of the Web 14:10Matt Hodgkinson BioMed Central's interest and activities in semantic publishing and data sharing 14:20Will WilcoxWiley-Blackwell and semantic publishing 14:30Ed Pentz CrossRef, citations and information linking 14:40Paul Davey Enriching UK Pubmed Central holdings 14:50Robert Kiley The Wellcome Trust’s view 15:00Discussion of publishing issues 15:15David ShottonThe SPIDER project and the EPSRC Digital Economy funding opportunity 15:40General discussion, conclusions and future actions