DRIVER - Supporting Institutional Repositories in Europe Mary Robinson SHERPA, University of Nottingham, UK Wolfram Horstmann University of Goettingen, Germany ELPUB June 2007 Digital Repository Infrastructure Vision for European Research
Agenda Welcome, Introductions & Aims of Workshop DRIVER Objectives Repository Networking DRIVER Advocacy and Support Activities DRIVER Software Architecture Coffee Presentations by Workshop Participants Open Discussion, Questions & Answers 17:15 -- Wrap-up
DRIVER – Objectives, Repository Networking Wolfram Horstmann University of Goettingen, Germany ELPUB June 2007 Digital Repository Infrastructure Vision for European Research
The current situation for digital repositories More than 850 institutional repositories worldwide –Many others: disciplinary, national, … –Many types: Primary data, textual documents, learning materials, multimedia objects, Code etc. –Documents: incl. pre-prints, postprints, technical papers, dissertations, theses –Various repository software
What is known about repositories? Many have the OAI-PMH implemented –small but relevant local specialties Some international registries exist –OpenDOAR, ROAR … Some search engines exist –BASE, OAIster, Google Scholar …
Collaboration in repositories Very few mature national repository organisations/collaborations No trans-national repository organisation/collaboration Lack of data harmonization, orchestration of services
From the user point of view [ talking about researchers ] Fragmented, obscure information landscape Content can be (partly) searched and found Quality and re-use differ from repository to repository
Need for Repository Infrastructure Organisational Networking –Forum for stakeholders Data harmonization –Cleaned and enriched information space –Offered for re-use by service providers Software orchestration –Open, scalable technical architecture –Re-use of existing services –Facilitate interactions between services Organisation Data Software
DRIVER Vision Build a pan-European Digital Repository Infrastructure and provide the European Hub in a Global Repository Network for any type of document, data and object, of any format, involving all European countries and covering all academic disciplines
DRIVER now! A Digital Repository Infrastructure Test-bed Textual documents from 5 Countries (BE, DE, FR, NL, UK) Distributed, high-quality content provision Distributed software deployment All open and re-usable
DRIVER Current Activities Test Bed for Repository-Services Focused Studies Network of Content-Providers Advocacy and Awareness Raising Content Technology Planning Outreach
Repository networking Organisation working in synergy with data infrastructure Organisation Data Software
Make it workable Focus on existing repositories and services Focus on Institutional Repositories –Rapid progress over the last years –Inherent sustainability (e.g. libraries) –Adequate technical homogeneity (OAI-PMH) Focus on textual materials Focus on specific (test-bed) countries
Networking test-bed 5 countries –DRIVER exemplars –BE, DE, FR, NL, UK Different –Maturity –Policies –Technologies –…
The Netherlands –National Federation –Aggregated Network –Outsourced –Cream of Science >> Model for Distributed Network
Germany –National Policies –Pro-active, individual approach –High quality certificate >> Model for quality assurance
Great Britain –National federation –Advocacy focus –RoMEO IPR lookup –OpenDOAR Registry >> Model for advocacy / added value services
France centralized scheme >> Model for centralized network
Belgium –2006 no national federation –13 Rectors signed Berlin Declaration >> Model for network construction
Networking test-bed 50+ organized repositories Is there a simple technical recipe for networking these repositories?
All publications in a repository
Aggregate with OAI-PMH Aggregator OAI-PMH
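The aggregation step can be illustrated with a minimal sketch, assuming a repository that exposes unqualified Dublin Core (oai_dc) over OAI-PMH 2.0. The base URL, the set name and the sample response are hypothetical and are not taken from the DRIVER test-bed; the sketch only builds a ListRecords request and reads records and the resumption token from a response.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None,
                     resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # In OAI-PMH 2.0 the resumptionToken is an exclusive argument.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        identifier = rec.findtext("oai:header/oai:identifier", namespaces=NS)
        titles = [t.text for t in rec.iterfind(".//dc:title", NS)]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(".//oai:resumptionToken", namespaces=NS)
    return records, (token or None)


# Hypothetical response, shortened to a single record.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
 <ListRecords>
  <record>
   <header><identifier>oai:repo.example.org:1</identifier></header>
   <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
     <dc:title>A sample record</dc:title>
    </oai_dc:dc>
   </metadata>
  </record>
  <resumptionToken>page-2</resumptionToken>
 </ListRecords>
</OAI-PMH>"""

records, token = parse_records(SAMPLE)
```

An aggregator would repeat the request with the returned resumption token until none is returned, then merge the records from all repositories.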
Feed back to research Aggregator OAI-PMH search engine
Lift to an international scale Aggregator search engine
Problems remain Building of aggregations is not trivial! An example: Full-text availability –Full-text not homogeneously offered metadata only, jump-off pages, authorization full-text URLs in varying locations
Full-text guidelines Help to improve the repository landscape –Support of sets –Clear location of URL –Resource harvesting Studies say: –International retrieval –Guidelines
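Why a clear location for the full-text URL matters can be sketched as follows. The heuristic below is an assumption made for illustration, not a rule from the DRIVER guidelines: it sorts the dc:identifier values of one record into probable full-text links, other links (for example jump-off pages) and non-URL identifiers, using only the file suffix.

```python
from urllib.parse import urlparse

# Assumed list of suffixes that suggest a full-text file; purely illustrative.
FULLTEXT_SUFFIXES = (".pdf", ".ps", ".doc", ".txt")


def classify_identifiers(identifiers):
    """Group identifier strings by how likely they point at full text."""
    result = {"fulltext": [], "other_url": [], "non_url": []}
    for value in identifiers:
        parsed = urlparse(value.strip())
        if parsed.scheme not in ("http", "https"):
            # e.g. URNs, handles or ISBNs given without a resolver URL
            result["non_url"].append(value)
        elif parsed.path.lower().endswith(FULLTEXT_SUFFIXES):
            result["fulltext"].append(value)
        else:
            # possibly a jump-off page or a login-protected location
            result["other_url"].append(value)
    return result


# Hypothetical identifiers of a single harvested record.
example = classify_identifiers([
    "http://repo.example.org/record/42",
    "http://repo.example.org/files/42/thesis.PDF",
    "urn:nbn:example-42",
])
```

A guess of this kind fails whenever a repository serves full text from a URL without a telling suffix, which is the situation that a guideline fixing the location of the URL is meant to avoid.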
Next Steps Help 50+ repositories –Guidelines release –Help Desk (WP8) –… Help more repositories –DRIVER countries –other countries –…
DEMO Disclaimer –Test version –Some interfaces for administrators only Repository Landscape –DEMO1 –DEMO2
DEMO OF TEST-VERSION
DRIVER – Software Architecture Wolfram Horstmann University of Goettingen, Germany ELPUB June 2007 Digital Repository Infrastructure Vision for European Research
DRIVER Software Organisation Data Software
EU Knowledge Infrastructural Vision Build and maintain a sustainable European environment where content and functionality resources can be openly shared and integrated for use by any Application Sustainability means: - Maintainability, Scalability, Reusability
Service-based Repository Systems Environment and tools for building service-based Repository Systems –Sets of services running at different network sites, possibly in multiple instances, interacting, dynamic, sharable, open
European Information Space Includes the DRIVER Repository System –Providing registered users with advanced functionalities over a uniform European Information Space formed by aggregating multiple Repositories Repositories –Can join or leave the infrastructure at any time –Are dynamically/automatically aggregated to populate and keep updated the DRIVER Information Space
Run-Time DRIVER Infrastructure European Information Space Repository System –Scalable, reusable, self-maintained Enabling Layer Data Layer Functionality Layer
Enabling Layer Infrastructure management: common to all Repository Systems –Service Registration (P2P-like) and Discovery –Subscription and notification on Service-related events –Orchestration of Services: coordination of Service interactions Enabling Layer Functionality Layer Data Layer Information Service Manager Service Authz&Authn Service
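Registration and discovery can be pictured with a small in-memory sketch. This is not the DRIVER Information Service; the class, its method names and the endpoints are hypothetical and only show the idea that service instances announce themselves and other services look them up by type.

```python
from collections import defaultdict


class ServiceRegistry:
    """Toy registry: maps a service type to the endpoints offering it."""

    def __init__(self):
        self._by_type = defaultdict(list)

    def register(self, service_type, endpoint):
        # Several instances of the same service type may be registered.
        if endpoint not in self._by_type[service_type]:
            self._by_type[service_type].append(endpoint)

    def unregister(self, service_type, endpoint):
        if endpoint in self._by_type[service_type]:
            self._by_type[service_type].remove(endpoint)

    def discover(self, service_type):
        # Return a copy so callers cannot modify the registry state.
        return list(self._by_type.get(service_type, []))


registry = ServiceRegistry()
registry.register("Index", "http://node-a.example.org/index")
registry.register("Index", "http://node-b.example.org/index")
index_services = registry.discover("Index")
```

Because instances can be added or removed at run time, a client that asks the registry before each call keeps working when a node joins or leaves.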
Data Layer DRIVER Information Space Management Enabling Layer Repositories DRIVER Objects Functionality Layer Data Layer OAI-PMH Service Index Service Collection Service Store Service Aggregator Service
Functionality Layer User-oriented functionalities Data Layer Enabling Layer Functionality Layer User Interface Service Recomm. Service Community Service User Service Search Service
DRIVER infrastructure: the benefits Enabling Layer Data Layer Functionality Layer Repositories DRIVER Infrastructure Repository System
DRIVER and standards Service Resources are implemented as Web Services and accessed through the corresponding Web Service Interface –Call parameters are enveloped in SOAP messages –The Enabling Services are also compatible with REST XML is the lingua franca for the whole system –Resource internal status, i.e. Resource profiles –Profiles in the Information Service use the eXist XML engine
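What enveloping call parameters in a SOAP message looks like can be sketched as follows. The slide does not name the SOAP version, so SOAP 1.1 is assumed here; the service namespace, the operation name and the parameter are hypothetical and do not describe an actual DRIVER interface.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope
SERVICE_NS = "urn:example:driver-sketch"               # hypothetical namespace


def build_soap_request(operation, params):
    """Wrap an operation call and its parameters in a SOAP envelope."""
    ET.register_namespace("soap", SOAP_NS)
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    call = ET.SubElement(body, f"{{{SERVICE_NS}}}{operation}")
    for name, value in params.items():
        # Each parameter becomes a child element of the operation element.
        ET.SubElement(call, f"{{{SERVICE_NS}}}{name}").text = str(value)
    return ET.tostring(envelope, encoding="unicode")


# Hypothetical call asking for the profile of one resource.
message = build_soap_request("getProfile", {"resourceId": "r-1"})
```

The resulting string would be sent as the body of an HTTP POST to the Web Service endpoint.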
DRIVER and standards Vocabularies –Names of Languages: ISO 639-2 (three letters, B/T) –Names of Countries: ISO 3166 (two letters) –Date format: ISO 8601:1988 (E)
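A harvester could check incoming values against these vocabularies. The sketch below is an assumption about how such a check might look: it tests only the shape of each value (three lowercase letters, two uppercase letters, a calendar date in YYYY-MM-DD form) and does not look the codes up in the official ISO lists.

```python
import re
from datetime import date

LANGUAGE_SHAPE = re.compile(r"[a-z]{3}")  # shape of an ISO 639-2 code
COUNTRY_SHAPE = re.compile(r"[A-Z]{2}")   # shape of an ISO 3166 two-letter code


def check_record(language, country, date_text):
    """Return the names of the fields whose values have the wrong shape."""
    problems = []
    if not LANGUAGE_SHAPE.fullmatch(language):
        problems.append("language")
    if not COUNTRY_SHAPE.fullmatch(country):
        problems.append("country")
    try:
        # Accepts the ISO 8601 calendar date form YYYY-MM-DD.
        date.fromisoformat(date_text)
    except ValueError:
        problems.append("date")
    return problems


ok = check_record("dut", "NL", "2007-06-13")
bad = check_record("nl", "nld", "13/06/2007")
```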
DRIVER and standards DRIVER Aggregation –Harvesting according to OAI-PMH protocol –Adopting OAI-Provenance best practice (OAI-about) –To be extended to other object models and harvesting protocols Queries to Search Service and Index Service conform to the SRW/CQL standard
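A CQL query can be sketched as below. The slide names SRW, the SOAP-based protocol; for brevity the sketch uses SRU, its URL-based sibling, which carries the same CQL query as a request parameter. The endpoint URL is hypothetical, and whether the DRIVER Search Service supports the Dublin Core index names used here is an assumption.

```python
from urllib.parse import urlencode


def cql_term(index, value):
    """Build a single CQL search clause with a quoted term."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'{index} = "{escaped}"'


def cql_and(*clauses):
    """Combine clauses with the CQL boolean operator 'and'."""
    return " and ".join(f"({clause})" for clause in clauses)


def sru_search_url(base_url, cql_query, maximum_records=10):
    """Build an SRU 1.1 searchRetrieve URL carrying a CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,
        "maximumRecords": maximum_records,
    }
    return base_url + "?" + urlencode(params)


query = cql_and(cql_term("dc.title", "open access"),
                cql_term("dc.creator", "Smith"))
url = sru_search_url("http://search.example.org/sru",
                     cql_term("dc.title", "open access"))
```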
DRIVER and standards Subscription and Notification Service –Any Service can subscribe to events regarding any DRIVER Resource: creation, deletion, and specific actions performed by a resource –The Subscription and Notification mechanism is compliant with the OASIS Standards WS Base Notification 1.3 and WS Topics 1.3 Authorization and Authentication Service –eXtensible Access Control Markup Language (XACML)
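The subscribe-and-notify pattern can be shown with an in-process sketch. It does not implement WS-BaseNotification or WS-Topics; the broker class, the topic names and the resource identifier are hypothetical and only illustrate that a service registers interest in a kind of event and is called back when it occurs.

```python
from collections import defaultdict


class NotificationBroker:
    """Toy broker: delivers events on a topic to subscribed callbacks."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, callback):
        self._subscribers[topic].append(callback)

    def notify(self, topic, resource_id):
        """Call every subscriber of the topic; return how many were called."""
        delivered = 0
        for callback in list(self._subscribers.get(topic, [])):
            callback(topic, resource_id)
            delivered += 1
        return delivered


broker = NotificationBroker()
seen = []
# A hypothetical index service that wants to know about new repositories.
broker.subscribe("CREATE", lambda topic, rid: seen.append((topic, rid)))
created = broker.notify("CREATE", "repository-1")
deleted = broker.notify("DELETE", "repository-1")
```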
DEMO Disclaimer –Test version –Some interfaces for administrators only Repository Landscape –DEMO1 –DEMO2
Application Scenarios
End-User / Researcher Not the primary target group, but exemplary applications are being built –Search Service (pan-European) –Recommender Service –… Third parties are invited to join!
Local Repository Operator Register Repository Monitor Data Quality –Comply with Guidelines (full-text) –Enrich data (provenance)
Local Repository Developer Re-use services for local system –Information space Search Index Collections …
Repository network manager Manage clusters of repositories Build a regional, national thematic data network –Re-use collections Develop individual GUIs
Repository platform developer Align developments with DRIVER –Better harvesting –Data quality –Service integration
Service Provider Integrate service into DRIVER infrastructure Benefit from service orchestration Allow the service to communicate with other services
Service Developer Develop services that also run with DRIVER –Citation analysis –Workflow systems –Text-Mining / Data-Mining –Classification systems –…
DRIVER - Advocacy & Support Activities Mary Robinson SHERPA, University of Nottingham ELPUB DRIVER Workshop June 2007
Repositories Worldwide
Repositories in Europe
DRIVER - Inventory study into the present type and level of OAI compliant Digital Repository activities in the EU support.eu/en/about.html
Stakeholder & Information Gap Analysis Academics –as authors –as researchers Institution Administrators Repository Administrators Library & Support Staff Funding Agencies –National government –European –Other Publishers National research bodies European research bodies General Public
Questions common to all Stakeholders What are the benefits? How will this affect the peer review process? Where can I go for more information? Where can I go to discuss this issue?
Approach Advocacy Materials Attendance at conferences Contacting professional groups Resources, Support and Community Building –DRIVER Support website –DRIVER Guideline Helpdesk –Mentor Service –DRIVER Wiki
DRIVER Support website Information Dissemination –Infrastructure reports –Materials for re-use –Advocacy for stakeholders Support & Facilitate Communication –Point of contact with project & national contacts –Wiki –Mentor –News, Events and Links of interest
DRIVER Workshop Participant Contributions
Royal Netherlands Academy of Arts and Sciences DAREnet, CRIS and NARCIS: Access to research information in the Netherlands Elly Dijk KNAW Research Information ELPUB, Vienna, June 2007
Outline Royal Netherlands Academy of Arts and Sciences DARE programme and DAREnet Integration of DAREnet and NARCIS Linking CRIS information, publications and datasets
KNAW Research Information Academy's mission: to ensure the quality of scientific research in the Netherlands and to promote the open accessibility of scientific information Our mission: national focal point of research information Dutch Research Database (NOD) DAREnet NARCIS
The Mission of DARE Better access to the results of publicly funded research in the Netherlands
DARE Programme Approved: 14 June 2002 Name: DARE [Digital Academic REpositories] Period: 1 Jan – 31 Dec Budget: 5.9 M Standards: OAI-PMH 2.0; Dublin Core Partners: All the universities, KNAW, NWO, KB and SURF
Two Layers Society Research Education Harvesters Virtual Learning Environments, Course Ware, Readers, ... Institutional Repository Academy TUDelft NWO ... CNRS MIT Subject repositories, refereed portals, databases, collaboratories, (Open Access) journals, ... Institutional windows, expertise, professional journals, personal Web sites, national windows, ...
Milestone #1 27 January 2004 DAREnet Nationwide (13 universities, KNAW, NWO) Initially 17,000 publications (currently 128,000) Institutional Repository CNR-ISTI Amsterdam ... CERN ArXiv
Milestone #2 Cream of Science 10 May Entire output of +10 top academics per DARE participant
Some figures of Cream of Science 16 institutions 229 authors 47,000 records = 205 per author (from 3 to 1,224) 60% full text = 28,000 records 25% copyright obstructed, 15% only metadata available at the moment
Milestone #3 One HunDAREd Thousand 100,000 new OA publications before the end of 2006 of which 10,000 doctoral theses under the heading of Promise of Science
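The Cream of Science figures can be reproduced with a short calculation. The sketch uses only the numbers on the slide and shows that the stated 205 records per author and roughly 28,000 full-text records are rounded values.

```python
authors = 229
records = 47_000

# Average number of records per author, rounded to a whole number.
records_per_author = round(records / authors)

# Shares stated on the slide, computed with integer arithmetic.
full_text = records * 60 // 100             # 60% full text
copyright_obstructed = records * 25 // 100  # 25% copyright obstructed
metadata_only = records * 15 // 100         # 15% metadata only

print(records_per_author)   # 205
print(full_text)            # 28200, given as about 28,000 on the slide
print(copyright_obstructed, metadata_only)  # 11750 7050
```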
Milestone #4 Promise of Science September 2006: Dutch national site for e-theses officially launched June 2007: nearly 15,000 doctoral e-theses from all Dutch universities Annual production: c. 2,500 e-theses 1995: some 17% accessible 2006: some 77% accessible
Lessons learned Cooperation Academy of Science, university libraries, National Library, SURF, National Research Council (NWO) Agreement on open standards and harvesting Internet, Web, XML, OAI-PMH, DC ++, (IEEE LOM, DIDL, SOA) New actions to feed the repositories Cream of Science, Promise of Science
Success factors DAREnet with 128,000 objects IRs are embedded in the local and national knowledge infrastructure Copyright toolbox, including a licence to publish Communication toolkit for acquisition of doctoral e-theses DRIVER: Academy assumed responsibility for DAREnet
Next step Integration of CRIS and repositories by integrating DAREnet and NARCIS What is NARCIS?
NARCIS National Academic Research and Collaborations Information System Project of KNAW Research Information in collaboration with NWO It is a portal giving access to: Research Information: research programmes, research institutes, researchers Dutch repositories: full text publications News from scientific organisations Extra feature: RSS feed
Integrating DAREnet and NARCIS NARCIS combines roughly the Dutch Research Database and DAREnet, but DAREnet has unique features NARCIS-DAREnet integration project to combine the best of both worlds
Steps in the integration process Integrated backend by using the DAREnet backend system Advanced search Special access to Cream of Science and Promise of Science Google New look-and-feel; and a new name?
Linking CRIS information, publications and datasets Academy: current research National Library: e-depot DANS: research data (humanities and social sciences) Linking by: Digital author identifiers in the NOD Persistent identifiers of publications Persistent identifiers of datasets
Institutional Repositories in Latin America and Syria: 2 European Commission sponsored projects Ian Johnson
América Latina Formación Académica Academic links 3+ Universities in Latin America 3+ Universities in European Union
Biblioteca de Babel
BIBLIOTECA DE BABEL Coordinator Pontificia Universidad Catolica de Valparaiso, Chile Partners 15 universities in Latin America 9 universities in Europe Associated member Columbus Group European Universities Association Unión de Universidades de América Latina y el Caribe
BIBLIOTECA DE BABEL Aims: Guidelines on best practice for university libraries
BIBLIOTECA DE BABEL Outputs – guidelines on Institutional repositories Information literacy Digital reference
TEMPUS MEDA 1 university in North Africa or the Middle East 1 university in the European Union Joint European Projects Structural and Complementary Measures
SYReLIB Coordinator The Robert Gordon University Partners Middlesex University, London 2 universities in Syria eIFL.net
SYReLIB Aims Establishing a national strategic framework for electronic libraries
SYReLIB Outputs Pilot projects in 2 universities Enhanced e-journal collection Integrated Library Management Systems Institutional Repositories Proposal for second project
Questions? Comments? Thank you for your attention
DRIVER Workshop- Supporting Institutional Repositories in Europe Mary Robinson SHERPA, University of Nottingham, UK Wolfram Horstmann University of Goettingen, Germany ELPUB June 2007 Digital Repository Infrastructure Vision for European Research