Context Interoperability Submission Search Preservation

JY Le Meur / Tibor Simko, 12th Feb '04. 1) Context 2) Interoperability 3) Submission 4) Search 5) Preservation. CERN, OAI3 Workshop, Geneva.

Context Interoperability Submission Search Preservation CERN, OAI3 Workshop, Geneva

CONTEXT: Once upon a time …
CERN Library: Mission of dissemination and long term keeping of HEP results
THREE MAJOR CHANGES
1 - Computers start to be used in libraries
  Dec. 1973: CERN library
  ALEPH Integrated System used at CERN
2 - First www server (Dec. 1990)
  1993: CERN preprint server → Library Server → CERN Doc Server
3 - Open Archive Initiative is launched (in 1999)
  CDSware: OAI-compliant, distributed as GPL

CONTEXT: CDSware Architecture
[Architecture diagram; recoverable labels only]
Roles: admin, author, system librarian, user
Modules: BibFormat, WebSubmit, WebAccess, BibConvert, BibUpload, BibHarvest, BibSched, BibIndex, BibData, WebSearch, WebAlert
Storage: MySQL RDBMS, CDSware indexes (metadata + data)
Platform: Apache/Python, XML MARC, GNU GPL
OAI: OAI/non-OAI data providers, OAI data providing, OAI services
Incremental organic-growth SW development model

CDSware Interoperability
OAI Harvesting
  OAI harvester: BibHarvest
  Non-OAI harvester: BibConvert
  At CERN: more than 80 distinct sources are harvested
OAI Providing
  Records can be private, public and "OAI-public"
  OAI Sets can be defined using any search criteria
Search Output Formats
  XML MARC, XML Dublin Core and more…
  Any query is "OAI-ready"
  E.g.: an OAI harvester could harvest only papers written by Ellis, J.
  E.g.: an OAI harvester could harvest only title fields
Applications built on top of CDSware
  APIs to CDSware available
  Connection with other search engines
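A selective harvest of the kind described above can be sketched as an OAI-PMH `ListRecords` request restricted to one set. This is a minimal sketch using only the standard OAI-PMH request parameters (`verb`, `metadataPrefix`, `set`, `from`); the base URL and the set name `ellis_j_papers` are hypothetical and do not come from the slides.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: a real repository publishes its own OAI-PMH base URL.
BASE_URL = "http://repository.example.org/oai2d"


def list_records_url(base_url, metadata_prefix, set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL for a selective harvest."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        params["set"] = set_spec      # restrict the harvest to one OAI set
    if from_date is not None:
        params["from"] = from_date    # only records changed since this date
    return base_url + "?" + urlencode(params)


# "ellis_j_papers" is an assumed set name standing in for a set that the
# repository administrator defined from a search on the author Ellis, J.
url = list_records_url(BASE_URL, "oai_dc", set_spec="ellis_j_papers",
                       from_date="2004-01-01")
print(url)
```

The harvester only needs to know the set name; which records fall into the set is decided on the provider side by the search criteria behind it.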

CDSware Submission process
Each collection can have its own submission policy
  Direct submission
  Submission with monitoring
  Submission with simple approval
  Submission with peer review/refereeing and editorial board
Each collection can have its own record definition
  Metadata fields (mandatory, optional, controlled at input time…)
  Full text formats
  Revised versions
Each submission has its own process management
  With an HTML administration interface
  To define submission screens
  To define actions to be applied
"Batch submission" mode
  BibHarvest, BibConvert and BibUpload modules
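The per-collection policies and record definitions listed above could be modelled roughly as follows. This is an illustrative sketch only: the class names, field names and status strings are hypothetical and are not the WebSubmit configuration format, and treating "monitoring" as publish-immediately is an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum


class SubmissionPolicy(Enum):
    DIRECT = "direct submission"
    MONITORED = "submission with monitoring"
    SIMPLE_APPROVAL = "submission with simple approval"
    PEER_REVIEW = "submission with peer review and editorial board"


@dataclass
class CollectionConfig:
    """Hypothetical per-collection settings: one policy, one record definition."""
    name: str
    policy: SubmissionPolicy
    mandatory_fields: list = field(default_factory=list)
    optional_fields: list = field(default_factory=list)

    def missing_fields(self, record):
        """Return the mandatory fields that are absent or empty in the record."""
        return [f for f in self.mandatory_fields if not record.get(f)]

    def initial_status(self, record):
        """Decide what happens to a new submission under this policy."""
        if self.missing_fields(record):
            return "rejected: missing mandatory fields"
        # Assumption: direct and monitored submissions appear immediately,
        # the other two policies wait for a human decision.
        if self.policy in (SubmissionPolicy.DIRECT, SubmissionPolicy.MONITORED):
            return "published"
        return "awaiting approval"


preprints = CollectionConfig("Preprints", SubmissionPolicy.DIRECT,
                             mandatory_fields=["title", "author"])
theses = CollectionConfig("Theses", SubmissionPolicy.SIMPLE_APPROVAL,
                          mandatory_fields=["title", "author", "university"])

record = {"title": "A title", "author": "Ellis, J."}
print(preprints.initial_status(record))
print(theses.initial_status(record))
```

The same record is accepted by one collection and rejected by the other, which is the point of keeping policy and record definition per collection rather than global.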

CDSware Search
Google-like speed up to 1,000,000 records
  Web Application server → DB server
  DB insufficient: in-house performance-driven index design
  Fast marshalling & fast set intersections:

  query              no. hits   search time
  cern               223,843    0.07 sec
  of                 439,793    0.07 sec
  of cern            109,635    0.10 sec
  of cern the this    11,940    0.17 sec

Combined metadata/fulltext/reference search
Multi-stage search guidance system
Personalization: baskets, email alerts
Navigable collection trees
  Primary and Virtual orthogonal views
Internationalization: multi-language interface
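The "fast set intersections" idea can be illustrated with a toy word index: each word maps to the set of record IDs containing it, and a multi-word query intersects those sets. This is a generic inverted-index sketch, not the BibIndex implementation; the sample records are invented for the example.

```python
from collections import defaultdict


def build_index(records):
    """Map each word to the set of record IDs whose text contains it."""
    index = defaultdict(set)
    for rec_id, text in records.items():
        for word in text.lower().split():
            index[word].add(rec_id)
    return index


def search(index, query):
    """Return the IDs of records containing every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    # Intersect starting from the smallest hit set so the working set
    # shrinks as early as possible.
    hit_sets = sorted((index.get(w, set()) for w in words), key=len)
    result = set(hit_sets[0])
    for hits in hit_sets[1:]:
        result &= hits
        if not result:
            break
    return result


# Invented sample data, for illustration only.
records = {
    1: "history of cern",
    2: "the physics of this detector at cern",
    3: "of mice",
}
index = build_index(records)
print(search(index, "of cern"))
print(search(index, "of cern the this"))
```

Adding words to a query can only shrink the result, which matches the falling hit counts in the timing table above.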

CDSware: Long term preservation
CDSware at CERN
  "Certified Information System" (CIS)
  Considered as a long term electronic archive
  Hosts the official CERN Archives
MARC21 based: LOC standard
  XML MARC is the internal representation of CDSware records
Records deletion policy
  Record IDs never change
Full text automatically converted to PDF
  CERN Conversion server can be plugged in (GNU GPL)
Digital content disseminated… via OAI!
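To make "XML MARC as the internal representation" concrete, here is a minimal sketch that builds a record in the Library of Congress MARCXML namespace with the Python standard library. The choice of fields (001 for the record ID, 100 for the main author, 245 for the title) follows general MARC21 usage; the exact tags and indicators CDSware stores are not stated in the slides, so treat the layout as an assumption.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
ET.register_namespace("", MARC_NS)  # serialize without a namespace prefix


def _tag(name):
    """Qualify an element name with the MARCXML namespace."""
    return "{%s}%s" % (MARC_NS, name)


def make_record(rec_id, title, author):
    """Build a minimal MARCXML record: ID (001), author (100), title (245)."""
    record = ET.Element(_tag("record"))

    # 001: control number, used here as the permanent record ID.
    control = ET.SubElement(record, _tag("controlfield"), tag="001")
    control.text = str(rec_id)

    # 100 $a: main entry, personal name.
    author_field = ET.SubElement(record, _tag("datafield"),
                                 tag="100", ind1="1", ind2=" ")
    ET.SubElement(author_field, _tag("subfield"), code="a").text = author

    # 245 $a: title statement.
    title_field = ET.SubElement(record, _tag("datafield"),
                                tag="245", ind1="1", ind2="0")
    ET.SubElement(title_field, _tag("subfield"), code="a").text = title
    return record


xml_text = ET.tostring(make_record(42, "A title", "Ellis, J."),
                       encoding="unicode")
print(xml_text)
```

Because the record ID lives in the record itself and never changes, the same XML can be re-exported, converted or harvested later without breaking references to it.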

650,000 different records, 350,000 full texts, 450 different collections
1000 new preprints per week:
  70% from arXiv
  5% from CERN
  25% from 80 other sources
125,000 distinct hosts/clients in 2003
12,000 distinct hosts/clients per month
120,000 searches per month
5,000 OAI harvesting requests per month

CDSware: Conclusions
Used in many places (a dozen installations)
Dedicated support from CDS team (charged)
Extending traditional library systems
Designed to evolve
Suitable for mid to large size repositories (1M records)
http://cdsware.cern.ch