JY Le Meur/Tibor Simko 12 th Feb’04 1)Context 2)Interoperability 3)Submission 4)Search 5)Preservation CERN, OAI3 Workshop, Geneva.

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
GL5, December 4 - 5, 2003 Amsterdam, The Netherlands CERN Document Server Martin Vesely CERN Geneva, Switzerland Document Management System for Grey Literature.
50 Years of Experience in Making Grey Literature Available Matching the Expectations of the Particle Physics Community Carmen ODell.
EPrint Software options William J Nixon DAEDALUS Project, University of Glasgow eFAIR Meeting, Southampton.
1 st OAF-Workshop, th May 2002, Pisa, Italyhttp://cdsware.cern.ch/ CERN Document Server Software Martin Vesely CERN Geneva, Switzerland.
Setting Up Information Portal Irwan Sampurna C-CONTENT 23 May 2006.
OAI in DigiTool DigiTool Version 3.0.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
HEPiX Spring Meeting, Edinburgh 26th May 2004 Integrated Digital Conferencing Mick Draper CERN (on behalf of CDS/InDiCo team)
DSpace Devika P. Madalli DRTC, ISI Bangalore.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
Introducing Symposia : “ The digital repository that thinks like a librarian”
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Geneve, February 12, 2004 CERN OAI 3 Workshop - Tutorial 2 F. Lützenkirchen Implementing institutional Content Repositories with MyCoRe and MILESS 3rd.
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
„Serving Innovation …“ ElPub2006 – Workshop Wolfram Horstmann 2006/06/14 I n i t i a t i v e f o r I n n o v a t i o n i n S c h o l a r l y C o m m u.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
ILC EDMS project suite Status Maura Barone GDE/Fermilab ILC Valencia - November 7, 2006.
Serenate1 Non-standard users: The Library Raf Dekeyser K.U.Leuven.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
Processing e-literature at CERN Corrado Pettenati Mick Draper20 March 2000 Processing electronic literature: CERN case study C. Pettenati (ETT-SI) M. Draper.
1 The Chemistry Preprint Server: An Experiment in Scientific Communication James Weeks, ChemWeb Inc. 84 Theobalds Road, Holborn, London WC1X 8RR
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The repositories Landscape: where are Repositories now and what’s around the corner? UKDA-store Louise Corti UKDA, University of Essex MIMAS OPEN FORUM.
CERN Document Server software JY. Le Meur; T. Baron T. Simko; D. Bourillot Europython – 4th July 2006 On the usage of Python in the CERN Document Server's.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
2005 JACoW Team Meeting Thomas Baron/Jose Benito Gonzalez – CERN – IT Managing Events with Indico.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
The DPubS Development Project: Building an Open Source Electronic Publishing System David Ruddy Cornell University Library.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.
International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library | Lisbon – Portugal DIGITAL ARCHIVE OF PORTUGUESE ART.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
CERN - IT Department CH-1211 Genève 23 Switzerland t INSPIRE A Global Digital Library for HEP 14 th February 2011 Tim Smith on behalf of.
Uganda Scholarly Digital Library (USDL) Makerere University’s Institutional Repository By Margaret Nakiganda URL:
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
OAIS: From Requirements to Reality at OCLC FLICC / CENDI Symposium, Dec Pam Kircher Product Manager, Digital Archive OCLC Digital & Preservation.
OAI Workshop, October 17, Geneva, Switzerland CERN Document Server: An OAI-based solution for managing data collections Jean-Yves.
Feb 7th, 2007 BILCW07 1 ILCAgenda – ILCDoc – ILC EDMS Status Maura Barone GDE/Fermilab.
OAI and peer review Workshop (CERN 22/03/2001) Thomas Baron – Tibor Simko CERN Document Server: Validation & OAI WORKSHOP on the Open Archives initiative.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
DSpace - Digital Library Software
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Serenate1 The librarian’s view Raf Dekeyser K.U.Leuven.
Sept 21, 2006 ILC EDMS Maura Barone 1 News –Updated to CDS Invenio version New version in about 2 weeks (?) address.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Jan 18, EDMS 1 Status of ILCDoc Maura Barone GDE/Fermilab.
CERN Document Server 19 tth January 2006 CERN Document Server Jean-Yves Le Meur 19 th January 2006.
Virtual Collections VIRTUAL COLLECTIONS LDI Architecture Meeting, Tuesday, July 19.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
IST SciX project: Lowering the technical, economic and social barriers to open scientific publishing Žiga Turk University of Ljubljana, Slovenia.
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
James Weeks, ChemWeb Inc. 84 Theobalds Road, Holborn, London WC1X 8RR
from Invenire: inveniō invenīs invenit invenī́mus invenī́tis inveniunt
Implementing institutional Content Repositories with MyCoRe and MILESS
Managing Copyrights in Invenio
Tim Smith CERN Geneva, Switzerland
VI-SEEM Data Repository
Building Search Systems for Digital Library Collections
VI-SEEM Data Repository
Introduction to DSpace
DIGITAL LIBRARY.
Context Interoperability Submission Search Preservation
Institutional Repositories
Presentation transcript:

JY Le Meur/Tibor Simko 12 th Feb’04 1)Context 2)Interoperability 3)Submission 4)Search 5)Preservation CERN, OAI3 Workshop, Geneva

JY Le Meur/Tibor Simko 12 th Feb’04 Dec. 1973: CERN library Once upon a time … CONTEXT 1 - Computers start to be used in libraries ALEPH Integrated System used at CERN CERN Library: Mission of dissemination and long term keeping of HEP results THREE MAJOR CHANGES 2- First www server (Dec. 1990) 1993: CERN preprint server  Library Server  CERN Doc Server 3- Open Archive Initiative is launched (in 1999) CDSware OAI-compliant distributed as GPL

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware metadata+ data BibConvert BibUpload BibSched BibData WebAccess system librarian BibIndex WebAccess BibHarvest OAI/Non OAI Data Provider author WebSubmit WebAccess user WebAlert WebAccess user WebSearch WebAccess OAI Data Providing OAI Services admin BibFormat CONTEXT CDSware Architecture … MySQL RDBMS CDSware Indexes Apache/Python XML MARC GNU GPL Incremental organic-growth SW development model

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware Interoperability  OAI Harvesting OAI Harvester: BibHarvest Non-OAI Harvester: BibConvert At CERN: more than 80 distinct sources are harvested  OAI Providing Records can be private, public and “OAI-public” OAI Sets can be defined using any search criteria  Search Output Formats XML MARC; XML Dublin Core and more… Any query is “OAI-ready” Eg: OAI harvester could harvest only papers written by Ellis, J. Eg: OAI harvester could harvest only title fields  Applications built on top of CDSware APIs to CDSware available  Connection with other Search Engines INTEROPERABILITY

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware Submission process  Each collection can have its own submission policy Direct submission Submission with monitoring Submission with simple approval Submission with peer review/refereeing and editorial board  Each collection can have its own record definition Metadata fields (mandatory, optional, controlled at input time…) Full text formats Revised versions  Each submission has its own process management With an HTML administration interface To define submission screens To define actions to be applied  ‘Batch submission’ mode BibHarvest, BibConvert and BibUpload modules SUBMISSION

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware Search  Google-like speed up to 1,000,000 records Web Application server  DB server DB insufficient: in-house performance-driven index design Fast marshalling & fast set intersections: query no.hits search time cern 223, sec of 439, sec of cern 109, sec of cern the this 11, sec  Combined metadata/fulltext/reference search  Multi-stage search guidance system  Personalization: baskets, alerts  Navigable collection trees Primary and Virtual orthogonal views  Internationalization: multi-language interface SEARCH

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware: Long term preservation  CDSware at CERN “Certified Information System” (CIS) Considered as a long term electronic archive Hosts the official CERN Archives  MARC21 based: LOC standard XML MARC is the internal representation of CDSware records  Records deletion policy Record IDs never change  Full text automatically converted to PDF CERN Conversion server can be plugged in (GNU GPL)  Digital content disseminated… via OAI ! PRESERVATION

JY Le Meur/Tibor Simko 12 th Feb’ different records full texts different collections new preprints per week: - 70 % from ArXiv - 5 % from CERN - 25 % from 80 other sources 125,000 distinct hosts/clients in ,000 distinct hosts/clients per month 120,000 searches per month 5,000 OAI harvesting requests per month

JY Le Meur/Tibor Simko 12 th Feb’04 CDSware: Conclusions  Used in many places (dozen of installations)  Dedicated support from CDS team (charged)  Extending traditional library systems  Designed to evolve  Suitable for mid to large size repositories (1M recs)