OAI Workshop, October 17-19, 2002, Geneva, Switzerland. CERN Document Server: An OAI-based solution for managing data collections. Jean-Yves Le Meur.

Presentation transcript:

OAI Workshop, October 17-19, 2002, Geneva, Switzerland
CERN Document Server: An OAI-based solution for managing data collections
Jean-Yves Le Meur, CERN, Geneva, Switzerland

2/22 Starting Point
A physicist's office (CERN-MI)
NOT OAI compatible!

3/22 CERN contributions to the open archive movement
- Hosting this workshop!
- Taking part in the technical committee
- Testing the versions of the protocol
- Delivering CERN documents via OAI
- And now: releasing CDSware (CERN Document Server Software) under the GPL

4/22 CDSware at CERN covers:
- All particle physics literature since 1950 and documents from related areas: Astrophysics, Mathematics, Life at CERN…
- ‘Virtual’ collections: special views dedicated to an activity or a group, e.g.:
  - CERN Experiments collections (LHC, ATLAS, etc.)
  - CERN Divisions collections
  - Customized views (Pauli collection)
- And it serves:
  - 156,000 distinct hosts/clients in ,000 distinct hosts/clients per month
  - 1,000 “visits” and 3,500 searches per day
  - 50,000 “hits” and 1.5 GB net traffic per day

5/22 CDSware at CERN contains:
- Articles, preprints, theses
- Books
- Journals
- Conferences
- Archived items
- Multimedia items (photos, clips, press cuttings…)
- Talks (slides, videos)
different records, full texts, different collections
new preprints per week:
- 70 % from arXiv
- 5 % from CERN
- 25 % from 80 other sources

6/22 CDSware at CERN services: CDSware on CDSware on

7/22 CDSware general:
- First version released on 1st August 2002
- All modules delivered as one single package
- Distributed under the GNU General Public License
- Two mailing lists available: one for news, and one for implementers' discussions
- Everything at lines of code !
- Built with: MySQL, Apache, PHP, Python, WML
- All customization and administration is web based

8/22 CDSware featuring:
- WebSubmit: submitting data
- BibHarvest: harvesting OAI repositories
- BibConvert: harvesting non-OAI collections
- BibFormat: formatting and linking records
- WebSearch: searching metadata/citations/full text
- BibWord: indexing metadata and full text
- WebAccess: managing complex collection hierarchy
- WebPerso: personalizing web access
- BibData: modifying records (librarians only)

9/22 CDSware Direct Submit
- Web submission
  - by authors; by secretaries; by library staff
- Submission in steps and with control
  - Open; Monitoring; Approval; [Peer reviewing]
- Automatic document conversion
- Automatic report number generation and stamping
- Multiple ‘post-submission’ functions, e.g.:
  - Forward to distribution lists for advertising
  - Enable comments by peers
  - Modify submitted metadata
  - Send revised versions of full text
  - Extraction of citations
  - Extraction of author lists (when long)
  - [Extraction of keywords]

10/22 CDSware: harvesting strategy
- BibHarvest and BibConvert allow massive imports of records
  - from OAI-compliant data providers
  - from non-OAI-compliant providers, using:
    - a template describing the source to be uploaded
    - a template describing the transformation of the source
- Records are always converted into OAI MARC XML, used as our internal record representation
- Fetching full texts is also possible
- This accounts for 95 % of CERN Library uploads!
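
The harvesting step from an OAI-compliant data provider can be sketched as follows. This is a minimal illustration and not BibHarvest's actual code: the function names, the example.org base URL and the sample response are assumptions made for the example, and it reads Dublin Core (oai_dc) rather than the OAI MARC XML that CDSware uses internally. Only the OAI-PMH 2.0 request parameters and XML namespaces come from the protocol itself.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for a data provider (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Extract identifier, titles and creators from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(OAI_NS + "record"):
        header = rec.find(OAI_NS + "header")
        records.append({
            "identifier": header.findtext(OAI_NS + "identifier"),
            "titles": [e.text for e in rec.iter(DC_NS + "title")],
            "creators": [e.text for e in rec.iter(DC_NS + "creator")],
        })
    return records


# Made-up response used instead of a network call, so the sketch runs offline.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample record</dc:title>
          <dc:creator>Doe, J.</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url("http://example.org/oai"))
    print(parse_records(SAMPLE))
```

A real harvester would also follow resumption tokens and then convert each record into the internal format, which is the role the slide assigns to BibConvert.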

11/22 CDSware: linking strategy
- BibFormat: flexible formatting and linking
- All linking information is separated from bibliographic information
- The search engine knows nothing about linking or formatting
- Supports different types of link solving:
  - External linking: just generate the link from stored rules
  - Internal linking: the link is always a file; it checks the existence, access, formats, etc.
- First scenario:
  - Input: a batch of records in OAI MARC XML
  - Output: the original XML record with its HTML version
- Second scenario:
  - Input: records in OAI MARC XML
  - Output: HTML version to be displayed, or PHP to be saved to a file
- Examples: see
  - Links to full text
  - Links to articles or abstracts of e-journals
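
The idea of keeping link rules apart from the bibliographic record, and generating external links only at formatting time, can be sketched as below. This is an illustration under assumptions, not BibFormat's implementation: the rule table, the field name `report_number`, the example.org URL pattern and both function names are hypothetical.

```python
from html import escape
from urllib.parse import quote

# Hypothetical stored rules: one URL template per record field.
# They live outside the record, so the record carries no linking information.
LINK_RULES = {"report_number": "http://example.org/fulltext/{value}.pdf"}


def resolve_external_link(field, value, rules=LINK_RULES):
    """External linking: generate the link purely from the stored rule."""
    template = rules.get(field)
    if template is None or not value:
        return None
    return template.format(value=quote(value, safe=""))


def format_brief_html(record, rules=LINK_RULES):
    """Produce a brief HTML view of a record, adding a link when a rule applies."""
    html = "<strong>{}</strong>".format(escape(record["title"]))
    number = record.get("report_number")
    url = resolve_external_link("report_number", number, rules)
    if url:
        html += ' <a href="{}">{}</a>'.format(escape(url, quote=True), escape(number))
    return html


if __name__ == "__main__":
    print(format_brief_html({"title": "A & B", "report_number": "EX-2002-001"}))
```

Because the rules are data, changing a link target means editing the rule table, not the records or the search engine. Internal linking would add a check that the target file exists and is accessible, which is omitted here.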

12/22 CDSware: Searching
- Google-like syntax and speed
- OAI functions implemented (v2.0)
- MARC 21 representation database:
  - each field can be searched/browsed alone
- Full text, citations and metadata can be searched together with boolean operators
  - supported formats: PostScript, PDF, MS Word, MS Excel, MS PowerPoint
- Search options can be customized:
  - fields to be searched
  - sort options
  - formats of the records: HTML brief or detailed, XML OAI DC + MARC 21, etc.
  - splitting results by collections, with complex hierarchy
- Personalization options:
  - Baskets, alerts, layout
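
Field-level search combined with boolean operators can be illustrated with a small inverted index. This is a generic sketch and makes no claim about how WebSearch or BibWord are built: the record data, the field names and the functions `build_index` and `lookup` are invented for the example.

```python
import re
from collections import defaultdict


def build_index(records):
    """Map (field, word) pairs to the set of record ids containing that word."""
    index = defaultdict(set)
    for rec_id, fields in records.items():
        for field, text in fields.items():
            for word in re.findall(r"\w+", text.lower()):
                index[(field, word)].add(rec_id)
    return index


def lookup(index, field, word):
    """Return the ids of records whose given field contains the word."""
    return set(index.get((field, word.lower()), ()))


# Made-up records, for illustration only.
RECORDS = {
    1: {"title": "Quantum groups and integrable systems", "author": "Doe, J"},
    2: {"title": "Integrable systems on the lattice", "author": "Roe, A"},
    3: {"title": "Quantum field theory", "author": "Doe, J"},
}

if __name__ == "__main__":
    idx = build_index(RECORDS)
    quantum = lookup(idx, "title", "quantum")
    integrable = lookup(idx, "title", "integrable")
    print(quantum & integrable)   # AND
    print(quantum | integrable)   # OR
    print(quantum - integrable)   # NOT
```

Each field keeps its own postings, so a query can be restricted to one field or combined across fields with set intersection, union and difference.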

13/22 CDSware: Summary
[Architecture diagram. Labels: CDSware (metadata + data); modules BibConvert, BibUpload, BibSched, BibData, BibWords, BibHarvest, BibFormat, WebSubmit, WebSearch, WebPerso, WebAccess; actors: system librarian, author, user, admin; OAI/non-OAI data provider; OAI data providing; OAI services.]
OAI compatible!

14/22 OAI at CERN: our experience
The different points of view:
- Archivists
- Librarians
- Researchers
- Managers
- Computer scientists

15/22 OAI at CERN: the archivist view
- DC or MARC metadata is not enough: OAIS (Reference Model for an Open Archival Information System).
- Important documents are printed. Long-term electronic preservation is only half-trusted.
  - Need to run an “OA printshop”…
  - Do you really mean “Archive”?…

16/22 OAI at CERN: the librarian view
Thank you, but it does not solve everything!
- Look at a simple example: oai:arXiv:hep-th/

17/22 OAI at CERN: the librarian view - author example
- In subscription:
  From:
  Author: J. Lukierski (Institute for Theoretical Physics, University of Wroclaw, Poland)
- With OAI GetRecord: Lukierski, J.
- In CERN Library:
  - author: Lukierski, J
  - affiliation: Institute for Theoretical Physics, University of Wroclaw, Poland
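
The gap between the three forms can be made concrete with a short sketch. It shows one possible heuristic for splitting the subscription-style author line into the separate author and affiliation fields a library record wants; it is not CERN's cataloguing procedure, and the function `split_author` and its regular expression are assumptions. Note that the GetRecord form carries no affiliation at all, so nothing can be recovered from it.

```python
import re

# Matches "Forenames Surname (Affiliation)"; a heuristic, not a general name parser.
AUTHOR_LINE = re.compile(r"^\s*(?P<name>[^()]+?)\s*\((?P<affiliation>.+)\)\s*$")


def split_author(raw):
    """Split an author string into inverted name and affiliation, when present."""
    match = AUTHOR_LINE.match(raw)
    if match is None:
        # No parenthesised affiliation: keep the string and report the gap.
        return {"author": raw.strip(), "affiliation": None}
    forenames, _, surname = match.group("name").rpartition(" ")
    author = "{}, {}".format(surname, forenames.rstrip(".")) if forenames else surname
    return {"author": author, "affiliation": match.group("affiliation")}


if __name__ == "__main__":
    # Subscription form, as quoted on the slide.
    print(split_author(
        "J. Lukierski (Institute for Theoretical Physics, "
        "University of Wroclaw, Poland)"
    ))
    # OAI GetRecord form: the affiliation is simply not there.
    print(split_author("Lukierski, J."))
```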

18/22 OAI at CERN: the librarian view - “comment” example
- With or OAI GetRecord:
  - Comment: LaTeX, 9 pages, Invited talk at 11-th International Colloqium "Quantum Groups and Integrable Systems", June 2002, Prague, presented by J. Lukierski; in press in Proceedings Volume of Czech. J. Phys. vol. 52, (2002)
- In CERN Library:
  - Page number: 9 p
  - Conference code: prague
  - Appears in: 11th International Colloquium on Quantum Groups and Integrable Systems, Prague, Czech Republic, Jun 2002 (list conference papers)
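
Turning the free-text comment into structured library fields takes extra processing. The sketch below pulls out only the page count, the simplest of the three fields; the conference code and the "Appears in" link need a lookup against a conference database and are not attempted. The function `page_count` and its pattern are assumptions for illustration, not the library's actual tooling.

```python
import re


def page_count(comment):
    """Return a library-style page count such as '9 p', or None if none is found."""
    match = re.search(r"\b(\d+)\s+pages?\b", comment, flags=re.IGNORECASE)
    return "{} p".format(match.group(1)) if match else None


# Beginning of the comment quoted on the slide.
COMMENT = (
    'LaTeX, 9 pages, Invited talk at 11-th International Colloqium '
    '"Quantum Groups and Integrable Systems", June 2002, Prague'
)

if __name__ == "__main__":
    print(page_count(COMMENT))
```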

19/22 OAI at CERN: the researcher view
Where the hell is the Higgs boson? (CERN-DI)

20/22 OAI at CERN: the manager view
Does OAI bring savings?
- Some hope!
  - If one day it allows full, high-quality document harvesting (less maintenance)
  - If one day it allows journal subscription cancellation
  - If one day it becomes a long-term archiving solution…
- But today?
  - Let's get research grants (NSF, EC…)!

21/22 OAI at CERN: the computer scientist view
- OAI: what a nice recipe!
  - Easy to cook
  - And still a lot to play with!
- A large community of OAI addicts is born

22/22 Conclusion
- CERN will continue to be involved in the Open Archive movement by:
  - Providing, supporting and enhancing CDSware
  - Joining initiatives to promote the idea
- And let's hope it will be as successful as the open source movement…
Thank you.

23/22 Contact
- CERN Document Server
- CDSware sources, mailing lists, demo
- Contact