A Global Digital Library for High-Energy Physics Annette Holtkamp CERN-UNESCO School on Digital Libraries – Rabat, Nov 2010.

Slides:



Advertisements
Similar presentations
INSPIRE A new information system for High-Energy Physics
Advertisements

How do High Energy Physics scholars search their information? Anne Gentil-Beccot, CERN – 11 December 2007, GL9 conference.
50 Years of Experience in Making Grey Literature Available Matching the Expectations of the Particle Physics Community Carmen ODell.
Library The Web of Science, Bibliometrics and Rankings 23 November 2011.
Global Digital Library in High-Energy Physics Jan Iwaszkiewicz (CERN) Digital Repositories – Linked Open Data – the possible Role of D4Science 16 th Dec.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
Michael Maune Carl von Ossietzky University, Oldenburg and Institute for Science Networking Oldenburg Distributed Open Access Reference Citations Service.
Jane Long, MA, MLIS Reference Services Librarian Al Harris Library.
ISI Web of Knowledge – Innovative Solutions ISI Web of Knowledge / Web of Science – coming developments BIOSIS Archive Web Citation Index – New product.
Trends in Scientific Publishing Guenther Eichhorn DirectorAbstracting & Indexing Cambridge, MA April 2010.
1 2 HEP aims to understand how our Universe works: -Experimental HEP : builds the largest scientific instruments ever to reach.
Maximizing the benefit of research information in Particle Physics *** A user-driven story Anne Gentil-Beccot, CERN. EuroCris. 11 May 2010.
Citing and reading behaviours in High Energy Physics *** Learning from OA bibliometrics? Anne Gentil-Beccot, CERN. Uppsala. 17 November 2010.
SPIRES and INSPIRE Travis Brooks SLAC National Accelerator Laboratory INSPIRE Collaboration PPA Computing 1 July 2010.
Realizing the Dream of a Global Digital Library in High-Energy Physics Annette Holtkamp, Salvatore Mele, Tibor Simko, Tim Smith CERN, Geneva DML 2010 –
Information-Seeking Behavior in the High-Energy Physics Community Tamar Sadeh School of Informatics, City University, London Ex Libris HCI conference,
JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS The Library behind the scene Opportunities for Scientific.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
P. Boyce 1 Use of Astronomy’s Info System : The Highly Productive User Peter B. Boyce Maria Mitchell Association and Past Executive Officer American Astronomical.
What have we done? What are we doing? What can we do? Travis Brooks (SLAC) Zaven Akopov (DESY)
Doing Literature Research in Psychology You already have substantial experience in searching for material on line. There are innumerable search engines.
New services and Springer Guenther Eichhorn Director, Abstracting & Indexing Cambridge, MA (April 2010)
Web of Science: An Introduction Peggy Jobe
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
Introduction to Information Retrieval Got a question concerning literature? Ask! Marion Bierhahn (4630) Where is the library? Bldg:1d.
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
Information systems for HEP: INSPIRE, arXiv and more Annette Holtkamp CERN ASP 2012 Kumasi, Ghana, Aug 3, 2012.
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
Some facets of knowledge management in mathematics Wolfram Sperber (Zentralblatt Math) Patrick Ion (Math Reviews) Facets of Knowledge Organization A tribute.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
INSPIRE Travis Brooks (SLAC) Tibor Simko (CERN). SPIRES’ History Index to HEP literature for 35 years Via terminal login Via Via web (1st U.S. Website/1st.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire CDS Invenio CERN’s open source digital library information.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
Processing e-literature at CERN Corrado Pettenati Mick Draper20 March 2000 Processing electronic literature: CERN case study C. Pettenati (ETT-SI) M. Draper.
E-Infrastructures for scholarly communication A first step to OA. An indispensable step for e-Science The case of High-Energy Physics Jens Vigen – Head.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire Digital Library and Conferencing update HEPiX at Cornell.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
CERN - IT Department CH-1211 Genève 23 Switzerland t The CERN Document Server 12 th November 2010 Tim Smith.
Using Electronic Sources to Find Information Kay Grieves Information Services, 2002.
Tullio Basaglia, CERN GS-SI CERN Scientific Information Service The context Presentation of the Service How do they search and use information? The project.
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
S YCAMORE S CHOLARS ISU Institutional Repository.
A Tony Thomas-inspired guide to INSPIRE The evolution from SPIRES to INSPIRE and what it means for you Tony Thomas 60th Birthday Fest Feb Heath.
CERN - IT Department CH-1211 Genève 23 Switzerland t INSPIRE A Global Digital Library for HEP 14 th February 2011 Tim Smith on behalf of.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
Library Science talk – Geneva/Bern 27./ Integrating information resources Annette Holtkamp CERN/DESY.
VIVO and Scholarly Repositories: Synergistic Opportunities.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
Welcome to de Gruyter Reference Global. De Gruyter Reference Global provides you with comprehensive access to high quality academic content Run a quick.
T. Brooks OAI6 18/6/09 Giving researchers what they want SPIRES, High-energy physics and subject repositories Travis Brooks SLAC National Accelerator Laboratory.
Open CERN The context High Energy Physics information landscape Open Access: 3 myths to be dispelled Policies Some stats Licenses What’s next:
1 The next generation HEP information system. HEP scientists love community services 2 What is the primary source of information for HEP scientists? From.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
HEPont.rdf Kirsten Sachs AAHEP7 April 3, 2014 HEP astro nucl ins-det acc-ph cs math cond-mat data-an general
January 29, 2007Annette Holtkamp, DESY 54.5% published in 6 journals: PRD, PLB, JHEP, NPB, PRL, NIMA (rest: 143 journals) 86% published by 4 publishers:
CDS. CERN Document Server as the Library catalogue and institutional repository.
Roger Mills February don’t be evil stand on the shoulders of giants.
Deep Indexing in ProQuest Health and Medical Databases.
Information Literacy & Open Access for Physics and Astronomy Graduate Students Jackie Werner, Science Librarian Georgia State University
How the Library can support your project or dissertation
The High Energy Physics information platform: Introduction
Annette Holtkamp - AAHEP7
Tim Smith CERN Geneva, Switzerland
H.B. O'Connell HEP Info Summit DESY May 2008
Introduction to Information Retrieval
Introduction to Information Retrieval
Gwyn P. Williams and Kim Kindrew Pizza Seminar, September 18, 2013
Citation databases and social networks for researchers: measuring research impact and disseminating results - exercise Elisavet Koutzamani
DESY Documentation: Status + projects
Presentation transcript:

A Global Digital Library for High-Energy Physics Annette Holtkamp CERN-UNESCO School on Digital Libraries – Rabat, Nov 2010

2 HEP community l closely-knit community n 20-30k active researchers publishing 10k articles n large collaborations (up to 5000 members) n very international (even small author groups) n authors = readers l rapid information exchange essential n mailing of preprints since the 60’s n long OA tradition n >90% of HEP journal articles on arXiv l dominance of community based information systems n arXiv n SPIRES

Dominance of community services 3 From 2007 survey of 2,000 physicists. Gentil-Beccot et al, Information Resources in High-Energy Physics: Surveying the Present Landscape and Charting the Future Course. J.Am.Soc.Inf.Sci.60: ,2009 arXiv:

SPIRES (1974-) 4 l network of databases n HEP literature, conferences, institutions, experiments, hepnames, jobs l SLAC – DESY – Fermilab Collaboration l SPIRES-HEP n Metadata for 850k objects, ~800 new records per week n Preprints, journal articles, conference contributions, books, grey literature n since 1974, web server since 1991 n 100k searches/day l high data quality, manually curated, comprehensive coverage l high acceptance, user involvement But: l outdated technology from the 70‘s

5 run by (2007-)

6 Bibliographic Content l SPIRES content (plus part of CDS): journal articles, conference proceedings, preprints, experimental notes, theses l going beyond SPIRES: conference slides, multimedia, software, high-level research data… l going back before 1974 l more material from neighboring disciplines astrophysics, nuclear physics, mathematics… cited by core HEP articles

7 “Fulltext” repository l all freely accessible articles n esp. “endangered” material l access restricted articles n “hidden archive” n agreements with Springer and APS l historical material n scanning of old preprint series l beyond articles n slides, multimedia, software, wikis…

8 Search l Google-like freetext search l fulltext search l Complex second-order searches Example: Find the most influential HEP core papers that cite the Hitchin article „Generalized Calabi-Yau manifolds“ but don‘t cite any papers by Polchinski refersto:reportnumber:math/ collection:core cited:100->9999 NOT refersto:author:Polchinski

9 Fulltext search - snippets

10 Detailed record page l abstract l keywords l publication info l thumbnails of figures l various export formats l tabs for n references n citations n fulltext n full-sized plots with captions

11 Detailed record with plots

Plot extraction l figures extracted from LaTeX sources (arXiv) l captions searchable soon to come: l extraction from pdf l phrase from fulltext referencing a figure

13 Citation analysis l cited by l co-cited with l self-citations l citation history

Citation analysis: Example

15 Author page l affiliation history l coauthors l frequent keywords l article classification l citation summary

17 HEPNAMES

18 HEP taxonomy hierarchical structure of all important l HEP concepts (dynamical symmetry breaking) providing n synonyms (dynamically broken) n related terms (spontaneous symmetry breaking) n broader/narrower (symmetry breaking) n definitions n subject areas (high-energy physics – theory)

19 Keyword extraction arXiv: Author keywords: quantum cosmology -> quantum cosmology wheeler-dewitt equation -> tunneling probability -> tunneling positive cosmological constant -> cosmological constant Composite keywords: 10 transformation, canonical [22, 24] 9 potential, symplectic [22, 33] 3 tensor, energy-momentum [3, 3] 2 quantization, canonical [8, 24] 2 symmetry, gauge [4, 2] 2 oscillator, harmonic [2, 2] 1 dimension, 2 [0, 33] 1 fluid, pressure [22, 2] 1 operator, differential [16, 1] 1 inflation, open [4, 1] 1 field theory, scalar [0, 1] Single keywords: 19 wave function 14 tunneling 13 Wheeler-DeWitt equation 13 cosmological constant 8 zero mode 7 Robertson-Walker 7 quantum cosmology 6 variational 5 Schroedinger equation 4 boundary condition 4 Poisson bracket 4 phase space Acronyms: WDW Wheeler-DeWitt equation Core keywords: Wheeler-DeWitt equation quantum cosmology

20 Taxonomy applications l fast automatic generation of keywords n enabling e.g. prompt alerts n manually curated afterwards l automatic selection of HEP relevant articles n no longer time delay in border areas due to manual selection l improved search algorithm (planned) n A search for „SUSY“ will also find „supersymmetry“ n narrow/broaden search l user tagging (planned) n improve Inspire generated classification n improve taxonomy

21 Author identification l INSPIRE author id n compatible with other identification schemes n active participation in ORCID l author disambiguation n using e.g. lab id’s, affiliation history, coauthors and more n INSPIRE-id’s already assigned l automatic association of papers with authors n using info on affiliations, coauthors, research topics, from publishers G. Chen: 963 docs, 21 real authors, only 22 docs not assigned, 97.2% success rate n INSPIRE-id part of author lists of large collaborations

22 Future projects l innovative metrics l semantic analysis l content indexing of plots and tables l recommender systems n combining citations, keywords, fulltext, usage pattern data... l open API for 3rd party tools and searching l object aggregation (OAI-ORE) l OAIS standards for long-term document preservation