Library Science talk – Geneva/Bern 27./28.9.09 1 Integrating information resources Annette Holtkamp CERN/DESY.

Slides:



Advertisements
Similar presentations
Creating Institutional Repositories Stephen Pinfield.
Advertisements

S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
The Data Lifecycle and the Curation of Laboratory Experimental Data Tony Hey Corporate VP for Technical Computing Microsoft Corporation.
1 2 HEP aims to understand how our Universe works: -Experimental HEP : builds the largest scientific instruments ever to reach.
Maximizing the benefit of research information in Particle Physics *** A user-driven story Anne Gentil-Beccot, CERN. EuroCris. 11 May 2010.
Citing and reading behaviours in High Energy Physics *** Learning from OA bibliometrics? Anne Gentil-Beccot, CERN. Uppsala. 17 November 2010.
Realizing the Dream of a Global Digital Library in High-Energy Physics Annette Holtkamp, Salvatore Mele, Tibor Simko, Tim Smith CERN, Geneva DML 2010 –
Information-Seeking Behavior in the High-Energy Physics Community Tamar Sadeh School of Informatics, City University, London Ex Libris HCI conference,
JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS The Library behind the scene Opportunities for Scientific.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
FAO and UNESCO-IOC/IODE Combine Efforts in their Support of Open Access Written by Marc Goovaerts, U. Hasselt, BE.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
© 2013 Association for Computing Machinery Honeywell Introduction to the ACM Digital Library January 16, 2013 Honeywell Introduction to the ACM Digital.
ⓒ UNIST LIBRARY UNIST Institutional Repository ⓒ UNIST LIBRARY
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Information systems for HEP: INSPIRE, arXiv and more Annette Holtkamp CERN ASP 2012 Kumasi, Ghana, Aug 3, 2012.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
INSPIRE Travis Brooks (SLAC) Tibor Simko (CERN). SPIRES’ History Index to HEP literature for 35 years Via terminal login Via Via web (1st U.S. Website/1st.
JUMPSTART YOUR DISSERTATION TIME SAVING METHODS FOR SEARCHING AND CITING.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Research evaluation requirements José Manuel Barrueco Universitat de València (SPAIN) Servei de Biblioteques i Documentació May, 2011.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
JINR DOCUMENT SERVER: Current Status and Future Plans (From Open Access Repositories to Digital Libraries and to the Knowledge Infrastructure) I.Filozova.
Scholarly communications Discussion group Linked Data Workshop May 2010.
Open access & visibility Management Digital Preservation ORA: Purposes.
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CERN - IT Department CH-1211 Genève 23 Switzerland t INSPIRE A Global Digital Library for HEP 14 th February 2011 Tim Smith on behalf of.
10/07/2008 Semantic Web Technologies & Higher Education.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
VIVO and Scholarly Repositories: Synergistic Opportunities.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
T. Brooks OAI6 18/6/09 Giving researchers what they want SPIRES, High-energy physics and subject repositories Travis Brooks SLAC National Accelerator Laboratory.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
A Global Digital Library for High-Energy Physics Annette Holtkamp CERN-UNESCO School on Digital Libraries – Rabat, Nov 2010.
Open CERN The context High Energy Physics information landscape Open Access: 3 myths to be dispelled Policies Some stats Licenses What’s next:
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
DIGITUM: Digital Deposit of the University of Murcia Antonia Angosto, Enrique Mingorance Murcia, 2012.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
1 The next generation HEP information system. HEP scientists love community services 2 What is the primary source of information for HEP scientists? From.
Entering the Data Era; Digital Curation of Data-intensive Science…… and the role Publishers can play The STM view on publishing datasets Bloomsbury Conference.
CombeDay Making Data Openly Available Simon Coles.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
CitEc as a source for research assessment and evaluation José Manuel Barrueco Universitat de València (SPAIN) May, й Международной научно-практической.
An Overview of Data-PASS Shared Catalog
The High Energy Physics information platform: Introduction
Annette Holtkamp - AAHEP7
Tim Smith CERN Geneva, Switzerland
H.B. O'Connell HEP Info Summit DESY May 2008
VI-SEEM Data Repository
PREMIS Tools and Services
DESY Documentation: Status + projects
Presentation transcript:

Library Science talk – Geneva/Bern 27./ Integrating information resources Annette Holtkamp CERN/DESY

Annette Holtkamp Library Science talk – Geneva/Bern 27./ From secrecy… Hooke’s law: ceiiinossssttuv 2

Annette Holtkamp Library Science talk – Geneva/Bern 27./ …via journals… For centuries, scientific journals performed the crucial function of disseminating scientific results and providing the basis of scientific reputation Today, they provide a corset too restrictive for modern scholarly communication. What is the contribution of one of the 1000s of authors on a LHC article? 3

Annette Holtkamp Library Science talk – Geneva/Bern 27./ …to Open Science We should aim to create an open scientific culture where as much information as possible is moved out of people's heads and labs, onto the network and into tools that can help us structure and filter the information. (Michael Nielsen) 4

Annette Holtkamp Library Science talk – Geneva/Bern 27./ SPIRES HEP database l early form of Open Access in HEP by institutes mailing preprints worldwide l preprint catalog evolved into SPIRES HEP l 35 years of high-quality human-proofed metadata curated at DESY, Fermilab, SLAC l early integration of preprint and journal metadata l close collaboration with arXiv l close relationship to user community

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Advantage of community services 6 What is the primary source of information for HEP scientists? From 2007 survey of 2,000 physicists. Gentil-Beccot et al, Information Resources in High-Energy Physics: Surveying the Present Landscape and Charting the Future Course. J.Am.Soc.Inf.Sci.60: ,2009 arXiv:

Annette Holtkamp Library Science talk – Geneva/Bern 27./ run by

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Inspire integrated information platform tailored to the specific needs of HEP researchers l providing access to the complete HEP literature l fulltext repository l offering text- and data-mining applications l Web2.0 tools l based on an open source multimedia digital library system l freely accessible to anyone l going into production at the end of the year

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Extended publication Article supplemented by additional material l Data l Multimedia l Software l … Aggregation of diverse digital objects 9

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ display and manipulate diagrams with Mathematica 11

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Aggregation Easy access to related material/information n Conference slides n Preprint n Journal article n Supplementary material n Comments, reviews n Visualizations n Similar articles n Derivative works n … at different levels n article, author,... 13

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Why publish more than articles? l Increased reproducibility and reusability l But reputation still based on journal articles l Incentive needed to publish supplementary material So make all scholarly objects l visible l independently searchable l citable l measurable 16

Annette Holtkamp Library Science talk – Geneva/Bern 27./ International Lattice Data Grid 17 l worldwide project to share lattice QCD configurations (Monte Carlo simulations) see e.g. l Semantic data access to worldwide distributed data (~100 TB) l Union of regional data grids (grid-of-grids) n Australia, France, Germany, Italy, Japan, UK, USA n founded in 2001, interoperable since Jul 07 l Metadata standards for describing configurations l Standards on binary file formats l standard interfaces

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ l Data publishing journal l peer reviewed l Open Access

Annette Holtkamp Library Science talk – Geneva/Bern 27./ l Independent “publication” of non-article scholarly objects l Persistent identifiers n DOI’s? l Citation standards l Metrics l Wider notion of aggregation 22

Annette Holtkamp Library Science talk – Geneva/Bern 27./ OAI-ORE Open Archives Initiative Object Reuse and Exchange (OAI- ORE) defines standards for the description and exchange of aggregations of Web resources. These aggregations, sometimes called compound digital objects, may combine distributed resources with multiple media types including text, images, data, and video. The goal of these standards is to expose the rich content in these aggregations to applications that support authoring, deposit, exchange, visualization, reuse, and preservation. 23

Annette Holtkamp Library Science talk – Geneva/Bern 27./ OAI-ORE l Resources identified by URIs l Resource map defines ingredients of an aggregation and the relations between them l Relationships expressed in semantically meaningful way as triples n Subject – predicate – object l Understandable by robots 24

Annette Holtkamp Library Science talk – Geneva/Bern 27./ OAI-ORE Resource Map 25

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Disambiguation Which J. Ellis is this? l unique author identification n using e.g. lab id’s, affiliation history, research topics… l unique association of papers with authors using info on affiliations, coauthors, from publishers and the community (“claim my paper”) l compatible with other author-id schemes e.g.Thomson-Reuter’s ResearcherID

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Semantic publishing Scientific article as a machine-readable knowledge base: …anything that enhances the meaning of a published journal article, facilitates its automated discovery, enables its linking to semantically related articles, provides access to data within the article in actionable form, or facilitates integration of data between papers (David Shotton) 27

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Semantic publishing l Machine-understandable semantic markup l Embedded metadata l links to external resources, web-based ontologies l Actionable data, interactive figures l Data fusion (mash-ups) l Structured document summary l … 28

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Example of a PLoS paper enhanced by D. Shotton et al.:

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Citation in context 30

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Structured Digital Abstract l Short papers on protein-protein interactions l SDA complement to the regular journal article abstract l XML-encoded summary n Names of interacting proteins, unique identifiers, links to MINT and Uniprot n Types of protein-protein interaction involved n Vocabulary from the Molecular Interaction ontology 31 Molecular INTeraction database MINT

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Publishing the process Open Notebook Science is the practice of making the entire primary record of a research project publicly available online as it is recorded. This involves placing the personal, or laboratory, notebook of the researcher online along with all raw and processed data, and any associated material, as this material is generated. (Wikipedia) UsefulChem, OpenWetWare, … 33

Annette Holtkamp Library Science talk – Geneva/Bern 27./ UsefulChem Example from UsefulChem 34

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Blogs 35

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Structuring knowledge l Standardized metadata n descriptive n administrative n structural l Integrated landscape of metadata l Ontologies n formalized representation of the knowledge of the domain 36

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Organizing knowledge on HEP (Poly)hierarchical organization (taxonomy) of all important l HEP terms (dynamical symmetry breaking) providing l synonyms (dynamically broken) l related terms (spontaneous symmetry breaking) l broader/narrower (symmetry breaking) l definitions l subject areas (high-energy physics – theory) applicable to all material

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Taxonomy applications in Inspire l keywords included in metadata of all material l automatic selection of HEP relevant material n selective harvesting n no longer time delay in border areas due to manual selection l fast automatic generation of keywords n enabling e.g. timely alerts/feeds l improved search algorithm (planned) n A search for „SUSY“ will also find „supersymmetry“ n narrow/broaden search l user tagging (planned) n Combine controlled vocabulary with folksonomy

Annette Holtkamp Library Science talk – Geneva/Bern 27./ The most important resource… … is our community l New material (drop box) l Comments, reviews, ranking, blogs… l Aggregation l Corrections l Classification, subject tagging 39

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ l Comprehensive directory of philosophy articles n ~200k records n From journals, archives, personal pages l Community involvement n User submission n Discussion forum n Taxonomy-based categorization 41

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ To read | Discuss | Edit | Categorize | Remove from this list | File under.. | Export | Scholar | More..ScholarMore..

Annette Holtkamp Library Science talk – Geneva/Bern 27./ RePEc Research Papers in Economics l Public-access decentralized database with ~ items n Working papers, articles, books, software n Author pages n Institutional listings l Collaborative effort of hundreds of volunteers, no paid staff l Input from departments, institutional archives and publishers n No direct user submission l Author registration (>20.000) n Unique author id, profile with bibliographic data (~1/2 of database claimed) n Statistics on downloads and abstract views 44

Annette Holtkamp Library Science talk – Geneva/Bern 27./ RePEc ranking l Impact factor n Simple, age discount, recursive l Ranking of works n Number of citations, weighted by age, by impact factor l Ranking of authors n Number of works, weighted by number of authors or various impact factors n Citation counts, weighted by number of authors, various impact factors, h-index etc n Popularity (abstract views, downloads) n Various aggregations of criteria l Ranking of institutions l Ranking of geographic regions l … 45

Annette Holtkamp Library Science talk – Geneva/Bern 27./

Annette Holtkamp Library Science talk – Geneva/Bern 27./ Working together l community databases l libraries l IT l publishers l other information providers l neighboring communities always in close contact with our community Visit