Digitometric Services for Open Archives Environments

Slides:



Advertisements
Similar presentations
A centre of expertise in digital information management The OAI Protocol for Metadata Harvesting Andy Powell UKOLN,
Advertisements

Search, access and impact: Web citation services Tim Brody Intelligence, Agents, Multimedia Group University of Southampton.
OAI Protocol for Metadata Harvesting Tim Brody Intelligence, Agents, Multimedia Group University of Southampton OpCit –
Preserv Preservation Eprint Services Simple Preservation Services – towards Proactive Support for the Institutional Repository.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
A busy persons introduction to OAI-PMH Christopher Gutteridge ALT, April 2003.
A brief overview of the Open Archives Initiative Steve Hitchcock Open Citation Project (OpCit) Southampton University Prepared for Z39.50/OAI/OpenURL plenary.
Preserv Preservation Eprint Services Scenario: Digital lifecycle begins with author creation and deposit of paper or data content into the institutional.
Tim Brody University of Southampton CiteBase Services 13/07/2001.
Crystal Structure EPrints: Source Through the Open Archive Initiative S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge.
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
Web of Knowledge Tracking a good article Who else has used this paper?
Y.T. a brief history of the OAI 0 Kaynak: Herbert van de Sompel.
Electronic publishing: issues and future trends Anne Bell.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
Universidad Autónoma del Estado de México Network of Scientific Journals of Latin America, the Caribbean, Spain and Portugal Redalyc The Open Archives.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
Open Access Citation Index Services Tim Brody Intelligence, Agents, Multimedia Group University of Southampton.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
NAL-Institutional Repository: A Case Study CSIR Metadata Harvester I.R.N. Goudar Head, ICAST, NAL National Symposium on Open Access and.
Serenate1 Non-standard users: The Library Raf Dekeyser K.U.Leuven.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
4th March 2002Tim Brody 1 A joint JISC/NSF project.
IESR Interfaces: Current Services and Future Plans Ann Apps MIMAS, The University of Manchester, UK.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
The ISI Web of Knowledge nce/training/wok/#tab3.
DNER Architecture Andy Powell 6 March 2001 UKOLN, University of Bath UKOLN is funded by Resource: The Council for.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
L JSTOR Tools for Linguists 22nd June 2009 Michael Krot Clare Llewellyn Matt O’Donnell.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
The OAI: technical overview OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University -- Computer Science.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Open Archives Initiative Protocol for Metadata Harvesting.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Standards OAI-Protocol Metadata: DC - Agris - MODS Marc Goovaerts Hasselt University Library ODIN-PI TRAINING OSTENDE, May 2008.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
AMERICAN INSTITUTE OF PHYSICS URL:
Mod_oai: Metadata Harvesting for Everyone Michael L. Nelson, Herbert Van de Sompel, Xiaoming Liu, Aravind Elango
Metadata & Repositories Jackie Knowles RSP Support Officer.
Web Services Overview Thomas Hickey. 2 What are Web Services? Machine-to-machine communication Run over standard Web protocols –XML syntax, HTTP packaging.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Introduction to SHERPA RoMEO and its Significance for Publishers
Open your Alfresco Data
Using JSTOR May 2016.
An Overview of Data-PASS Shared Catalog
Georges Arnaout Chaitanya Krishna
Accessing a national digital library: an architecture for the UK DNER
Jenn Riley Metadata Librarian Digital Library Program
Making the most of research outputs
OAI and Metadata Harvesting
Building on the shoulders of Giants: the Scholarly Web
Cataloging the Internet
NSDL Data Repository (NDR)
OAI 11/20/07.
Open Archive Initiative
JISC Information Environment Service Registry (IESR)
Interoperable Repository Statistics
IVOA Interoperability Meeting - Boston
Jenn Riley Metadata Librarian Digital Library Program
Presentation transcript:

Digitometric Services for Open Archives Environments Tim Brody Simon Kampa, Stevan Harnad, Les Carr, Steve Hitchcock {tdb01r,srk,harnad,lac,sh94r}@ecs.soton.ac.uk University of Southampton, Intelligence, Agents, Multimedia Group 08 December 2018 ECDL 2003, Trondheim, Norway

Open Archives Initiative The protocol is openly documented, and metadata is “exposed” to at least some peer group (note: rights management can still apply!) Archive defined as a “collection of stuff” -- not the archivist’s definition of “archive”. “Repository” used in most OAI documents. Promoting interoperability 08 December 2018 ECDL 2003, Trondheim, Norway

OAI Data Model: Resources/Items/Records All available (meta)data about the resource Item = OAI identifier item Dublin Core Metadata MARC Metadata ??? XML records record = metadata + identifier + datestamp 08 December 2018 ECDL 2003, Trondheim, Norway

Protocol Responses 08 December 2018 ECDL 2003, Trondheim, Norway

Protocol 1 2 3 HTTP URL Requests Service Provider Data Provider XML Responses Identify 1 Collection-level Description ListRecords?metadataPrefix=xyz 2 All repository xyz records 3 ListRecords?from=2003-04-02&… All repository xyz records since 2003-04-02 08 December 2018 ECDL 2003, Trondheim, Norway

Other Commands ListIdentifiers ListMetadataFormats ListSets GetRecord Return only the identifier/datestamp/set membership ListMetadataFormats Return the available data formats ListSets Return the set structure (if there is one) GetRecord Return a record given by OAI identifier 08 December 2018 ECDL 2003, Trondheim, Norway

Interest in OAI 111 registered OAI repositories Many unregistered (e.g. all GNU EPrints.org and DSpace archives) 4,500,000 public records http://arc.cs.odu.edu/ NSDL project, UK’s JISC Information Environment OLAC (language community built on OAI) 08 December 2018 ECDL 2003, Trondheim, Norway

Why OAI? Mandated Dublin Core allows the quick establishment of basic services and tools Simple and metadata-neutral protocol allows more interesting possibilities (without breaking 1.) and extensions … 08 December 2018 ECDL 2003, Trondheim, Norway

Adding Caching to OAI-PMH 08 December 2018 ECDL 2003, Trondheim, Norway

Celestial (OAI Cache) Developed to maintain a local metadata copy Avoid repeated, large harvests during development Provides an abstraction over multiple OAI versions (hence acts as a gateway to older implementations) Useful for testing OAI implementations & improving performance Using XSLT provides a Web interface to OAI Provides redundancy 08 December 2018 ECDL 2003, Trondheim, Norway

08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search – Data Model e-Services 08 December 2018 ECDL 2003, Trondheim, Norway

Content 250,000 full-text resources 6 million references 240,000 of which arXiv.org 6 million references 29 mean refs/paper (therefore failed to extract references for 18% of papers) (n.b. modal refs is 19) 1 million references linked internally to the full-text (15%) 08 December 2018 ECDL 2003, Trondheim, Norway

08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search 08 December 2018 ECDL 2003, Trondheim, Norway The abstract page shows the usual title/authors/abstract and some analysis of the current article. The graph shows over time when the paper has been cited and when it has been downloaded. 08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search: Navigation by Citation Links Article with reference list Future Reference link Following the abstract are links to related pages by citations. These links can go backwards in time using the reference list, forwards in time by what has cited me, and sideways by either related or co-citation. Related papers are papers that have a similar reference list – often where an author has used the same references more than once! Co-cited is where two papers have been cited next to each other, the same as author co-citation. However co-cited papers can only be found for articles that have been cited, hence can’t be used for new articles. Related Current Article Co-cited Past 08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search cites cites 08 December 2018 This is the reference list, as parsed from the full-text. “eprint” takes the user to the Citebase abstract page of the cited article, journal are bespoke links for the American Physical Society journals. 08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search cites cites 08 December 2018 Articles that have cited the current article, following these links will take the user towards newer papers. 08 December 2018 ECDL 2003, Trondheim, Norway

Citebase Search “Co-cited” 08 December 2018 And co-cited articles. The development version of Citebase also includes Related articles. 08 December 2018 ECDL 2003, Trondheim, Norway

Read/Cite Cycle 08 December 2018 ECDL 2003, Trondheim, Norway

Digitometric Services for OAI Tools for visualising research metadata Builds an analysis service on Citebase Knowledge mapping (co-authors, co-citation, etc.) 08 December 2018 ECDL 2003, Trondheim, Norway

Co-Citation Network 08 December 2018 ECDL 2003, Trondheim, Norway A co-citation map embedded within the Digitometric user interface. The nodes on the map represent individual publications. By hovering with the mouse pointer over a node, the user can generate details (title, author, abstract) in the information box. The arcs between the nodes represent a co-citation relationship. A cluster of related publications are evident in the centre of the map. Four distinct paths emanate out of this indicating the possibility of specialty fields arising out of the main cluster. 08 December 2018 ECDL 2003, Trondheim, Norway

Full Co-Citation Map 08 December 2018 ECDL 2003, Trondheim, Norway A full-sized co-citation map with a lower co-citation threshold resulting in more nodes being included. Several clusters (research fronts) are evident, in particular the large cluster towards the bottom right of the map. Researchers may get a better understanding of their research landscape by exploring these clusters and the relationships between them. Different colours are also used to indicate which nodes have been recently highly cited, paving the way for up-and-coming (or dying) research fronts to be identified. There are also several occurences of 5 or 6 nodes emanating sequentially out of a single node, indicating a sequence of papers being published that address a common problem or theme. 08 December 2018 ECDL 2003, Trondheim, Norway

Digitometric Services for Open Archives Environments http://www.openarchives.org/ http://opcit.eprints.org/ http://citebase.eprints.org/ http://www.eprints.org/ http://www.hyphen.info/ AKT Project (knowledge) Thank you for listening! Tim Brody 08 December 2018 ECDL 2003, Trondheim, Norway