UKOLN is supported by : A centre of expertise in digital information management Paul Walk Building Metadata Aggregation.

Slides:



Advertisements
Similar presentations
Library Resources in the Networked Environment or, Its all about service(s) (and data…) Kevin Kidd Library Applications & Systems Manager Boston College.
Advertisements

Theme 3: Architecture. Q1: Who houses stuff, both records and identifiers All useful services and repositories are centralized (latency, etc.) … but centralizing.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
A centre of expertise in digital information management UKOLN is supported by: Memory institutions and the social fabric of the Web Dr.
A centre of expertise in digital information managementwww.ukoln.ac.uk Enough Talking - Let's Use The Next Generation Technologies! Using Networked Technologies.
Linking Repositories Scoping Study Key Perspectives Ltd University of Hull SHERPA University of Southampton.
The Top 10 Reasons Why Federated Can’t Succeed And Why it Will Anyway.
A centre of expertise in digital information management UKOLN is supported by: If you don’t remember anything else, remember these… Peter.
CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004.
DNER Architecture Andy Powell UKOLN, University of Bath Web of Science Enhancements Committee, Centre Point 5 March.
Introduction to the Information Environment Service Registry Amanda Hill MIMAS, The University of Manchester, UK.
OAIster != Google Kat Hagedorn University of Michigan Libraries October 26, 2007.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
A centre of expertise in digital information management UKOLN is supported by: Signed metadata : method and application International Conference.
A centre of expertise in digital information management UKOLN is supported by: Signed metadata : method and application International Conference.
Corporation For National Research Initiatives NSF SMETE Library Building the SMETE Library: Getting Started William Y. Arms.
Metadata Standards & Applications 7. Approaches to Models of Metadata Creation, Storage, and Retrieval.
A centre of expertise in digital information managementwww.ukoln.ac.uk Web 2.0: The Potential Of RSS And Location Based Services Brian Kelly UKOLN University.
OCLC Online Computer Library Center A Global OpenURL Resolver Registry Phil Norman OCLC Dlsr4lib Workshop March 23 rd, 2006 Arlington VA.
A centre of expertise in digital information managementwww.ukoln.ac.uk Twitter: #or2012 OR 2012: Working With Text Workshop Can We Mine JISCMail Lists?
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
A centre of expertise in digital information managementwww.ukoln.ac.uk Digital Preservation / UK Web Focus Brian Kelly UKOLN University of Bath Bath, BA2.
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
Metadata Standards and Applications 1. Introduction to Digital Libraries and Metadata.
7. Approaches to Models of Metadata Creation, Storage and Retrieval Metadata Standards and Applications.
The "repository ecology" approach to describing cross-search service management Phil Barker and Malcolm Moffat, ICBL, Heriot-Watt University
Microsoft Academic Search Search | Explore | Discover Alex D. Wade Director - Scholarly Communication.
Programs and research Libraries in the new network environment San Jose 16 November 2007 Lorcan Dempsey OCLC.
UKOLN is supported by: Approaches to Metadata Quality Marieke Guy QA Focus A centre of expertise in digital information management
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
DNER Architecture Andy Powell 6 March 2001 UKOLN, University of Bath UKOLN is funded by Resource: The Council for.
Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath
What’s the Big Deal About R? Tom Tiedeman, OCIO July 21, 2015.
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
Introduction People's Network Service and issues for cultural content contributors People’s Network Service cultural content contributor meeting, 27/04/05.
Research libraries in a European e-science infrastructure Wouter Schallier Executive Director LIBER (Association of European Research Libraries)
A centre of expertise in digital information management
Innovation Forum: some conclusions Sarah Porter Head of Innovation, JISC.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
A centre of expertise in digital information management Shaping the e-future? Grids, Web Services and Digital Libraries Professor Tony.
Content Challenges for Open Government Dale Waldt Sr. Analyst / Consultant
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
A centre of expertise in digital information managementwww.ukoln.ac.uk Search Facilities For Web Sites A Discussion Group Session Brian Kelly UKOLN University.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
A centre of expertise in digital information management 10 minute practical guide to the JISC Information Environment (for publishers!)
The world’s libraries. Connected. Linked Data A View of OCLC’s Strategy Ted Fons Executive Director, Data Services,& WorldCat Quality ALA Annual Conference,
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Pcdm, iiif, & interoperability esmé dplafest
OR2009 Atlanta, 18 May1 The Global Registries Initiative: Progress Report and Software Demonstration Chris Blackall, Australian National Data Service Jeremy.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Resource Discovery Landscape
2nd GEO Data Providers workshop (20-21 April 2017, Florence, Italy)
GISELA & CHAIN Workshop Digital Cultural Heritage Network
CREATIVE COMMONS FOR CULTURAL HERITAGE
knowledge organization for a food secure world
Accessing a national digital library: an architecture for the UK DNER
Bloomsbury Conference 24 June 2010
OAI and Metadata Harvesting
Web archive data and researchers’ needs: how might we meet them?
Malte Dreyer – Matthias Razum
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Australian and New Zealand Metadata Working Group
Presentation transcript:

UKOLN is supported by : A centre of expertise in digital information management Paul Walk Building Metadata Aggregation Services for Resource Discovery

2 aggregating metadata

3 why aggregate metadata? to address systems/network latency - a cache supporting resource-discovery for ‘Web Scale concentration’ ‘gaming’ Google - raising ‘visibility’ of content network effects if user facing services also developed to showcase resources to create middleman business opportunities as infrastructure to support 3rd-party services as an approach to preservation

4 patterns harvest from network, aggregate and re- expose discovery.ac.uk, Europeana, RepUK collect from offline sources and make available in aggregate on the network Collections Trust (UK) harvest without re-exposing, build services on top of aggregation Google et.al. expose as a ‘data dump’, or expose through an API

5 the big question facing data providers: do you want to provide a data service, or just data?

6 current work in the UK

7 a metadata ‘ecosystem’ aggregation is a major component preparing resources for aggregation

8 support innovation develop some ‘business intelligence’ develop infrastructure component for services

9 issues with aggregation

10 distribution state management is a challenge! (deletions, changes) aggregation of aggregations is consequently non-trivial e.g. federated models linking? should records in an aggregation ever be the target of a link? Or, should such links point to the source? can/should we make aggregations into Google-friendly targets? if we succeed with SEO, are we undermining source repositories? ‘attribution stacking’ ( access-data-protocol/)

11 openness and usability ‘open’ in danger of becoming synonymous with ‘permissively licensed’ can be both ‘open’ but very difficult to use needs periodic review - right now SPARQL is barrier to wide adoption remember all those SOAP interfaces.... a well supported API might be more open than a completely freely available dump of gigabytes (or more) of data in the sense that it might allow open engagement from more people we need a richer understanding of openness

12 be open, usefully in other words…

13 character encodings.... huge number of XML records from UK IRs are invalid due to character encoding issues.... there is a special place in hell for developers who ignore character encodings...

14 a distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable Leslie Lamport are we creating a new version of this with data....?

15 shifting landscape Google was previously seen as in opposition to a rich metadata approach... recall versus precision Google’s abandonment of OAI-PMH but now... Google, Microsoft & Yahoo committed to improving precision through harvesting of Microdata schema.org and others bridging this divide so, is there still a need for other ‘concentrations’ or can we rely on the global search engines?

16 good practice

17 licensing! use explicit licenses this means requiring explicit licenses from sources if at all possible work with extremely open licenses such as CC0 in data aggregation, especially when using a Linked Data approach, ‘share alike’ might be easier than ‘attribution’

18 “build for normal users, developers and machines” Tom Coates

19 developer-friendly formats XML has a lot going for it: very well supported with tools, libraries etc. well understood & often fits the info models we’re used to but it has some issues: validation is a pain and is very often ignored it’s verbose - it takes up a lot of bandwidth JSON has gained rapid adoption less verbose - good for simple client-side manipulation curl -D - -L -H "Accept: application/rdf+xml" " curl -D - -L -H "Accept: application/json" "

20 service (anti)patterns design your API to be developer-friendly be aware of what works, and of what appears to work but actually might not... share this understanding Paul Walk, An infrastructure service anti-pattern pattern/

21 expect & enable users to filter - give them feeds (RSS/Atom) (CC BY-NC-ND 2.0)

22 workshop tomorrow!

23 tomorrow at 16:15 short presentations from UKOLN on LOCAH and RepUK, and from Edina on aggregating services open discussion on the way forward for metadata aggregation, addressing questions such as: is Linked Data the future for metadata aggregation services? do initiatives like Microdata & schema.org reduce the need for our investment in metadata aggregation services? does usability matter as much as ‘openness’? please join us! …and feel free to bring your own questions & issues to discuss

24 summing up in a sentence....

25 we should use aggregation [applying a tool] with the solving of problems [developing & providing services] to balance the creation of opportunity [building infrastructure]

26 thank you! these slides available here: aggregation-services or