OCLC Online Computer Library Center Erpanet Symposium on Persistent Identifiers PURLs Stuart Weibel Senior Research Scientist June 17, 2004.

Slides:



Advertisements
Similar presentations
3.02H Publishing a Website 3.02 Develop webpages..
Advertisements

Basic Internet Terms Digital Design. Arpanet The first Internet prototype created in 1965 by the Department of Defense.
Reinventing using REST. Anything addressable by a URI is called a resource GET, PUT, POST, DELETE WebDAV (MOVE, LOCK)
DDI3 Uniform Resource Names: Locating and Providing the Related DDI3 Objects Part of Session: DDI 3 Tools: Possibilities for Implementers IASSIST Conference,
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Web browsers.
Web development  World Wide Web (web) is the Internet system for hypertext linking.  A hypertext document (web page) is an online document. It contains.
Jacob Boston Josh Pfeifer. Definition of HyperText Transfer Protocol How HTTP works How Websites work GoDaddy.com OSI Model Networking.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Understanding Active Directory
OPEN-SOURCE SOFTWARE BY: SAMANTHA HERALD  Otherwise known as OSS, is computer software that is available with source code: normally reserved for copyright.
LC and the W3C: History b Attended two W3C Workshops Indexing/Distributed Search Indexing/Distributed Search Query Language Query Language.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
1 APARSEN - WP2200 Identifiers and Citability Interoperability Framework for PI systems Webinar on PI - 15 February 2013 Maurizio Lunghi.
Copyright © 2004, Epok, Inc. Extensible Resource Identifiers (XRIs) XDI Face to Face 28 April 2004.
Locating objects identified by DDI3 Uniform Resource Names Part of Session: Concurrent B2: Reports and Updates on DDI activities 2nd Annual European DDI.
“ Does Cloud Computing Offer a Viable Option for the Control of Statistical Data: How Safe Are Clouds” Federal Committee for Statistical Methodology (FCSM)
 Internet vs WWW  Pages vs Sites  How the Internet Works  Getting a Web Presence.
Cardea Requirements, Authorization Model, Standards and Approach Globus World Security Workshop January 23, 2004 Rebekah Lepro Metz
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
The Internet COM 366 Web Design & Production. Brief history Internet began as nationwide network for Department of Defense in 1960s –Expanded to universities.
Chapter 4 Networking and the Internet Introduction to CS 1 st Semester, 2015 Sanghyun Park.
Web Mastering Module Internet Fundamentals. What is the Internet? –Global network of networks –Communicating using same set of rules (protocols/languages)
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Chapter 8 The Internet: A Resource for All of Us.
1 Multi Cloud Navid Pustchi April 25, 2014 World-Leading Research with Real-World Impact!
NBN:URN Generator and Resolver ERPANET Workshop on Persistent Identifiers Cork, June, Ádám Horváth National Széchényi Library Hungary.
Information Trends in Libraries Get More Value from Data Give More Value to Users Get Users involved July 9, 2007 Stuart Weibel Senior Research Scientist.
11 MANAGING AND DISTRIBUTING SOFTWARE BY USING GROUP POLICY Chapter 5.
OCLC Online Computer Library Center Erpanet Symposium on Persistent Identifiers A framework for understanding Identifiers and “info” URIs Stuart Weibel.
DOI Workshop, Luxembourg - 20 May Identifiers in Context Andy Powell UKOLN University of Bath UKOLN.
Lecture 4 Internet Basics, part II Instructor: Jie Yang Department of Computer Science University of Massachusetts Lowell Exploring the Internet,
OPEN-SOURCE SOFTWARE BY: SAMANTHA HERALD  Otherwise known as OSS, is computer software that is available with source code: normally reserved for copyright.
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
LSIDs in a Nutshell Jun Zhao University of Manchester 1 st December, 2005.
Lesson 2: The World Wide Web Objectives After completing this lesson, you will be able to:  Define WWW and its relation to the Internet.  Explain how.
Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008.
Module - Identifiers The DSpace Course. Module Overview  By the end of this module you will:  Understand what persistent identifiers are, how they work.
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
Introduction to Active Directory
Family Connection Collaborative Webs A Tool for Creating and Managing Web sites.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
The Internet. Important Terms Network Network Internet Internet WWW (World Wide Web) WWW (World Wide Web) Web page Web page Web site Web site Browser.
Low-Risk Persistent Identification: the “Entity” (N2T) Resolver 10 October 2006 John Kunze, California Digital Library, University of California.
1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.
The Internet What is the Internet? The Internet is a lot of computers over the whole world connected together so that they can share information. It.
Uniform Resource Locator URL protocol URL host Path to file Every single website on the Internet has its own unique.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Discovering Computers Fundamentals, 2011 Edition Living in a Digital World.
Web Page Design The Basics. The Web Page A document (file) created using the HTML scripting language. A document (file) created using the HTML scripting.
Ontologies and Linked Data (Introductory Lecture) Piotr Lapo, General Library Expert Nazarbayev University Library
Vincenzo Spinoso EGI.eu/INFN
Introduction to Persistent Identifiers
Some Common Terms The Internet is a network of computers spanning the globe. It is also called the World Wide Web. World Wide Web It is a collection of.
Who is Tim Berners-Lee? Born 8 June 1955 in London (age 59)
Web Site Development.
Component-Based Software Engineering: Technologies, Development Frameworks, and Quality Assurance Schemes X. Cai, M. R. Lyu, K.F. Wong, R. Ko.
Evaluating Internet Resources
Introduction to the Internet and Web
Bina Ramamurthy Chapter 9
Web Server Technology Unit 10 Website Design and Development.
Bina Ramamurthy Chapter 9
Bina Ramamurthy Chapter 9
Unit-3.
Computer Networks Protocols
INTRODUCTION TO THE INTERNET AND WEB
Presentation transcript:

OCLC Online Computer Library Center Erpanet Symposium on Persistent Identifiers PURLs Stuart Weibel Senior Research Scientist June 17, 2004

What do we want from Identifiers Authority Reliability Appropriate Functionality (resolution and other services) Persistence – throughout the life cycle of the information object What are the business models to support identifiers? Not just a matter of money, but costs are part of the equation

PURL: Persistent Uniform Resource Locators PURLs look like URLs… they ARE URLs PURLs emerged from OCLC’s participation in the IETF URN activity A tool for managing names and namespaces since 1996

The 404 Problem Resources disappear… Some are actually gone Disk reorganizations take place Changes in responsibility for resources occur… bought, sold, abandoned, removed URLs serve double duty as names and locators Making URLs symbolic names will improve their usefulness

What is a PURL? PURL: Persistent Uniform Resource Locators They look like URLs… they ARE URLs No new technology, no new protocols A toolset for managing names and namespaces

PURLs take advantage of inherent redirection facility in the HTTP protocol PURLs provide an additional level of indirection that maps a symbolic identifier to a network location PURLs work without plug-ins or other special code in browsers… they are ‘just’ URLs No New Technology added… a feature, not a bug How PURLs Work

Not a guarantee of perpetual access Not a magic solution to the 404 problem Not persistence of resources, but rather of the names PURLs are a toolset that can be used to manage resource names and locations with greater reliability What does Persistent mean?

Persistence derives from… The social or contractual commitments of organizations responsible for managing information resources. Technology can help, but the problem is, at its heart, a social one.

Logical Components of a PURL protocol resolver asset name address

PURL Server as a Redirection Server Client Resource Server resource PURL Server http GET http redirect http GET

PURL Server as a Resource Server Client resource PURL Server http GET

Do I have to run my own PURL Server? OCLC’s PURL Server is open to all, including the ability to request domains As of Monday, May 24, 2004 : PURLs Created = PURLs Resolved = Unique Client Systems = The PURL server software is available at the purl.org site for anyone to download and use without cost or restriction.

PURLs and The Identifier Layer Cake The Web: http…TCP/IP… Functionality Technology Policy Social Business

Functional Layer: Operational characteristics of Identifiers Is it globally unique? No problem – it’s a URL Matching persistence with the need? Organizational commitment Can a given identifier be reassigned? No Is it resolvable? Yes: To that which is assigned by the registrant How does it ‘behave’? Exactly like a URL, but managed Is the ‘name’ portion of the identifier opaque, or can it carry ‘semantics’? Determined by the registrant Do humans need to read and transcribe them? Probably Do identifiers need to be matched to the characteristics of the assets they identify? Determined by the registrant

PURL Technical Layer What dependencies are assumed? http What is the nature of the system Open Source, public domain Are servers centralized? federated? peer to peer? Distributed and stand-alone, but could be federated (see POIs, as an example) How is uniqueness assured? Inherent in the character of URLs

PURL Policy Layer Who has the ‘right’ to assign or distribute Identifiers? Anyone can register without cost Who has the ‘right’ to resolve them or offer services? Unspecified What are appropriate assets? Determined by the registrant Can identifiers be recycled? No Can ID-Asset bindings be changed? Yes, at the discretion of the registrant Is there supporting metadata? No intrinsic PURL metadata Is there a governance model? What you do in the privacy of your own PURL server is your own business

PURL Business model layer Who pays the cost? PURL.ORG is maintained by OCLC as a free service Anyone can run their own PURL server (and pay for it) How, and how much? Negligible costs Who decides? The server host The problem with identifier business models… Those who accrue the value are often not the same as those who bear the costs. Libraries are in the business, however, of aggregating costs and making them look free. You can’t collect revenue on resolution

PURL Social Layer: Who do you trust? Who do you trust? Governments? Cultural heritage institutions? Commercial entities? Non-profit consortia? It depends on the context, the service, and the motivations for the service.

In Summary PURLs offer a methodology and tool set for managing resource names and namespaces Neither PURLs nor any other technology are a replacement for policies or commitments to manage resource names PURLs represent a community-based solution founded in freely available, widely deployed technology.