Ed Summers edsu@loc.gov
Linked Data at loc.gov
Show of hands:
- people that work in the library
- people already familiar with semweb
- people wanting to learn more about the semantic web
Introduce myself: software developer, digital preservation. Disclaimer: I am not speaking for LC. >32 million cataloged books, 470 languages, >61 million manuscripts.
circa 1890 North Wing Capitol Bldg
1889 - 1897 Copyright Act of 1870
circa 1897, books acquired by Copyright
Charles Ammi Cutter Boston Athenaeum
title author subject classification
Sterling Memorial Library at Yale Subject Title Name
started in 1901 ended in 1997
[Men working at machines in Card Division Printing Office, Library of Congress, Washington, D.C.] circa 1900-1920
Henriette Avram, a systems analyst and computer programmer formerly of the NSA, developed MARC in the mid-1960s, initially for telling machines how to automate card printing. It is now in use around the world for sharing bibliographic data.
evil slide #1
ILS (integrated library systems) started in the late 60s and 70s. Conversion of card catalogs to databases.
1990
XML became a W3C Recommendation in 1998. Metadata Object Description Schema (MODS). Brings us up to the current day.
Show of hands: how many people have seen the Linked Data Design Principles before? Walk through these design principles with real live examples from the Library of Congress.
Use URLs as names for things. TimBL, in his TED talk, collapsed the first two rules into one. We are used to using URLs for documents.
David Weinberger, eminent New Yorker and philosopher of the web
Internet as a topic.
Controlled vocabulary of topics. 342,689 concepts
http://id.loc.gov/authorities/sh92002816#concept To enable people to use the data.
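A small sketch of what the `#concept` fragment does: the full URI names the concept itself, while the part before the `#` is the document a client actually fetches. It uses only Python's standard library, and the variable names are illustrative.

```python
from urllib.parse import urldefrag

# URI from the slide: it names the concept, not a web page.
concept_uri = "http://id.loc.gov/authorities/sh92002816#concept"

# HTTP clients never send the fragment to the server, so the document
# that gets fetched is the URI with the fragment stripped off.
document_url, fragment = urldefrag(concept_uri)

print(document_url)
print(fragment)
```

One name for the thing and another for the document about it is the distinction behind "used to using URLs for documents".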
When someone looks up a URI, provide useful information.
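One common way to "provide useful information" is content negotiation: the same URI returns HTML to a browser and RDF to a data client, depending on the `Accept` header. The sketch below is a simplified, standard-library-only illustration of that choice. It ignores the finer specificity rules of the HTTP specification, and it is not a description of how id.loc.gov is implemented. The function names are hypothetical.

```python
def parse_accept(header):
    """Return (media_type, q) pairs from an HTTP Accept header value."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        prefs.append((media_type, q))
    return prefs


def negotiate(header, available):
    """Pick the available media type the client prefers most.

    `available` is in server preference order; its first entry is the
    fallback when nothing in the header matches.
    """
    best, best_q = None, 0.0
    for media_type, q in parse_accept(header):
        for candidate in available:
            matches = (
                media_type == candidate
                or media_type == "*/*"
                or (media_type.endswith("/*")
                    and candidate.startswith(media_type[:-1]))
            )
            if matches and q > best_q:
                best, best_q = candidate, q
                break
    return best or available[0]


offered = ["text/html", "application/rdf+xml"]
print(negotiate("application/rdf+xml, text/html;q=0.5", offered))
print(negotiate("text/html,application/xhtml+xml,*/*;q=0.8", offered))
```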
evil slide #2
SKOS vocabulary taxonomies, thesauri, controlled vocabulary.
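To make SKOS concrete, here is a minimal sketch of a concept described as triples and written out as N-Triples by hand. The label "Internet" is assumed from the earlier "Internet as a topic" slide, and the helper `to_ntriples` is hypothetical. A real application would more likely use an RDF library than string formatting.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

concept = "http://id.loc.gov/authorities/sh92002816#concept"

# (subject, predicate, object) triples; an object is either a URI
# string or a (text, language) pair for a literal.
triples = [
    (concept, RDF_TYPE, SKOS + "Concept"),
    (concept, SKOS + "prefLabel", ("Internet", "en")),  # label is assumed
]


def to_ntriples(triples):
    """Serialize triples as N-Triples lines (simple ASCII literals only)."""
    lines = []
    for s, p, o in triples:
        if isinstance(o, tuple):
            text, lang = o
            escaped = text.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"@{lang}'
        else:
            obj = f"<{o}>"
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)


print(to_ntriples(triples))
```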
Include links to other URIs, so that they can discover other things. Since LCSH is really a target for links from other URIs, I am going to talk about another project: NDNP.
5 years into a 30-year project with NEH to work with libraries around the country to digitize historic newspaper collections. NYPL is a partner.
139,600 title metadata records
92,125 issues
986,497 pages
180 batches 60 TB
links to the data
Include links to other URIs, so that they can discover other things.
Links to GeoNames, DBpedia, lingvoj. So many more facts there and at GeoNames: maps, long/lat, names.
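A rough sketch of what "links to other URIs" looks like as data: a handful of triples whose objects live on other hosts, plus a helper that lists where a client could go next. Everything here is illustrative. `example.org` stands in for the real site, the Dublin Core predicates are an assumption about how such links might be expressed, and the DBpedia, GeoNames and lingvoj URIs are examples rather than values taken from the slides.

```python
from urllib.parse import urlsplit

DCTERMS = "http://purl.org/dc/terms/"

# Hypothetical newspaper title; example.org stands in for the real site.
title = "http://example.org/titles/1#title"

triples = [
    (title, DCTERMS + "coverage", "http://dbpedia.org/resource/New_York_City"),
    (title, DCTERMS + "coverage", "http://sws.geonames.org/5128581/"),
    (title, DCTERMS + "language", "http://www.lingvoj.org/lang/en"),
    (title, DCTERMS + "isPartOf", "http://example.org/titles/"),
]


def external_links(triples, home_host):
    """Group object URIs by host, skipping those on our own host."""
    by_host = {}
    for _s, _p, o in triples:
        host = urlsplit(o).hostname
        if host and host != home_host:
            by_host.setdefault(host, []).append(o)
    return by_host


links = external_links(triples, "example.org")
for host in sorted(links):
    print(host, len(links[host]))
```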
linking open data community discussion list enrichment by Chris Bizer
Standard three-tier architecture: Django, rdflib, Solr
Django models. Serialize the models as RDF using rdflib. Make the RDF available at predictable URIs. Advertise the URIs in your HTML.
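The "predictable URIs" and "advertise in your HTML" steps can be sketched without Django or rdflib. Below, a plain dataclass stands in for a Django model, and the URL pattern, the `slug` field, and the `example.org` host are all hypothetical. The RDF serialization itself, which the talk does with rdflib, is left out.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class Title:
    """Stand-in for a Django model; `slug` is a hypothetical field."""
    slug: str
    name: str


BASE = "http://example.org"  # placeholder host, not the real site


def html_url(title):
    return f"{BASE}/titles/{title.slug}/"


def rdf_url(title):
    # Same path as the HTML page with a fixed suffix, so a client can
    # guess the data URL from the page URL.
    return f"{BASE}/titles/{title.slug}.rdf"


def rdf_link_tag(title):
    """Build the <link> element for the page <head> that points at the RDF."""
    href = escape(rdf_url(title), quote=True)
    return f'<link rel="alternate" type="application/rdf+xml" href="{href}">'


t = Title(slug="sn00000000", name="Example Gazette")
print(html_url(t))
print(rdf_link_tag(t))
```

A crawler that finds the `<link rel="alternate">` element in the HTML can move from the human-readable page to the machine-readable description without any site-specific knowledge.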
Why?
photo by Allan Engelhardt
People have probably already thought about the sorts of resources in your domain. Think about the resources in your app.
Assign URIs to those resources. Use HTTP and the architecture of the web to deliver representations of them. Scalability and security are layered in. Thought experiment: is syndication linked data? Atom instead of RDF/XML?
Suggested Reading
Cool URIs for the Semantic Web. Leo Sauermann and Richard Cyganiak.
Linked Data Design Issues. Tim Berners-Lee.
On Linking Alternative Representations To Enable Discovery and Publishing. T. V. Raman.
How to Publish Linked Data on the Web. Chris Bizer, Richard Cyganiak and Tom Heath.