Linking Open Drug Data Susie Stephens, Principal Research Scientist, Eli Lilly
Overview: Linked Data Principles; Linking Open Drug Data
The Classic Web: a single information space. Built on URIs (globally unique IDs, retrieval mechanism). Built on hyperlinks, the glue that holds everything together. [Diagram: web browsers and search engines on top of HTML documents A, B, and C connected by hyperlinks] Source: Chris Bizer
Linked Data: use Semantic Web technologies to publish structured data on the Web and to set links between data in one data source and data in other data sources. [Diagram labels: things in data sets A to E connected by typed links; Linked Data browsers, Linked Data mashups, search engines] Source: Chris Bizer
Data Objects Identified with HTTP URIs. pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin. pd:cygri = http://richard.cyganiak.de/foaf.rdf#cygri; dbpedia:Berlin = http://dbpedia.org/resource/Berlin. The foaf:based_near statement forms an RDF link between two data sources. Source: Chris Bizer
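The triples on this slide can be sketched in plain Python to show what "an RDF link between two data sources" means in practice. This is a minimal illustration, not how any particular RDF library works: the `pd` and `dbpedia` namespaces follow from the two URIs given on the slide, the `rdf` and `foaf` namespace URIs are the standard ones (the slide does not spell them out), and the helper names `expand`, `source_of`, and `rdf_links` are hypothetical.

```python
# Namespaces. "pd" and "dbpedia" follow from the URIs on the slide; the
# "rdf" and "foaf" URIs are the standard vocabularies (assumed, not on the slide).
PREFIXES = {
    "pd": "http://richard.cyganiak.de/foaf.rdf#",
    "dbpedia": "http://dbpedia.org/resource/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "foaf": "http://xmlns.com/foaf/0.1/",
}

# Prefixes that name data sources (as opposed to vocabularies).
DATA_SOURCES = ("pd", "dbpedia")


def expand(curie):
    """Expand a prefixed name such as 'dbpedia:Berlin' into a full HTTP URI."""
    prefix, local = curie.split(":", 1)
    return PREFIXES[prefix] + local


# The three statements from the slide as (subject, predicate, object) tuples.
# The name is a literal, so it is kept as a plain string.
TRIPLES = [
    (expand("pd:cygri"), expand("rdf:type"), expand("foaf:Person")),
    (expand("pd:cygri"), expand("foaf:name"), "Richard Cyganiak"),
    (expand("pd:cygri"), expand("foaf:based_near"), expand("dbpedia:Berlin")),
]


def source_of(value):
    """Return the data-source prefix a URI belongs to, or None."""
    for name in DATA_SOURCES:
        if value.startswith(PREFIXES[name]):
            return name
    return None


def rdf_links(triples):
    """Keep triples whose subject and object live in different data sources."""
    links = []
    for s, p, o in triples:
        src_s, src_o = source_of(s), source_of(o)
        if src_s is not None and src_o is not None and src_s != src_o:
            links.append((s, p, o))
    return links


if __name__ == "__main__":
    for s, p, o in rdf_links(TRIPLES):
        print(s, p, o)
```

Only the foaf:based_near statement connects a URI in one data source to a URI in another, which is why it is the one the slide singles out as an RDF link.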
Dereferencing URIs over the Web. pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin. dbpedia:Berlin dp:population 3.405.259; dbpedia:Berlin skos:subject dp:Cities_in_Germany. Source: Chris Bizer
Dereferencing URIs over the Web. pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin. dbpedia:Berlin dp:population 3.405.259; dbpedia:Berlin skos:subject dp:Cities_in_Germany. dbpedia:Hamburg skos:subject dp:Cities_in_Germany; dbpedia:Muenchen skos:subject dp:Cities_in_Germany. Source: Chris Bizer
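Dereferencing a Linked Data URI amounts to an ordinary HTTP GET that asks for an RDF representation. The sketch below uses only the Python standard library and makes a few assumptions not stated on the slide: the server is assumed to honor an `Accept: application/rdf+xml` header, and the function names `build_dereference_request` and `dereference` are hypothetical. The fragment (`#cygri`) is stripped first because fragments are not sent to the server.

```python
from urllib.parse import urldefrag
from urllib.request import Request, urlopen

# Media type requested from the server. application/rdf+xml is a registered
# RDF serialization; whether a given server supports it is an assumption.
RDF_MEDIA_TYPE = "application/rdf+xml"


def build_dereference_request(uri):
    """Build (but do not send) an HTTP GET request for an RDF description."""
    document_url, _fragment = urldefrag(uri)  # drop '#cygri' and the like
    return Request(document_url, headers={"Accept": RDF_MEDIA_TYPE})


def dereference(uri, timeout=10):
    """Send the request and return the response body as bytes.

    Requires network access; redirects (common for Linked Data URIs)
    are followed automatically by urlopen.
    """
    with urlopen(build_dereference_request(uri), timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    req = build_dereference_request("http://dbpedia.org/resource/Berlin")
    print(req.get_method(), req.full_url, req.get_header("Accept"))
```

A client that parses the returned description will find further URIs in it (for example dbpedia:Berlin in the description of pd:cygri) and can dereference those in turn, which is how the graph on the slide grows step by step.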
The Linked Data Cloud: more than 2 billion RDF triples, more than 3 million links
Linking Open Drug Data: HCLSIG task started October 1, 2008. Primary objectives: survey publicly available data sets about drugs; publish and interlink these data sets on the Web; explore interesting questions that could be answered if the data sets are linked.
LODD Participants: Bosse Andersson, Chris Bizer, Kei Cheung, Don Doherty, Oktie Hassanzadeh, Anja Jentzsch, Scott Marshall, Eric Prud’hommeaux, Matthias Samwald, Susie Stephens, Jun Zhao
LODD: Data Set Evaluation Source: http://esw.w3.org/topic/HCLSIG/LODD/Data/DataSetEvaluation
Characterizing Drug Data Sources. Source: Mark Sharp et al., A Framework for Characterizing Drug Information Sources, 2008
LODD Data Sets
LODD in Marbles
Conclusions: The cloud of Linked Data is growing rapidly. Many of its data sets relate to the life sciences. W3C’s HCLSIG has published four drug-related data sets. Linking algorithms and data browsers still need improvement.