Linking Open Drug Data (HCLSIG LODD). Christian Bizer, Freie Universität Berlin.
Overview: 1. Linked Data Principles: What is Linked Data? 2. Linked Data Deployment on the Web: What data is out there? 3. Linking Open Drug Data: Status and plans of the HCLSIG LODD task.
The Classic Web: A single information space, built on URIs (globally unique IDs and a retrieval mechanism) and hyperlinks. Hyperlinks are the glue that holds everything together. [Diagram: Web browsers and search engines work over HTML documents A, B, and C, which are connected by hyperlinks.] Structure problem: the classic Web cannot easily answer a question such as "Which WWW2008 papers were written by people from companies with fewer than 100 employees?" How can this be overcome?
Linked Data: Use Semantic Web technologies to publish structured data on the Web and to set links between data in one data source and data in other data sources. This overcomes the separation between data sources. [Diagram: Things in data sources A, B, C, D, and E, connected by typed links.]
Data objects are identified with HTTP URIs. [RDF graph: pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin.] pd:cygri = http://richard.cyganiak.de/foaf.rdf#cygri and dbpedia:Berlin = http://dbpedia.org/resource/Berlin. The foaf:based_near statement forms an RDF link between two data sources. Todo: add animation; explain namespaces, retrieval, and the RDF graph.
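The prefixed names on this slide are shorthand for full HTTP URIs. A minimal sketch of that expansion, using plain Python tuples rather than an RDF library: the pd: and dbpedia: namespaces follow the two URIs given on the slide, while the rdf: and foaf: namespace URIs, the helper names `expand` and `to_ntriples`, and the dict used to mark a literal are assumptions made for illustration.

```python
# Namespace prefixes. "pd" and "dbpedia" are derived from the slide;
# "rdf" and "foaf" are the usual namespace URIs of those vocabularies.
PREFIXES = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "pd": "http://richard.cyganiak.de/foaf.rdf#",
    "dbpedia": "http://dbpedia.org/resource/",
}


def expand(name):
    """Expand a prefixed name such as 'pd:cygri' into a full HTTP URI."""
    prefix, local = name.split(":", 1)
    return PREFIXES[prefix] + local


# The three statements from the slide as (subject, predicate, object).
# A dict marks a literal value; plain strings are prefixed names.
TRIPLES = [
    ("pd:cygri", "rdf:type", "foaf:Person"),
    ("pd:cygri", "foaf:name", {"literal": "Richard Cyganiak"}),
    ("pd:cygri", "foaf:based_near", "dbpedia:Berlin"),
]


def to_ntriples(triples):
    """Serialize the triples as N-Triples-style lines with full URIs.

    Literal escaping is not handled; this is only a sketch.
    """
    lines = []
    for s, p, o in triples:
        if isinstance(o, dict):
            obj = '"%s"' % o["literal"]
        else:
            obj = "<%s>" % expand(o)
        lines.append("<%s> <%s> %s ." % (expand(s), expand(p), obj))
    return lines


if __name__ == "__main__":
    for line in to_ntriples(TRIPLES):
        print(line)
```

The third line of output is the RDF link: its subject lives in one data source (richard.cyganiak.de) and its object in another (dbpedia.org).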
Dereferencing URIs over the Web. [RDF graph, extended by dereferencing dbpedia:Berlin: pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin; dbpedia:Berlin dp:population 3,405,259; dbpedia:Berlin skos:subject dp:Cities_in_Germany.]
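Dereferencing means looking up an HTTP URI and getting back a description of the thing it names. A minimal sketch with the Python standard library: the function names `build_request` and `dereference` are hypothetical, requesting `application/rdf+xml` through the Accept header is one common choice rather than something the slides prescribe, and what a given server returns today is not guaranteed.

```python
from urllib.parse import urldefrag
from urllib.request import Request, urlopen


def build_request(uri, accept="application/rdf+xml"):
    """Build an HTTP GET request for the document that describes `uri`."""
    # The fragment part (e.g. #cygri) is never sent to the server,
    # so only the document URL is requested.
    document_url, _fragment = urldefrag(uri)
    # The Accept header asks for RDF instead of an HTML page.
    return Request(document_url, headers={"Accept": accept})


def dereference(uri, timeout=10):
    """Fetch the description of `uri`. This performs a network call."""
    with urlopen(build_request(uri), timeout=timeout) as response:
        return response.headers.get("Content-Type"), response.read()


if __name__ == "__main__":
    # Only builds and prints the request; no network access here.
    req = build_request("http://richard.cyganiak.de/foaf.rdf#cygri")
    print(req.full_url, req.get_header("Accept"))
```

Keeping request construction separate from the network call makes the URI handling easy to check without depending on a live server.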
Dereferencing URIs over the Web. [RDF graph, extended further: pd:cygri rdf:type foaf:Person; pd:cygri foaf:name "Richard Cyganiak"; pd:cygri foaf:based_near dbpedia:Berlin; dbpedia:Berlin dp:population 3,405,259; dbpedia:Berlin skos:subject dp:Cities_in_Germany; dbpedia:Hamburg skos:subject dp:Cities_in_Germany; dbpedia:Muenchen skos:subject dp:Cities_in_Germany.]
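The growth of the graph across these slides can be sketched as a "follow your nose" traversal. To stay self-contained, the sketch below replaces the Web with an in-memory dictionary: the prefixed names and values come from the slides, but which URI returns which statements, and the names `WEB` and `crawl`, are assumptions made for illustration.

```python
# A toy stand-in for the Web: each URI maps to the triples that come back
# when it is dereferenced. The grouping into "documents" is an assumption.
WEB = {
    "pd:cygri": [
        ("pd:cygri", "rdf:type", "foaf:Person"),
        ("pd:cygri", "foaf:name", "Richard Cyganiak"),
        ("pd:cygri", "foaf:based_near", "dbpedia:Berlin"),
    ],
    "dbpedia:Berlin": [
        ("dbpedia:Berlin", "dp:population", "3405259"),
        ("dbpedia:Berlin", "skos:subject", "dp:Cities_in_Germany"),
    ],
    "dp:Cities_in_Germany": [
        ("dbpedia:Berlin", "skos:subject", "dp:Cities_in_Germany"),
        ("dbpedia:Hamburg", "skos:subject", "dp:Cities_in_Germany"),
        ("dbpedia:Muenchen", "skos:subject", "dp:Cities_in_Germany"),
    ],
}


def crawl(start, max_hops):
    """Breadth-first traversal: dereference `start`, then follow links."""
    graph = set()          # all triples collected so far
    seen = {start}         # URIs already scheduled for dereferencing
    frontier = [start]
    for _ in range(max_hops + 1):
        next_frontier = []
        for uri in frontier:
            for triple in WEB.get(uri, []):  # simulated dereference
                graph.add(triple)
                # Follow subject and object if they can be dereferenced.
                for term in (triple[0], triple[2]):
                    if term in WEB and term not in seen:
                        seen.add(term)
                        next_frontier.append(term)
        frontier = next_frontier
    return graph


if __name__ == "__main__":
    for hops in range(3):
        print(hops, len(crawl("pd:cygri", hops)))
```

Each extra hop corresponds to one step of the animation: zero hops yields the statements about pd:cygri, one hop adds the statements about dbpedia:Berlin, and two hops add the other cities in the same category.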
Applications: What can I do with this? [Diagram: Linked Data browsers, Linked Data mashups, and search engines work over Things in data sources A, B, C, D, and E, which are connected by typed links.]
Falcons
DBpedia Mobile: A geospatial entry point into the Web of Data. Starts with DBpedia, Revyu, and Flickr data.
DERI Semantic Web Pipes
2. Linked Data Deployment on the Web: W3C Linking Open Data Community Effort; Bio2RDF Project.
W3C Linking Open Data Project: A community effort to publish existing open-license datasets as Linked Data on the Web and to interlink things between different data sources. Born out of the DBpedia project.
The LOD Cloud: More than 2 billion RDF triples. More than 3 million links between datasets.
Organizations publishing Linked Data. Universities and research institutes: Massachusetts Institute of Technology (USA), University of Southampton (UK), Freie Universität Berlin (DE), DERI (IRE), KMi, Open University (UK), University of London (UK), Universität Hannover (DE), University of Pennsylvania (USA), Universität Leipzig (DE), Universität Karlsruhe (DE), Joanneum (AT), University of Toronto (CA). Companies: BBC (UK), OpenLink (UK), Zitgist (USA), Talis (UK), Garlik (UK), Mondeca (FR), Cyc Foundation (USA). Works without funding.
The Bio2RDF Project. Goals: Make bioinformatics data available in RDF format on the Web; promote the Linked Data vision within the bioinformatics community; answer questions that were not possible or practical to ask before. Participants: Université Laval, Canada; Queensland University of Technology, Australia.
The Bio2RDF Cloud: 27 data sources; 260 million records; 2.7 billion RDF triples.
3. Linking Open Drug Data: HCLSIG task started October 1st, 2008. Primary objectives: Survey publicly available data sets about drugs; publish and interlink these data sets on the Web; explore interesting questions that could be answered if the data sets are linked.
Questions that LODD might help to answer. Physicians and pharmacists: What are alternative drugs for a given indication (disease)? What are equivalent drugs (the generic version of a brand name, or the chemical name of an active ingredient)? Are there ongoing clinical trials for a drug? Consumers: What background information is available about a drug? Which alternative drugs are available? What are the contraindications of a drug? What are the results of clinical trials for a drug? Pharmaceutical companies: What are other companies with drugs in similar areas? Which companies have a similar therapeutic focus?
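Two of these questions can be sketched as small queries over linked triples. Everything in the example is hypothetical: the ex1: and ex2: identifiers, the property names, and the drug, disease, and trial placeholders are invented for illustration and do not come from any real data set. The point is only that a link (here owl:sameAs) lets a question span two separately published sources.

```python
# Hypothetical triples from two separately published data sets.
# All identifiers below are placeholders, not real drug data.
DRUG_DATASET = [
    ("ex1:drugA", "ex1:indication", "ex1:diseaseX"),
    ("ex1:drugB", "ex1:indication", "ex1:diseaseX"),
    ("ex1:drugC", "ex1:indication", "ex1:diseaseY"),
]
TRIAL_DATASET = [
    ("ex2:trial1", "ex2:intervention", "ex2:compound7"),
    ("ex2:trial1", "ex2:status", "ongoing"),
    ("ex2:trial2", "ex2:intervention", "ex2:compound9"),
    ("ex2:trial2", "ex2:status", "completed"),
]
# RDF links stating that identifiers in the two sets denote the same drug.
LINKS = [
    ("ex1:drugB", "owl:sameAs", "ex2:compound7"),
    ("ex1:drugC", "owl:sameAs", "ex2:compound9"),
]


def alternative_drugs(drug):
    """Other drugs in DRUG_DATASET that share an indication with `drug`."""
    indications = {o for s, p, o in DRUG_DATASET
                   if s == drug and p == "ex1:indication"}
    return sorted({s for s, p, o in DRUG_DATASET
                   if p == "ex1:indication"
                   and o in indications and s != drug})


def ongoing_trials(drug):
    """Ongoing trials for `drug`, found by following owl:sameAs links."""
    same = {o for s, p, o in LINKS if s == drug and p == "owl:sameAs"}
    trials = {s for s, p, o in TRIAL_DATASET
              if p == "ex2:intervention" and o in same}
    return sorted(t for t in trials
                  if (t, "ex2:status", "ongoing") in TRIAL_DATASET)


if __name__ == "__main__":
    print(alternative_drugs("ex1:drugA"))
    print(ongoing_trials("ex1:drugB"))
```

Without the entries in `LINKS`, the second question cannot be answered at all, because the trial data set uses its own identifiers for the same drugs.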
Public Drug Data Sources. Source: Mark Sharp et al., A Framework for Characterizing Drug Information Sources, 2008.
esw.w3.org/topic/HCLSIG/LODD/Data/DataSetEvaluation
Potential Links between LODD Data Sets
LODD Participants: Kristin Tolle (Microsoft), Eric Prud'hommeaux (W3C), Don Doherty (Brainstage), Susie Stephens (Lilly), Bosse Anderssen (AZ), Scott Marshall (University of Amsterdam), Chris Bizer (Freie Universität Berlin), Glen Newton (National Research Council Canada), Michel Dumontier (Carleton University), TN Bhat (NIST), Oktie Hassanzadeh (University of Toronto). You?
Thanks! References:
Linking Open Drug Data HCLSIG Task: http://esw.w3.org/topic/HCLSIG/LODD/
Linking Open Data Community Effort: http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Bio2RDF Project: http://bio2rdf.wiki.sourceforge.net/
Tutorial, How to Publish Linked Data on the Web: http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/