Linked Open Data: a new resource for eResearch
Dr Anne Cregan, eResearch Analyst, Intersect and ANDS

What this talk will cover
– Open data
– The web of data
– RDF triples
– RDF graphs
– The Linked Open Data project
– Publishing to the web of data
– Consuming the web of data

Open data
– The philosophy and practice of making data freely available to everyone, without restrictions from copyright, patents or other mechanisms of control.

Why make data open?
– Public money was used to fund the work, so it should be available to the public.
– Facts cannot legally be copyrighted.
– Sponsors of research do not get full value for money unless the resulting data are made freely available.
– In scientific research, the rate of discovery is accelerated by better access to data.
Source: How to Make the Dream Come True: The Astronomers Data Manifesto (Norris, 2007)

How to make open data useful…
Principles:
– Make it easy to find
– Make it available to everyone
– Separate it from the applications that use it
– Interlink it with related datasets in a meaningful way
– Make it machine processable

The web of data
– The web of data = a naming model + a data model on the web
– It’s a web of interlinked data that machines can read (whereas the web is a web of interlinked documents for people to read)
– Also known as the “Semantic Web” because of its formal semantics for reasoning and its relationship to meaning

The web of data
– It is an initiative of the World Wide Web Consortium (W3C) and is a collaborative effort of many parties
– It derives from W3C director Sir Tim Berners-Lee's vision of the Web as a universal medium for data, information, and knowledge exchange.
– Like the web, anyone can publish to it: anyone can say anything about anything.
– However, they need to say it in RDF, not HTML.
– And anything they want to talk about has to be identified by a URI.

URI = Uniform Resource Identifier
– The naming model for the web of data
– A URI is a unique name that identifies a resource
– A resource is anything to which we can attach identity
– A resource can be an information object, like a document or a webpage, but it can also be a real-world object, like a person. It can be anything at all.
– A URL is a kind of URI that names the resource and also indicates a means of acting upon or obtaining it via its primary access mechanism, e.g. http, ftp
– For example:
  URL: rg/People/Berners-Lee/
  URL: TR/rdf-concepts/
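
The distinction between a name and a locator can be sketched in a few lines of Python. This is only an illustration: both identifiers below are hypothetical, and checking for a network location is a rough stand-in for "indicates a means of obtaining it", not a formal test of whether a URI is a URL.

```python
from urllib.parse import urlsplit

# Hypothetical identifiers, chosen only for illustration.
identifiers = [
    "http://example.org/people/alice",  # names a resource and says how to reach it
    "urn:isbn:0451450523",              # names a resource but gives no access mechanism
]

def describe(uri: str) -> dict:
    """Split a URI into its scheme and whether it names a network location."""
    parts = urlsplit(uri)
    return {"scheme": parts.scheme, "has_location": bool(parts.netloc)}

for uri in identifiers:
    print(uri, describe(uri))
```

Both strings are valid URIs; only the first also tells a client where and how to fetch something.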

RDF = Resource Description Framework
– A framework for describing and linking resources on the web
– Allows URIs to be connected into a directed graph
– Based on the idea of triples: Subject, Predicate, Object

RDF = Resource Description Framework
– A framework for describing and linking resources on the web
– Allows URIs to be connected into a directed graph
– Based on the idea of triples, e.g.
  Subject: intersect.org.au/intersect-team/AnneCregan
  Predicate: doac:organization
  Object: intersect.org.au

RDF = Resource Description Framework
– Putting triples together creates a graph
  intersect.org.au/intersect-team/AnneCregan  doac:organization  intersect.org.au
  intersect.org.au/intersect-team/AnneCregan  doac:organization  ands.org.au

RDF = Resource Description Framework
– Putting triples together creates a graph
– Nodes of the graph are URIs and literals
  intersect.org.au/intersect-team/AnneCregan  doac:organization  intersect.org.au
  intersect.org.au/intersect-team/AnneCregan  doac:organization  ands.org.au
  intersect.org.au/intersect-team/AnneCregan  foaf:firstName  “Anne”

RDF = Resource Description Framework
– Has a schema to describe relationships between things, called RDF Schema
  intersect.org.au/intersect-team/AnneCregan  doac:organization  intersect.org.au
  intersect.org.au/intersect-team/AnneCregan  doac:organization  ands.org.au
  intersect.org.au/intersect-team/AnneCregan  foaf:firstName  “Anne”

RDF = Resource Description Framework
– Is a World Wide Web Consortium (W3C) Recommendation
– Is part of the Semantic Web “stack”
  intersect.org.au/intersect-team/AnneCregan  doac:organization  intersect.org.au
  intersect.org.au/intersect-team/AnneCregan  doac:organization  ands.org.au
  intersect.org.au/intersect-team/AnneCregan  foaf:firstName  “Anne”
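
The triples on these slides can be modelled, very roughly, as plain Python tuples. This is a minimal sketch rather than how an RDF library stores data: the identifiers are copied from the slides as shown (without expanding the doac: and foaf: prefixes), and the helper name outgoing_edges is made up for this example.

```python
from collections import defaultdict

ANNE = "intersect.org.au/intersect-team/AnneCregan"

# Each triple is (subject, predicate, object), taken from the slide diagram.
triples = {
    (ANNE, "doac:organization", "intersect.org.au"),
    (ANNE, "doac:organization", "ands.org.au"),
    (ANNE, "foaf:firstName", '"Anne"'),  # quoted: a literal value, not a URI
}

def outgoing_edges(triples):
    """Index the graph as subject -> list of (predicate, object) edges."""
    graph = defaultdict(list)
    for s, p, o in sorted(triples):
        graph[s].append((p, o))
    return dict(graph)

graph = outgoing_edges(triples)
for predicate, obj in graph[ANNE]:
    print(f"{ANNE} --{predicate}--> {obj}")
```

Each tuple is one directed, labelled edge; the subject and object are the nodes, which is why a set of triples is already a graph.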

Semantic Web Technology Stack
– The Semantic Web standards build on each other
– URI is the naming mechanism
– RDF, RDF Schema and OWL are the languages for describing resources and relationships between them
– SPARQL is a query language for querying RDF graphs

RDF Graphs
– Putting triples together creates a directed graph

RDF Graphs
– Graphs can be interconnected by referring to URIs in other graphs
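
Interconnection through shared URIs can be sketched with two small sets of triples. The first triple comes from the earlier slides; the second graph, including the ex:label predicate and its value, is invented for illustration. Treating a merge as a plain set union is a simplification that assumes neither graph contains blank nodes.

```python
# Graph A: a triple from the earlier slides.
graph_a = {
    ("intersect.org.au/intersect-team/AnneCregan", "doac:organization", "ands.org.au"),
}
# Graph B: a hypothetical second dataset; predicate and literal are invented.
graph_b = {
    ("ands.org.au", "ex:label", '"Example label for ANDS"'),
}

def nodes(graph):
    """All subjects and objects appearing in a set of triples."""
    return {s for s, _, _ in graph} | {o for _, _, o in graph}

def shared_nodes(g1, g2):
    """Identifiers used in both graphs: the points where they connect."""
    return nodes(g1) & nodes(g2)

def merge(*graphs):
    """Combine graphs by set union (assumes no blank nodes)."""
    merged = set()
    for g in graphs:
        merged |= g
    return merged

print(shared_nodes(graph_a, graph_b))
print(len(merge(graph_a, graph_b)))
```

Because both graphs use the same identifier for the organisation, the merged graph lets a reader walk from the person to the label without any identifier mapping step.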

RDF Graphs

Linking Open Data Project
– Community project of the W3C Semantic Web Education and Outreach (SWEO) group
– Started in 2007
– Has grown rapidly as members of the community have added open datasets
– Has created the largest existing RDF graph: over 18 billion triples!

Linking Open Data Project October 2007

Linking Open Data Project September 2008

Linking Open Data Project July 2009

Linking Open Data Project April 2010

Linking Open Data Project
– As of May 2009, the project had created a linked open data cloud of 4.7 billion RDF triples; in April 2010, Linked Open Numbers added another 14 billion triples
– Datasets include:
  – DBpedia: linked data version of Wikipedia
  – US Census: 2000 US Census data set
  – Gene Ontology: annotations from the Gene Ontology database
  – DrugBank: information about FDA-approved drugs
  – UniProt: life sciences data set
  – Lots of bio/life sciences data sets: the BIO2RDF cloud
– More info at cts/LinkingOpenData/DataSets

Publishing to the Linked Open Data Cloud – Principles
1. Use URIs to name things
2. Use HTTP URIs so you can look up those things on the web
3. When someone looks up a URI, provide useful information (“dereferenceable”)
4. Include RDF statements that link to other URIs so that they can discover related things
These principles are from Tim Berners-Lee's 2007 note.
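
Principles 2 and 3 imply that a client can look up a thing's URI over HTTP and ask for a machine-readable description. A minimal sketch of the client side, using only Python's standard library, is below. The URI is hypothetical, and requesting Turtle via the Accept header is one common convention, assumed here; whether a given server honours it depends on how the data was published. The request is built but deliberately not sent.

```python
from urllib.request import Request

def build_rdf_request(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare (but do not send) an HTTP GET asking for an RDF serialisation."""
    return Request(uri, headers={"Accept": media_type})

# Hypothetical URI, for illustration only.
request = build_rdf_request("http://example.org/id/some-resource")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```

Passing the prepared request to urllib.request.urlopen would perform the lookup; a publisher following the principles would answer with RDF that itself contains links to further URIs.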

Consuming linked open data
– Browsing linked data is easy
– You need an RDF browser such as Tabulator, Disco, Zitgist, Marbles or OpenLink
– Let’s go for a ride on Disco: berlin.de/rdf_browser/
– Start here: berlin.de/rdf_browser/
– We can travel through the linked open data cloud between URIs linked using RDF

Consuming linked open data
eResearch example: enabling drug discovery
Data sets published to the data cloud:
– Linked CT: Linked Clinical Trials; 60,000 trials in 158 countries
– DrugBank: FDA-approved drugs; 5,000 small molecule and biotech drugs
– Diseasome: disorders and disease genes; 4,300 disorders, disease genes and associations
– DailyMed: chemical structures of marketed drugs; 124,000 triples and 29,600 links
– SWAN: Alzheimer's Hypothesis Browser Knowledgebase

Consuming linked open data
Using an RDF browser:
– See all drugs in trials for Alzheimer’s disease in Linked CT, including a Phase III trial for Varenicline.
– Follow a link to data from DailyMed showing that Varenicline is already on the market for nicotine addiction. The typical dose is 1 mg twice daily, and the Linked CT trial used no higher dose than that, so no new safety issues arise.
– Link to DrugBank to find that Varenicline is an alpha-4 beta-2 neuronal nicotinic acetylcholine receptor agonist.
– Diseasome indicates that the corresponding genes are only important in nicotine addiction, not Alzheimer’s.
– But the SWAN Knowledgebase shows there are hypotheses relating Alzheimer’s to nicotinic receptors through amyloid beta.
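
The browsing session above amounts to following links outward from a starting node. A rough sketch of that walk over an in-memory set of triples is below. Only the dataset names and the drug name come from the slide; every identifier and predicate here is a hypothetical placeholder, not the vocabulary those datasets actually use.

```python
from collections import deque

# Hypothetical identifiers and predicates, invented for illustration.
triples = [
    ("linkedct:alzheimers-trial", "ex:testsDrug", "linkedct:varenicline"),
    ("linkedct:varenicline", "ex:sameDrugAs", "dailymed:varenicline"),
    ("linkedct:varenicline", "ex:sameDrugAs", "drugbank:varenicline"),
    ("drugbank:varenicline", "ex:targetGene", "diseasome:gene-placeholder"),
]

def follow_links(start, triples):
    """Breadth-first walk over outgoing links, returning nodes in visit order."""
    seen, order, queue = {start}, [], deque([start])
    while queue:
        node = queue.popleft()
        order.append(node)
        for s, _, o in triples:
            if s == node and o not in seen:
                seen.add(o)
                queue.append(o)
    return order

print(follow_links("linkedct:alzheimers-trial", triples))
```

An RDF browser does much the same thing interactively, except that each step dereferences a URI on the web instead of scanning a local list.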

Consuming linked open data
Using the linked open data cloud with an RDF browser, researchers are able to:
– Browse data relating to companies, clinical trials, drugs, diseases and genetic variation
– See when extra data is available
– Gain access to data without needing to map identifiers and synonyms, because the interlinking has already been done
– Gain additional insights about interesting questions to ask
Jentzsch et al., “Enabling Tailored Therapeutics with Linked Data”, events.linkeddata.org/ldow2009/papers/ldow2009_paper9.pdf

Consuming linked open data
Querying using SPARQL queries
– A SPARQL endpoint enables users (human or other) to query a knowledge base via the SPARQL language.
– Results are typically returned in one or more machine-processable formats.
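
As a sketch of what talking to an endpoint involves: the SPARQL protocol allows a query to be sent as an HTTP GET with the query text in a `query` parameter. The endpoint URL and the resource URI below are hypothetical placeholders, and the code only builds the request URL without contacting any server.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; substitute the endpoint of the dataset being queried.
ENDPOINT = "http://example.org/sparql"

# Ask for up to ten predicate/object pairs describing one (hypothetical) resource.
QUERY = """
SELECT ?predicate ?object
WHERE { <http://example.org/id/some-resource> ?predicate ?object }
LIMIT 10
"""

def build_query_url(endpoint: str, query: str) -> str:
    """Encode a SPARQL query as an HTTP GET URL using the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})

url = build_query_url(ENDPOINT, QUERY)
print(url)
```

Fetching that URL from a real endpoint would return the variable bindings in a machine-processable format, which one depending on the endpoint and the Accept header sent.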

Types of Queries
– Selection and extraction queries retrieve parts of the data based on its content, structure, or position
– Reduction queries specify which parts of the data not to include in the answer
– Restructuring queries restructure data into different formats/serialisations
– Aggregation queries aggregate several data items into one new data item
– Combination and inference queries combine information that is not explicitly connected
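
Three of these query types can be imitated over an in-memory list of triples, as a loose analogy for what a SPARQL engine does. The data is copied from the earlier RDF slides; the function names are made up for this sketch, and real queries would be written in SPARQL, not Python.

```python
from collections import Counter

ANNE = "intersect.org.au/intersect-team/AnneCregan"
triples = [
    (ANNE, "doac:organization", "intersect.org.au"),
    (ANNE, "doac:organization", "ands.org.au"),
    (ANNE, "foaf:firstName", '"Anne"'),
]

def select(triples, s=None, p=None, o=None):
    """Selection: keep triples matching the pattern; None acts as a wildcard."""
    return [t for t in triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]

def reduce_out(triples, p):
    """Reduction: return everything except triples with predicate p."""
    return [t for t in triples if t[1] != p]

def count_by_predicate(triples):
    """Aggregation: collapse many triples into one count per predicate."""
    return Counter(t[1] for t in triples)

print(select(triples, p="doac:organization"))
print(reduce_out(triples, "doac:organization"))
print(count_by_predicate(triples))
```

Selection states what to keep, reduction states what to leave out, and aggregation replaces several items with a single derived value.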

Summary
– Open data
– The web of data
– RDF triples
– RDF graphs
– The Linked Open Data project
– Publishing to the web of data
– Consuming the web of data

Thank you
More details are at
– http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Questions and comments may be emailed.