1 Bluffers Guide to The Semantic Web Frank van Harmelen CS Department Vrije Universiteit Amsterdam Data wants to be free.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
Frank van Harmelen Vrije Universiteit Amsterdam The Information Universe of the (Near) Futur e Creative Commons License: allowed to share & remix, but.
Introduction to Semantic Web What? Why? How? So far? Next? Frank van Harmelen AI Department Vrije Universiteit Amsterdam Creative Commons License: allowed.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Frank van Harmelen Vrije Universiteit Amsterdam The Web of data and LarKC’s role in it Creative Commons License: allowed to share & remix, but must attribute.
10-Sep-02 Page 1 Gadjah Mada University - Yogyakarta - Indonesia Gadjah Mada University10-Sep-02 Page 1 Gadjah Mada University - Yogyakarta - Indonesia.
Ontologic View of Earth Sciences Why ontologies
By Ahmet Can Babaoğlu Abdurrahman Beşinci.  Suppose you want to buy a Star wars DVD having such properties;  wide-screen ( not full-screen )  the extra.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Semantic Web Agents: Hope or Hype Nicholas Gibbins School of Electronics and Computer Science University of Southampton.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
RDF Briefing Frank van Harmelen Vrije Universiteit Amsterdam.
Frank van Harmelen Semantics: where are we now, where should we go? Creative Commons CC BY 3.0: allowed to share & remix (also commercial) but must attribute.
The Semantic Web: New-style data-integration (and how it works for life-scientists too!) Frank van Harmelen AI Department Vrije Universiteit Amsterdam.
Semantic Web research anno 2006: main streams, popular falacies, current status, future challenges Frank van Harmelen Vrije Universiteit Amsterdam.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
The Semantic Web: New-style data-integration (and how it works for life-scientists too!) Frank van Harmelen AI Department Vrije Universiteit Amsterdam.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Why, in the future, all sciences will be computer sciences Barry Smith.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.
Logics for Data and Knowledge Representation
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
Applying the Semantic Web at UCHSC - Center for Computational Pharmacology Ian Wilson.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
The Semantic Web from ft Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Copyright OpenHelix. No use or reproduction without express written consent1.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
E-Heritage and the VU Semantic Web group Guus Schreiber Computer Science VU University Amsterdam.
Metadata Schema for CERIF Andrei Lopatenko Vienna University of Technology
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Mining the Biomedical Research Literature Ken Baclawski.
Japan Consortium for Glycobiology and Glycotechnology DataBase 日本糖鎖科学統合データベース GDGDB - Glyco-Disease Genes Database The complexity of glycan metabolic pathways.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Chapter 8A Semantic Web Primer 1 Chapter 8 Conclusion and Outlook Grigoris Antoniou Frank van Harmelen.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Components.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Cloud based linked data platform for Structural Engineering Experiment
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Traditional Linked DATA or Connect Your Data Your Way
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Ontology.
LOD reference architecture
Linked Open Data in 10 Minutes Sandro Hawke, W3C
Presentation transcript:

1 Bluffers Guide to The Semantic Web Frank van Harmelen CS Department Vrije Universiteit Amsterdam Data wants to be free

2 Semantics as your saviour?

3

4 Outline The general idea: a Web of Data What must be done to realise this How far away is this Nex steps, do’s, don’ts

5 The Scientist’s Problem Too much unintegrated data: l from a variety of incompatible sources l no standard naming convention l each with a custom browsing and querying mechanism (no common interface) l and poor interaction with other data sources Everybody’s

6 What are the Data Sources? Flat Files URLs Proprietary Databases Public Databases Spreadsheets s … Data wants to be free Maps

7 In which disciplines? Archeology Chemistry Genomics, proteomics,... (bio/life-sciences) Communication science Social history Linguistics Bio-diversity Environmental sciences (climate studies).... libraries (KB), archives (sound&vision) One dataset per sitea new database each month historical datalaymen data international data (for their first time) Geo?

8 Outline The general idea: a Web of Data What must be done to realise this How far away is this Nex steps, do’s, don’ts

The Current Web of text and pictures                     linked web-pages, written by people, written for people, used only by people... Many of these pages already come from data, that is usable by computers! But we can’t link the data.... ? ? ? ? The Future Web of Data ? linked data, usable by computers! useful for people! Data wants to be free

10 Which Semantic Web? Version 1: “Enrichment of the current Web” recipe: Annotate and classify web-content enable better search & browse,..

11 Which Semantic Web? Version 2: "Semantic Web as Web of Data" (TBL) recipe: expose databases on the web, use RDF, integrate meta-data from: l expressing DB schema semantics in machine interpretable ways enable integration and unexpected re-use

12 Outline The general idea: a Web of Data What must be done to realise this How far away is this Nex steps, do’s, don’ts

13 machine accessible meaning (What it’s like to be a machine) symptoms drug administration disease IS-A alleviates META-DATA

14 What is meta-data? it's just data it's data describing other data its' meant for machine consumption disease name symptoms drug administration

15 Required are: 1. a standard syntax l so meta-data can be recognised as such 2.one or more shared vocabularies l so data producers and data consumers all speak the same language 3. lots of resources with meta-data attached mechanisms for attribution and trust

1. A standard syntax things & relations between things Semantic Web data model: RDF

17 RDF Triples in Life Sciences

18 RDF Triples in Geo geo:point:_ geo:lat geo:long Remember: RDF = simple model for data Remember: RDF = simple model for data

19 RDF Schema: vocabulary for data types Classes + subclass hierarchy rivers are waterways Properties + subproperty hierarchy father-of implies parent-of Domain of properties X capital-of Y  X has-type city Range of properties X capital-of Y  Y has-type country Simple standardised inferences

20 OWL: richer vocabulary for data types Things RDF Schema cannot express: Description Logic SHOIN(D) l equality, disjunction, negation, l min/max number restrictions l inverse, symmetric, transitive properties l and much more… Example: Every country has precisely one capital: Inference TheHague ≠ A’dam & A’dam = capital  TheHague ≠ capital Integrity checks after data-merging Example: Every country has precisely one capital: Inference TheHague ≠ A’dam & A’dam = capital  TheHague ≠ capital Integrity checks after data-merging Complex standardised inferences OWL

Web of Data: a nybody can say anything about anything All identifiers are URL's (= on the Web) l Allows total decoupling of data vocabulary meta-data x T [ IsOfType ] different owners & locations Data wants to be free

22 2. Shared vocabularies Mesh l Medical Subject Headings, National Library of Medicine l descriptions EMTREE l Commercial Elsevier, Drugs and diseases l terms, synonyms UMLS l Integrates 100 different vocabularies SNOMED l concepts, College of American Pathologists Gene Ontology l terms in molecular biology NCBI Cancer Ontology: l 17,000 classes (about 1M definitions) BioMed Geo?

23 Outline The general idea: a Web of Data What must be done to realise this How far away is this Nex steps, do’s, don’ts

24 How far away is this ? Stable data formats & standardised inferences Lots of shared vocabularies (+ ways to convert them) Lots of data sources (+ ways to convert them) Lots of tools l convert, construct, edit (data, vocabularies) l store, search, query, reason l interlink l visualise l...

already many billions of facts & rules How far away is this ? Not very far away! rapidly growing Linked Open Data cloud. Encyclopedia Geographic names (millions) names of artists & art works (10.000’s) scientific bibliographies hierarchical dictionaries (UK, FR, NL) hierarchical dictionaries (UK, FR, NL) life-science databases any CD ever recorded (almost) every book sold by Amazon basic facts on every country on the planet common sense rules & facts ( ’s) It gets bigger every month

26 Example use-case: bbc.co.uk/music/artists Content is BBC + LOD Use an ontology as basis for the site Serve data back out as RDF “The Web is becoming our content management platform”

27 Outline The general idea: a Web of Data What must be done to realise this How far away is this Nex steps, do’s, don’ts

28 Next steps 1.hunt for shared vocabularies l try to avoid building them 2.wrap legacy data sources l your own l from others 3.link wrapped sources 4.publish linked data on the web l make noise 5.reconstruct some old results 6.produce new results 7.get famous Can you get famous by sharing data? Can you get famous by sharing data? papers in oncology, in communication science, dedicated conferences in chemistry, earth-sciences, life- sciences, humanities funding opportunities in humanities, social sciences, life sciences learn / get access to some basic technology in-use systems in communication science, KB, Beeld & Geluid, Europeana A little semantics goes a long way

29 Questions & discussion