1 Semantic Web: Vaporware or Worthy Dream? Slides adapted from Nick Kushmerick Rose colored glasses are never made in bi-focals because no-body wants to.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
Semantic Web Thanks to folks at LAIT lab Sources include :
CS570 Artificial Intelligence Semantic Web & Ontology 2
RDF Tutorial.
Listening non-stop for 150min per week, for 16 weeks –4000$ (your tuition).. Re-viewing all the lecture videos on Youtube –100000$ (in lost girl friends/boy.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Search Engines and Information Retrieval
Listening non-stop for 150min per week, for 16 weeks –4000$ (your tuition).. Watching Rao sip 30+ doppio machchiatos –30$ (aggravation fee).. Catching.
Interactive Review + a (corny) ending 12/05  Project due today (with extension)  Homework 4 due Friday  Demos (to the TA) as scheduled.
The Last Lecture Agenda –1:40-2:00pm Integrating XML and Search Engines—Niagara way –2:00-2:10pm My concluding remarks (if any) –2:10-2:45pm Interactive.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Listening non-stop for 150min per week, for 16 weeks –4000$ (your tuition).. Catching up on your beauty sleep in the class –300$ (chairs not very comfy)
1 5/4: Final Agenda… 3:15—3:20 Raspberry bars »In lieu of Google IPO shares.. Homework 3 returned; Questions on Final? 3:15--3:40 Demos of student projects.
1 Semantic Web: Vaporware or Worthy Dream? Slides adapted from Nick Kushmerick Rose colored glasses are never made in bi-focals because no-body wants to.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Computer communication B Introduction to the Semantic Web.
The Semantic Web An Emerging Successor of the Web By Tim Berners-Lee, James Hendler and Ora Lassila Presentedby Veera Chandra Sekhar.
The Internet & The World Wide Web Notes
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Introduction. » How the course works ˃Homework ˃Project ˃Exams ˃Grades » prerequisite ˃CSCI 6441: Mandatory prerequisite ˃Take the prereq or get permission.
Chapter 16 The World Wide Web. 2 The Web An infrastructure of information combined and the network software used to access it Web page A document that.
Search Engines and Information Retrieval Chapter 1.
An Introduction to the Resource Description Framework Eric Miller Online Computer Library Center, Inc. Office of Research Dublin, Ohio 元智資工所 系統實驗室 楊錫謦.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Computing Fundamentals Module Lesson 19 — Using Technology to Solve Problems Computer Literacy BASICS.
Unit 15 Webpage Creator. Outlines Introduction Starter Listening Language Work Work study Speaking Writing.
1.NET Web Forms Business Forms © 2002 by Jerry Post.
1 UNIT 15 Webpage Creator Lecturer: fadwa tlaelan.
ITCS373: Internet Technology Lecture 5: More HTML.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Semantic Web - an introduction By Daniel Wu (danielwujr)
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
OWL Representing Information Using the Web Ontology Language.
Computing Fundamentals Module Lesson 6 — Using Technology to Solve Problems Computer Literacy BASICS.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
IT Enablement Approaches Large Business may have hundreds of processes to be enabled by IT. Several Types of Application may be deployed –Departmental.
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
CP3024 Lecture 12 Search Engines. What is the main WWW problem?  With an estimated 800 million web pages finding the one you want is difficult!
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
ASET 1 Amity School of Engineering & Technology B. Tech. (CSE/IT), III Semester Database Management Systems Jitendra Rajpurohit.
MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Sravanthi Lakkimsety Mar 14,2016.
On-Line BankCard Center Presentation Cardholder Role During the Presentation click the mouse on this button to move back a slide During the Presentation.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
1 UNIT 13 The World Wide Web. Introduction 2 Agenda The World Wide Web Search Engines Video Streaming 3.
1 UNIT 13 The World Wide Web. Introduction 2 The World Wide Web: ▫ Commonly referred to as WWW or the Web. ▫ Is a service on the Internet. It consists.
General Architecture of Retrieval Systems 1Adrienn Skrop.
(class #2) CLICK TO CONTINUE done by T Batchelor.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Building the Semantic Web
Piotr Kaminski University of Victoria September 24th, 2002
Computer Literacy BASICS
INFO/CSE 100, Spring 2006 Fluency in Information Technology
Attributes and Values Describing Entities.
Presentation transcript:

1 Semantic Web: Vaporware or Worthy Dream? Slides adapted from Nick Kushmerick Rose colored glasses are never made in bi-focals because no-body wants to read smallprint in dreams

2 SemanticWeb “ The Semantic Web ” (Berners-Lee, Hendler, Lassila; Scientific American, May 2001) “W3C Semantic Web Activity” (Koivenen, Miller; Dec 2001) COMP-4016 ~ Computer Science Department ~ University College Dublin ~ ~ © Nicholas Kushmerick 2002

3 Today’s Syntactic Web HTML (modest compliance with standards thanks to robust browsers) Hyperlinks (no data types; just annnotated with text [sometimes merely “Click here”!]; often dangling references) Human eyeballs & common sense (Just barely?!) suitable/scalable for –“trivial pursuit” information retrieval What’s the capital of Botswana? Will it rain tomorrow? … –“mundane” transactions/services Buying a book; Playing a game of chess; …

4 Automating people out of the loop “The bane of my existence is doing things that I know the computer could do for me” (D Connolly; W3C guru) 150 years ago, the telephone was outrageously sophisticated: “Do you seriously predict that every room in every building will have a small device that you type a few numbers into and you can talk to the person in any other room of any other building in the entire world??!!” 30 years ago, was outrageously sophisticated: “Do you seriously predict that every person will have a small device that you can type a person’s name into and you can send a private message to any other person in the world, that they can read even on the beach in Tahiti?!!?!” 15 years ago, the Web was outrageously sophisticated: “Do you seriously predict that every person will have a device with which they can send their grocery list to the shop and in a few hours the groceries arrive!!??!!” Why can’t my online calendar & bank account negotiate with my garage’s to arrange a mutually convenient time & price to repair my leaking tyre?

5 The Challenge All the relvant data is (or soon will be) “on the Web” -- but in a form specialized to human vision/processing, not automated machine processing How can my agent find/parse/extract garage’s free times? Which of my appointments are critical/flexible? Even if I annotated entries, what if the garage’s timetable doesn’t have such a concept? And there are dozens of constraints: How long will it take to get to the garage? Would I pay extra if they have can collect the car? Can they repair the door lock too?

6 What/where is the Semantic Web? Layered on top of existing Web. (Just like HTTP is built on top of TCP, which is on top of IP, which is on top of the data-link layer) TCP IP Data-Link research / vapourware solid implementations

7 12/9

8 Final Agenda… 12:15-12:16 Pointer about Online EvaluationsOnline Evaluations 12:16-12:17 Final Exam related qns 12:17-12:45 Semantic Web 12:45-1pm Buffer (Bridge Topics…) 1-1:25pm Interactive Semester Review 1:25-1:30pm Adieu

9 Final Exam Mostly about the second half of the semester –Although I reserve the right to ask questions that combine the two halves Will be take-home Will (likely be) put up on Thursday You will have about 3 full days to work on the exam –There will be escape clauses for folks who have two exams crammed in those three days Exam has to be submitted in HARD COPY.

10 Layer 1: URI Everything is a “Resource” (people, books, the attribute “title” of an Amazon “book” object, Web pages, the concept “laziness”, …) Ever resource has a unique identifier -- Uniform Resource Identifier eg, the URI of a Web Page is its URL Eg, the URI of my address is Owner of object can pick any URI they want as long is it is unique. Often has “URL-like” syntax but that is purely convention/arbitrary

11 Layer 2: XML Use XML as common formatting standard for encoding data. (Could invent a new format for every kind of data but why bother?) War & Peace … book title … title name … Data Meta-Data Meta- Meta-Data Danger/Warning: Made-up syntax!!

12 XML Schema An XML Schema document is an XML document that defines a set of XML tags (and how they may be used)

13 XML Namespaces An XML documents may use tags defined in more than one XML Schema document “Namespace” prefixes (xxx:yyy) are used to unambiguously point to the defining XML Schema document Nick’s Home Page

14 Layer 3: RDF All data/knowledge/facts/opinions/information is expressed on the Semantic Web as “Resource Description Framework” statements Very simple language for making assertions: –Triple: (value) (attribute) (object) (is address of) (Nick Kushmerick) –( ) (is ISBN number of) (War & Peace) –(field 5 of database A) (is a field of type) (postal code)

15 Everything is XML Remember (Nick’s Home Page) (is title of) ( is actually encoded as some very ugly XML: Nick’s Home Page

16 Layer 4: Ontologies (RDF Schema) There are lots of common RDF attribute-sets for lots of common tasks eg -- “Dublin Core” [Ohio, Sorry!] defines a few dozen standard attributes for asserting statements about documents: title, author, date, version, format, owner, … But suppose you want to define your own concepts/attributes -- –RDF Schema = set of RDF tags for defining a new set of RDF tags (no, this isn’t circular)

17 RDF Schema for Dublin Core Ontology

18 Layer 4½: Mapping Between Ontologies Taxonomy Crisis: –How can your agent know that my “title” is your “name”?! –How can my agent know that some of your “address” objects are post-boxes, not physical addresses?! –How can my agent know that many Asian first names correspond to Western surnames? Semantic Web Solution: Services for translating/mapping between “related” ontologies. Suppose Amazon.com uses Dublin Core (“title”), while Fred Hanna uses it’s own document ontology (“name”). So far … my agent is forced to choose a ontology, or must be carefully crafted to understand both lanuages A better solution: A niche now exists for a independent entity (UniversalBookInfo.com) that maps “title”  “name” etc

19 without UniversalBookInfo.com Nick wants to buy War & Peace Nick’s very complicated agent Amazon Fred Hanna Amazon ontology FredHanna ontology Programmer’s bank account €€€

20 with UniversalBookInfo.com Nick wants to buy War & Peace Nick’s agent Amazon Universal BookInfo.com Fred Hanna Joe’s agent Jane’s Agent Bank Account €€€ €€ €

21 Layer 5: Logic Ontologies also allow axioms –“All people have brains” Expressiveness: Key challenge in formalizing axioms: want to be able to say anything you need to in a particular domain. –“All people have brains, except George Bush.” But more expressive logics mean slower inference –Intuitively, applying a rule such as “You can’t fool all of the people all of the time” could require checking everyone in the universe to determine if there exists even one foolable person. DAML + OIL

22 Integrating Services Source can be “services” rather than “data repositories” –Eg. Amazon as a composite service for book buying –Separating line is somewhat thin Handling services –Description (API;I/O spec) WSDL –Composition Planning in general –Execution Data-flow architectures –See next part

23 Who will annotate the data? Semantic web works if the users annotate their pages using some existing ontology (or their own ontology, but with mapping to other ontologies) –But users typically do not conform to standards.. and are not patient enough for delayed gratification… –The way to force them is to act as if you are helping them write web-pages Currently most people don’t write their HTML code—the MS frontpages and Claris Homepages of the world do.. What if we change the MS Frontpage/Claris Homepage so that they (slyly) add annotations? E.g. The Mangrove project at U. Wash.Mangrove –Help user in tagging their data (allow graphical editing) –Provide instant gratification by running services that use the tags.

24 Layer 6: Proofs ugly XML encoding Proof Verifier Yes this proof is correct No this proof is flawed (Easy to build once the Logic layer is fixed)

25 Proofs: Huh?!??! ugly XML encoding Proof Verifier Yes this proof is correct No this proof is flawed (Easy to build once the Logic layer is fixed) I am an employee of XYZ Corp (because it says so on this Web page, which is an XYZ Corp official document) I would like to buy this book; please send my company an invoice OK, book successfully ordered Sorry, we need a credit card!

26 Proofs  Trust In the Semantic Web, a “proof” is a procedure that can be automatically followed in order to verify an assertion. Believability is always relative to a set of resources that you trust –I own bank account # , because my Digital Signature XXXX matches the record on Web page and you trust this page because you own bank.com

27 Summary Distributed global information ecosystem enables wide variety of value-added information services (monitoring your online purchases; finding entertainment in which you might be interested; scheduling appointments; …) But doing so is difficult/impossible if relevant data is tied up in legacy documents intended for human eyes/common sense The Semantic Web as Global Database/Brain for All Humanity -- Probably hopelessly futile But within sufficiently motivated (ie, rich) segments of the Web today’s Syntactic Web may well evolve into A Semantic Web Rose colored glasses are never made in bi-focals because no-body wants to read smallprint in dreams

28 Final Agenda… 12:15-12:16 Pointer about Online EvaluationsOnline Evaluations 12:16-12:17 Final Exam related qns 12:17-12:45 Semantic Web 12:45-1pm Buffer (Bridge Topics…) 1-1:25pm Interactive Semester Review 1:25-1:30pm Adieu

29 Soft Joins..WHIRL [Cohen] Often you have database relations some of whose fields are “Textual” –E.g. a movie database, which has, in addition to year, director etc., a column called “Review” which is unstructured text Normal DB operations ignore this unstructured stuff (can’t join over them). –SQL sometimes supports “Contains” constraint (e.g. give me movies that contain “Rotten” in the review We can extend the notion of Joins to “Similarity Joins” where similarity is measured in terms of vector similarity over the text attributes. So, the join tuples are output n a ranked form—with the rank proportional to the similarity –Neat idea… but does have some implementation difficulties –Most tuples in the cross-product will have non-zero similarities. So, need query processing that will somehow just produce highly ranked tuples

30 BANKSBANKS: Keyword Search in DB

31 Final Agenda… 12:15-12:16 Pointer about Online EvaluationsOnline Evaluations 12:16-12:17 Final Exam related qns 12:17-12:45 Semantic Web 12:45-1pm Buffer (Bridge Topics…) 1-1:25pm Interactive Semester Review 1:25-1:30pm Adieu

32 Course Outcomes After this course, you should be able to answer: –How search engines work and why are some better than others –Can web be seen as a collection of (semi)structured databases? If so, can we adapt database technology to Web? –Can useful patterns be mined from the pages/data of the web? What did you think these were going to be?? REVIEW

33 Main Topics Approximately three halves plus a bit: –Information retrieval –Information integration/Aggregation –Information mining –other topics as permitted by time REVIEW

34 Adapting old disciplines for Web-age Information (text) retrieval –Scale of the web –Hyper text/ Link structure –Authority/hub computations Databases –Multiple databases Heterogeneous, access limited, partially overlapping –Network (un)reliability Datamining [Machine Learning/Statistics/Databases] –Learning patterns from large scale data REVIEW

35 A Farside treasury…

36 Final Agenda… 12:15-12:16 Pointer about Online EvaluationsOnline Evaluations 12:16-12:17 Final Exam related qns 12:17-12:45 Semantic Web 12:45-1pm Buffer (Bridge Topics…) 1-1:25pm Interactive Semester Review 1:25-1:30pm Adieu

37 Rao: I could've taught more...I could've taught more, if I'd just...I could've taught more... Sree: Rao, there are twenty people who are mad at you because you taught too much. Look at them. Rao: If I'd made more time...I wasted so much time, you have no idea. If I'd just... Sree: There will be generations (of bitter people) because of what you did. Rao: I didn't do enough. Sree: You did so much. Rao: This slide. We could’ve removed this slide. Why did I keep the slide? Two minutes, right there. Two minutes, two more minutes.. This music, a bit on p2p. This review. Two points on custom portals. I could easily have made two for it. At least one. I could’ve gotten one more point across. One more. One more point. A point, Sree. For this. I could've gotten one more point across and I didn't.  Adieu with an Oscar Schindler Routine.. Schindler: I could've got more...I could've got more, if I'd just...I could've got more... Stern: Oskar, there are eleven hundred people who are alive because of you. Look at them. Schindler: If I'd made more money...I threw away so much money, you have no idea. If I'd just... Stern: There will be generations because of what you did. Schindler: I didn't do enough. Stern: You did so much. Schindler: This car. Goeth would've bought this car. Why did I keep the car? Ten people, right there. Ten people, ten more people...(He rips the swastika pin from his lapel) This pin, two people. This is gold. Two more people. He would've given me two for it. At least one. He would've given me one. One more. One more person. A person, Stern. For this. I could've gotten one more person and I didn't. Top few things I would have done if I had more time More on dimensionality reduction Extracting tagged info from text pages More on structure—non-structure integration Soft joins[Cohen]; Keyword search on DB Customized portal generation Optimization issues in Data Integration P2P mediation Services—and service standards Be less demanding more often (or even once…)

38