The Semantic Web in Ten Passages Harold Boley Institute for Information Technology e- Business New Brunswick, Canada.

Slides:



Advertisements
Similar presentations
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Advertisements

Chapter 5: Introduction to Information Retrieval
DELIVERING STORIES WITH PURSUIT Story-delivery presentation and demo Ben Tagger and Dirk Trossen (UCAM) Stuart Porter (CTVC)
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
Knowledge Representation
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
1 Knowledge Representation We’ve discussed generic search techniques. Usually we start out with a generic technique and enhance it to take advantage of.
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Automatic Web Page Categorization by Link and Context Analysis Giuseppe Attardi Antonio Gulli Fabrizio Sebastiani.
12 CHAPTER DATABASES Databases are the key to accessing information throughout our lives. Used in hospitals, grocery stores, schools, department stores,
1 IS380 Class Agenda 01/11/05 Sock H. Chung 1.Syllabus 2.Chapter 1 3.Introduction 4. Request.
Overall Information Extraction vs. Annotating the Data Conference proceedings by O. Etzioni, Washington U, Seattle; S. Handschuh, Uni Krlsruhe.
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Chapter 19: Information Retrieval
Towards Semantic Web: An Attribute- Driven Algorithm to Identifying an Ontology Associated with a Given Web Page Dan Su Department of Computer Science.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Chapter 1 1 © Prentice Hall, 2002 Database Design Dr. Bijoy Bordoloi Introduction to Database Processing.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Data Modelling – ERD Entity Relationship Diagram’s Entity Relationship Diagrams and how to create them. 1.
Multilingual Information Exchange APAN, Bangkok 27 January 2005
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
© 2007 by Prentice Hall 1 Introduction to databases.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
 Architecture and Description Of Module Architecture and Description Of Module  KNOWLEDGE BASE KNOWLEDGE BASE  PRODUCTION RULES PRODUCTION RULES 
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Search Engine Architecture
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Ontology-Centered Personalized Presentation of Knowledge Extracted from the Web Ralitsa Angelova.
Algorithmic Detection of Semantic Similarity WWW 2005.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
OWL Representing Information Using the Web Ontology Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
1 Resolving Schematic Discrepancy in the Integration of Entity-Relationship Schemas Qi He Tok Wang Ling Dept. of Computer Science School of Computing National.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Achieving Semantic Interoperability through Controlled Annotations Michael Gertz Department of Computer Science University of California, Davis
Passage 1 1. Meaningful Search
Search Engine Architecture
Erasmus University Rotterdam
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Searching for and Accessing Information
Information Retrieval
Clustering Semantically Enhanced Web Search Results
Chapter 1: The Database Environment
Data Mining Chapter 6 Search Engines
Introduction to Information Retrieval
Search Engine Architecture
Chaitali Gupta, Madhusudhan Govindaraju
Web archives as a research subject
The Database Environment
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Presentation transcript:

The Semantic Web in Ten Passages Harold Boley Institute for Information Technology e- Business New Brunswick, Canada

Passage 1: Meaningful Search in the Billion- Fold Planetary Network

 Like searching for a specific grain of sand in two tightly packed 1m x 1m x 1m boxes  Current search engines: keyword-based, ranked search (good, but …)  Future search engines: “understand” the semantics (answers/services, not just ranked pages)  Knowledge representation: moving into focus on the Web

Passage 2: The Search Engine and its Crawler

 Crawlers: enter central & frequent words into a huge “address book”  You get the “hit list” for word w when you type in w  Example Wonder drug for head pain (1,160,000 hits) “Wonder drug for head pain” (no hits)  Abmiguities drug = medicine or narcotics head = body part or front or direction pain = ache or hurt or suffering or distress wonder = puzzlement or monumental creation  Missing the relationships among the words

Passage 3: Precision and Recall – Conflicting Measures for Search Results

 Aspirin (5,860,000 hits) – low precision  Aspirin “head pain” (8,040 hits) Better, but still low precision Recall problems: many “headache” pages missed – Aspirin headache (649,000 hits)  Aspirin “head pain” OR “head hurt” (583,000 hits) But now what about also “migraine” Query starts to get hard

Passage 4: Semantics – From Common Words to Standard Concepts

 Semantically want the concept that can be named “head pain” OR “headache” OR “migraine”  Semantic search engine would find the pages “meant”  Ideally Recall: complete Precision: perfect as well

Passage 5: Semantic Relationships Between Standard Concepts and …

 “Asprin cures head pain” vs. “Asprin causes head pain”  Semantic search engine should recognize semantic relationships between concepts  “Address book” becomes a “knowledge base”  Facts in the knowledge base Asprin --- cures --- headache Subject PRECICATE Object  Increases both recall and precision

Passage 5: … and Knowledge Derivation

 Suppose you want “Asprin CURES Headache AND Asprin CAUSES Headache”  Could store fact: “Asprin AMB Headache” (AMB = ambivalent)  Could instead write a rule IF pharmaceutical CURES sickness AND pharmaceutical CAUSES sickness THEN pharmaceutical AMB sickness Semantic search engine would find pages satisfying the IF part and hence necessarily also the THEN part  How? Semantic relationships between standard concepts Knowledge representation

Passage 6: Where do the Standard Concepts and Predicates Come from?

 Experts of a specialized field agree to share normative definitions of their concepts and predicates Shared, explicit concept catalogues Ontologies  Hierarchical superconcept-subconcept dubbed most important: Headache ISA Pain

Passage 7: Assigning Concepts/Predicates to Common Words: How?

 Build ontologies – tough job!  Automating the building of ontologies is very difficult – why?

Passage 7: Assigning Concepts/Predicates to Common Words: How?  Build ontologies – tough job!  Automating the building of ontologies is very difficult – why? Meaning often depends on context Granularity: e.g. general “stomach ache” or specific “appendix attack” Sentence analysis – NLP known to be hard Audio and video – can’t apply textual techniques Sometimes necessary to extend ontology, which only domain experts should be allowed to do  Semi-automatic construction System proposes concepts – expert agrees/fixes/completes TANGO

Passage 8: Where Will the Assignments be Stored as Metadata?

 External E.g. the “address book” Advantages  Possible to annotate pages not owned  Better for multiple annotations for different ontologies  More convenient for queries  Internal Annotations in the pages themselves Advantages  Can be updated when page changes  Compromise: only URL pointer placed in page  Change/maintenance problem for annotations

Passage 9: Refined Standard Concepts Inherit Refined Semantic Relationships

 Suppose: Headache ISA Pain; Sporadic-Headache ISA Headache; Chronic-Headache ISA Heacache Aspirin --- CURES --- Headache  Now suppose someone decides this should be different: Aspirin --- CURES --- Sporadic-Headache Now, what about all the annotated pages before the change? (two possibilities)  UPDATE all old annotations: But now domain experts should decide which was meant for each “Headache” occurrence – “Sporadic-Headache” or “Chronic- Headache”  SWITCH ontologies but access old via old: eventually leads to versions of versions and … problems

Passage 10: Library Catalogues as Metadata Ontologies

 “UPDATE” is the “nicer” solution, but many libraries have chosen “SWITCH” – you sometimes have to search in two or more catalogues  Will eventually become a big problem Competing ontologies Complementary ontologies Could be overwhelmed by ontologies