Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Yansong Feng and Mirella Lapata
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
Albert Gatt Corpora and Statistical Methods Lecture 13.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Wrap up  Matching  Geometry  Semantics  Multiscale modelling / incremental update / generalization  Geometric algorithms  Web Services.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Search Engines and Information Retrieval
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Clustering… in General In vector space, clusters are vectors found within  of a cluster vector, with different techniques for determining the cluster.
Low-cost semantics-enhanced web browsing with Magpie Enrico Motta Knowledge Media Institute The Open University, UK.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Developing Semantic Web Sites: Results and Lessons Learnt Enrico Motta, Yuangui Lei, Martin Dzbor, Vanessa Lopez, John Domingue, Jianhan Zhu, Liliana Cabral,
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Experiments on Using Semantic Distances Between Words in Image Caption Retrieval Presenter: Cosmin Adrian Bejan Alan F. Smeaton and Ian Quigley School.
1 Information Retrieval and Web Search Introduction.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Multi-view Exploratory Learning for AKBC Problems Bhavana Dalvi and William W. Cohen School Of Computer Science, Carnegie Mellon University Motivation.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Overview of Search Engines
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Information Retrieval in Practice
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Disambiguation of References to Individuals Levon Lloyd (State University of New York) Varun Bhagwan, Daniel Gruhl (IBM Research Center) Varun Bhagwan,
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Query Relevance Feedback and Ontologies How to Make Queries Better.
Search Engines and Information Retrieval Chapter 1.
GEM/IRDR Social Vulnerability and Resilience Information System and Metadata Portal IRDR Scientific Board Meeting Chengdu 03/11/2012.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Exploiting Wikipedia as External Knowledge for Document Clustering Sakyasingha Dasgupta, Pradeep Ghosh Data Mining and Exploration-Presentation School.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Mining the Structure of User Activity using Cluster Stability Jeffrey Heer, Ed H. Chi Palo Alto Research Center, Inc – SIAM Web Analytics Workshop.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Some questions -What is metadata? -Data about data.
OWL Representing Information Using the Web Ontology Language.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Document Databases for Information Management Gregor Erbach FTW, Wien DFKI, Saarbrucken ETL, Tsukuba
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Enhancing Text Clustering by Leveraging Wikipedia Semantics.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Information Organization: Overview
Information Retrieval and Web Search
Information Retrieval and Web Search
Information Retrieval and Web Search
Social Knowledge Mining
How to publish in a format that enhances literature-based discovery?
CS246: Information Retrieval
Web Mining Research: A Survey
Information Retrieval and Web Design
Information Organization: Overview
Information Retrieval and Web Search
Presentation transcript:

Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart

Typical Applications of Ontologies Agent communication Data integration Description of service capabilities for matching and composition purposes Formal verification of process descriptions Unification of terminology across communities

Text Applications of Ontologies Information Retrieval (IR) Clustering and Classification of Documents Semantic Annotation Natural Language Processing

Task-Based Evaluation (Porzel and Malaka 2005)

Task-Based Evaluation Requirements 1.Algorithm output can be quantified 2.Task can use background knowledge 3.Ontology is an additional parameter 4.Output can be traced to the ontology

Contents 1.Text Clustering and Classification 2.Information Highlighting for Supporting Search 3.Related Work

Text Clustering and Classification What is the difference?

Text Clustering

Text Classification ArrowsWeatherFlat shapes3-D formsSmile!

Dot Kom Project One of many competitions

Approaches Bag of words Manually engineered MeSH Tree Structures Automatically constructed ontologies

What is a “Bag of Words” anyway? the quick brown fox

Bag of Words thequickbrownfoxjumpsoverthelazydog (2)

Building Hierarchies

Note on Ontologies Our ontologies (“micro”) – Like a database record schema Their ontologies (“macro”) – Like WordNet

Clustering Hierarchical Agglomerative Clustering Bi-Section K-means “A Comparison of Document Clustering Techniques” –

Document Representations Bag of Words Certain words + ontology -> extended features Strategies: add, replace, only

Vectors and Cosine Similarity

Classification Results (Categories)

Classification Results (Documents)

Cluster Metrics P : computer-generated clusters L : human-created clusters P, L : sets of clusters (partitioning)

Clustering Results

Information Highlighting for Supporting Search Challenge: – 10 minute limit – KMi Planet News web site – Compile a list of important People Technologies

Information Highlighting for Supporting Search Tools: – Regular browser – Magpie – ESpotter – C-PANKOW

Teams A : web browser only B : web browser with AKT information C : web browser with AKT++ information

AKT++ Lexicon

Scores

Conclusions (for this section) Generated ontologies can be comparable to hand-crafted ontologies Humans can trust the computer too much! (Group C drop in score)

Related Work Query Expansion Information Retrieval Text Clustering and Classification Natural Language Processing

Ambiguity resolution – Bank Compounds – Headache medicine Vague words – With, of, has – Selectional restrictions Anaphora

More Applications Word sense disambiguation Classification of unknown words Named Entity Recognition (NER) Anaphora Resolution Question Answering – Who wrote the Hobbit? – Tolkien is the author of the Hobbit. Information Extraction – AUTOSLOG, ASIUM

Analysis/Conclusion Pro/con: – Focused on two systems – Passing survey of others