Graph Data Management Lab, School of Computer Science Put conference information here.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
YAGO: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Max-Planck-Institute for Computer Science, Saarbruecken,
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
CS570 Artificial Intelligence Semantic Web & Ontology 2
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Ontology Notes are from:
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
Chapter 6: Modeling and Representation Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Ontology-based Access Ontology-based Access to Digital Libraries Sonia Bergamaschi University of Modena and Reggio Emilia Modena Italy Fausto Rabitti.
Saarbrucken / Germany ¨
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
YAGO:A LARGE ONTOLOGY FROM WIKIPEDIA AND WORDNET FABIAN M. SUCHANEK, GJERGJI KASNECI, GERHARD WEIKUM Subbalakshmi Iyer.
Knowledge representation
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Logics for Data and Knowledge Representation
Author: William Tunstall-Pedoe Presenter: Bahareh Sarrafzadeh CS 886 Spring 2015.
GLOSSARY COMPILATION Alex Kotov (akotov2) Hanna Zhong (hzhong) Hoa Nguyen (hnguyen4) Zhenyu Yang (zyang2)
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Developing “Geo” Ontology Layers for Web Query Faculty of Design & Technology Conference David George, Department of Computing.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Artificial Intelligence 2004 Ontology
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Mining the Biomedical Research Literature Ken Baclawski.
Semantic Web BY: Josh Rachner and Julio Pena. What is the Semantic Web? The semantic web is a part of the world wide web that allows data to be better.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
1 NAGA: Searching and Ranking Knowledge Gjergji Kasneci Joint work with: Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
Gaby Nativ, SDBI  Motivation  Other Ontologies  System overview  YAGO Dive IN  LEILA  NAGA  Conclusion.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Johannes Hoffart, Fabian Suchanek Klaus BerberichBy Gerhard Weikum Madhav Charan.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Service-Oriented Computing: Semantics, Processes, Agents
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Web IR: Recent Trends; Future of Web Search
Ontology.
Service-Oriented Computing: Semantics, Processes, Agents
Extracting Semantic Concept Relations
Frontiers of Computer Science, 2015, 9(6):980–989
Service-Oriented Computing: Semantics, Processes, Agents
DBpedia 2014 Liang Zheng 9.22.
How to publish in a format that enhances literature-based discovery?
Yago Type Heuristics 丁基伟.
Presentation transcript:

Graph Data Management Lab, School of Computer Science Put conference information here Reporter: Qi Liu YAGO

2 Graph Data Management Lab, School of Computer Science What is YAGO?  A semantic web  A knowledge base  A combination of WordNet and wikipedia

3 Graph Data Management Lab, School of Computer Science Semantic web  Advocated by W3C(World Wide Web Consortium)  Aimed at reconstructing the WWW  A standard framework: RDF(Resource Description Framework)

4 Graph Data Management Lab, School of Computer Science What is YAGO?  A semantic web  A knowledge base  A combination of WordNet and wikipedia

5 Graph Data Management Lab, School of Computer Science Knowledge base  To be: A special database for knowledge management  To do: Provides a means for collecting, organising, searching and utilising information  Three types: Machine-readable knowledge bases(DBpedia) Human-readable konwledge bases(Wikipedia) Knowledge base analysis and design

6 Graph Data Management Lab, School of Computer Science What is YAGO?  A semantic web  A knowledge base  A combination of WordNet and wikipedia

7 Graph Data Management Lab, School of Computer Science WordNet  To be: A lexical database for English since 1985  To do: Groups words into synsets Provides short, general definitions Records the semantic relations between these synsets  25 basic noun groups & 15 verb groups

8 Graph Data Management Lab, School of Computer Science Key Concepts  Ontology vs Taxonomy  Lexicon:the bridge between a language and the knowledge expressed in that language  Syntactic (there vs their)  Semantic (sight vs site)  Pragmatic (infer vs imply)

9 Graph Data Management Lab, School of Computer Science Figure 1: Hierarchy of top-level categories in KR ontology  See also

10 Graph Data Management Lab, School of Computer Science Semantics of YAGO  Five relations: Domain Range subRelationof Type subClassOf  Entities: Domain Relation Range Literal......

11 Graph Data Management Lab, School of Computer Science Axiomatic rules

12 Graph Data Management Lab, School of Computer Science Reasoning rules  correctness and completeness

13 Graph Data Management Lab, School of Computer Science The YAGO system  Knowledge extraction  YAGO storage  Enriching YAGO

14 Graph Data Management Lab, School of Computer Science Knowledge extraction  TYPE relation  SUBCLASSOF relation  MEANS relation  Other relations  Meta-relations

15 Graph Data Management Lab, School of Computer Science TYPE relation extraction  The Wikipedia Category System Types: conceptual, administrative, relational, thematic  Identifying Conceptual Categories Conceptual  TYPE Adm and relational ones: excluded by hand Employ a shallow linguistic parsing(Noun Group Parser) of the left two categories E.g. Naturalized citizens of United States domain and range extracted at the same time

16 Graph Data Management Lab, School of Computer Science SUBCLASSOF relation extraction  Wikipedia categories DAG(directed acyclic graph) Reflect merely the thematic structure Use only the leaf categories of Wikipedia  Integrating WordNet Synsets Match or prefer WordNet  Establishing subClassOf American people in Japan  Exceptions Correct manually

17 Graph Data Management Lab, School of Computer Science Means relation extraction  Exploiting WordNet Synsets A synset{urban center,metropolis, city} Attach a class for the synset ‘city’  Exploiting Wikipedia Redirects Search “Einstein, Albert”, redirected to “Albert, Einstein”  Parsing Person Names givenNameOf subRelationOf means familyNameOf subRelationOf means

18 Graph Data Management Lab, School of Computer Science Other relations extraction  BornInYear & DiedInYear  EstablisedIn & LocatedIn  WrittenInYear  PolitionOf  HasWonPrize  Filtering the Results

19 Graph Data Management Lab, School of Computer Science Meta-relations extraction  Descriptions Individual DESCRIBES URL  Witness Fact FoundIn URL(of its witness page) ExtractedBy  Context Linkages btw A&B: A Context B

20 Graph Data Management Lab, School of Computer Science Knowledge extraction  TYPE relation  SUBCLASSOF relation  MEANS relation  Other relations  Meta-relations

21 Graph Data Management Lab, School of Computer Science The YAGO system  Knowledge extraction  YAGO storage  Enriching YAGO

22 Graph Data Management Lab, School of Computer Science YAGO storage  Model independent of storage  Storage: Text files, XML, database tables, RDF

23 Graph Data Management Lab, School of Computer Science Enriching YAGO  Add the fact(x,r,y) Map x,y to existing entities(word sense disambiguation) If mapping failed, add new entity. Map r to YAGO ontology If mapping successed, add a FoundIn relation If mapping failed, add a new fact!

24 Graph Data Management Lab, School of Computer Science Summary on YAGO1  1M entities & 5M facts  Accuracy around 95%

25 Graph Data Management Lab, School of Computer Science

26 Graph Data Management Lab, School of Computer Science YAGO2: In Time, Space and Many Languages  YAGO: about 100 manually defined relations  Build YAGO2 architecture based on such rules: Factual rules E.g. Exceptions,definition of all relations, domains, ranges and classes Implication rules Inferring rules from the facts in the database Replacement rules Normalize numbers, tags and other formats Extraction rules Extracting facts from a given source text

27 Graph Data Management Lab, School of Computer Science Temporal Dimension  People wasBornOnDate & diedOnDate  Groups wasCreatedOnDate&wasDestroyedOnDate  Artifacts(buildings, songs,cities) [same as above]  Events startedOnDate & endedOnDate =>startExistingOnDate&endExistingOnDate  Facts  Entities in a fact =>subjectStartRelation&objectStartRelation

28 Graph Data Management Lab, School of Computer Science GEO-SPATIAL Dimension  All physical objects have a location in space!  Define it with geographical coordinates, i.e. Latitude and longtitude =>yagoGeoCoordinates, =>hasGeoCoordinates  Two sources: Wikipedia GeoNames locatedIn & hasGeoCoordinates &

29 Graph Data Management Lab, School of Computer Science Textual Dimension  hasWikipediaAnchorText  hasWikipediaCategory  hasCitationTitle subClassOf hasContext Integrating UWN to including 200 languages