YAGO-NAGA Project Presented By: Mohammad Dwaikat To: Dr. Yuliya Lierler CSCI 8986 – Fall 2012.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Research Internships Advanced Research and Modeling Research Group.
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
SWIMs: From Structured Summaries to Integrated Knowledge Base
Gerhard Weikum Max Planck Institute for Informatics & Saarland University Semantic Search: from Names and Phrases to.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Universal Search and Social Networking Exploiting the features of each to enhance the other and the tools that make it possible Peter Wallqvist Ravn Systems.
Hermes: News Personalization Using Semantic Web Technologies
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
© Copyright 2012 STI INNSBRUCK Apache Stanbol.
Graph Data Management Lab, School of Computer Science Put conference information here.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Database and Information- Retrieval Methods for Knowledge Discovery Database and Information- Retrieval Methods for Knowledge Discovery Gerhard Weikum,
March 17, 2008SAC WT Hermes: a Semantic Web-Based News Decision Support System* Flavius Frasincar Erasmus University Rotterdam.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Which Nobel laureate survived both world wars and all his four children? Tandem What‘s this? Question AnsweringPhoto AnnotationTimeline Analysis.
1 ALiSS Adaptive Links Suggestion Service Antonio De Marinis, Stefan Jensen (EEA) Alec Ghica (Finsiel RO), Sasha Vinčić (Systemvaruhuset) Ecoterm III FAO.
The Exchange of Retrieval Knowledge about Services between Agents Mirjam Minor Mike Wernicke.
SEEKING STATEMENT-SUPPORTING TOP-K WITNESSES Date: 2012/03/12 Source: Steffen Metzger (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling Koh 1.
Survey of Semantic Annotation Platforms
The Semantic Web Web Science Systems Development Spring 2015.
DBrev: Dreaming of a Database Revolution Gjergji Kasneci, Jurgen Van Gael, Thore Graepel Microsoft Research Cambridge, UK.
Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.
Web Data Management Dr. Daniel Deutch. Web Data The web has revolutionized our world Data is everywhere Constitutes a great potential But also a lot of.
Dynamic Hypermedia Generations through a Mediator using CRM and Web Service Jen-Shin Hong National ChiNan University,Taiwan
Ontology-Based Information Extraction: Current Approaches.
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
Assigning Global Relevance Scores to DBpedia Facts Philipp Langer, Patrick Schulze, Stefan George, Tobias Metzke, Ziawasch Abedjan, Gjergji Kasneci DESWeb.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Natural Language Questions for the Web of Data Mohamed Yahya 1, Klaus Berberich 1, Shady Elbassuoni 2 Maya Ramanath 3, Volker Tresp 4, Gerhard Weikum 1.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
GeoNames is … Gazetteer aggregator of open geo data I am... Marc Wick GeoNames.
Natural Language Questions for the Web of Data 1 Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany 2 Shady Elbassuoni.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Introduction to the Semantic Web and Linked Data
Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
1 NAGA: Searching and Ranking Knowledge Gjergji Kasneci Joint work with: Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Gaby Nativ, SDBI  Motivation  Other Ontologies  System overview  YAGO Dive IN  LEILA  NAGA  Conclusion.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
STEWARD: A Spatio-Textual Document Search Engine for HUDUSER.ORG Prof. Hanan Samet Department of Computer Science, University of Maryland, College Park,
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Mohammad Alqahtani, Dr. Eric Atwell
Einat Minkov University of Haifa, Israel CL course, U
Visual Information Retrieval
YAGO-QA Answering Questions by Structured Knowledge Queries
Harnessing the Semantic Web to Answer Scientific Questions:
Database and Information - Retrieval Methods for Knowledge Discovery
Semantic Database Builder
Associative Query Answering via Query Feature Similarity
Analyzing and Securing Social Networks
Semantic Network & Knowledge Graph
Property consolidation for entity browsing
DBpedia 2014 Liang Zheng 9.22.
International Marketing and Output Database Conference 2005
Introduction Dataset search
Presentation transcript:

YAGO-NAGA Project Presented By: Mohammad Dwaikat To: Dr. Yuliya Lierler CSCI 8986 – Fall 2012

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

What is YAGO-NAGA? Harvesting, Searching, and Ranking Knowledge from the Web. Building a conveniently searchable, large-scale, highly accurate knowledge base of common facts in a machine-processable representation. Harvested knowledge about millions of entities and facts about their relationships, from Wikipedia and WordNet with careful integration of these two sources.

What is YAGO-NAGA? Its vision is a confluence of Semantic Web (Ontologies), Social Web (Web 2.0), and Statistical Web (Information Extraction) assets towards a comprehensive repository of human knowledge.

YAGO Yet Another Great Ontology (YAGO) Knowledge base. It is a huge semantic knowledge base, derived from Wikipedia, WordNet, and GeoNames.GeoNames It knows almost 10 million entities (e.g. persons, organizations, cities), and 120 million facts about these entities. It has a manually confirmed accuracy of 95%. YAGO is an ontology that is anchored in time and space. – It attaches a temporal dimension and a spacial dimension to many of its facts and entities.

YAGO It contains all the entities and ontological facts extracted from Wikipedia (from ), with categories mapped to the WordNet class hierarchy. It also contains multi-lingual data from the Universal WordNet (UWN).

YAGO It contains all the entities and facts from GeoNames - (from a dump of August 2010). It also contains textual and structural data from Wikipedia. All links+anchor texts between the YAGO entities. All Wikipedia category names. The titles of references.

YAGO It is particularly suited for disambiguation purposes, as it contains a large number of names for entities. It also knows the gender of people. YAGO is the resulting knowledge base, the facts are represented as RDF triples (Resource Description Framework). Methods and prototype systems have been developed for querying, ranking, and exploring knowledge.

NAGA Not Another Google Answer (NAGA) is a new semantic search engine which provides ranked answers to queries based on statistical models. It can operate on knowledge bases that are organized as graphs with labeled nodes and edges, so called relationship graphs. As of now, NAGA uses a projection of YAGO as its knowledge base. The underlying query language supports keyword search for the casual user as well as graph-based queries with regular expressions for the expert user.

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

Consider These Questions Which German Nobel laureate survived both world wars and outlived all four of his children? – The answer is Max Planck. Which politicians are also accomplished scientists? – The German chancellor Angela Merkel and Benjamin Franklin. How are Max Planck, Angela Merkel, Jim Gray, and the Dalai Lama related? – All four have doctoral degrees from German universities.

Why YAGO-NAGA? Three major research: – Semantic-Web-style knowledge repositories. Such as SUMO, OpenCyc, and WordNet. – Large-scale information extraction. – Social tagging and Web 2.0 communities that constitute the social Web. Wikipedia is another example of the Social Web paradigm. The challenge is how to extract the important facts from the Web and organize them into an explicit knowledge base that captures entities and semantic relationships among them.

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

How YAGO-NAGA Works? YAGO adopts concepts from the standardized SPARQL Protocol and RDF Query Language for RDF data but extends them through more expressive pattern matching and ranking. The prototype system that implements these features is NAGA.

Query for the YAGO Knowledge Base

A big US city with two airports, one named after a World War II hero, and one named after a World War II battle field?

Select Distinct ?c Where { ?c type City. ?c locatedIn USA. ?a1 type Airport. ?a2 type Airport. ?a1 locatedIn ?c. ?a2 locatedIn ?c. ?a1 namedAfter ?p. ?p type WarHero. ?a2 namedAfter ?b. ?b type BattleField. } Structured Knowledge Queries

Growing the Knowledge Base Word Net Wikipedia + YAGO Core Extractors YAGO Core Checker YAGO Core YAGO Gatherer YAGO Gatherer Hypotheses YAGO Gatherer YAGO Scrutinizer YAGO Web sources G r o w i n g knows  all entities focus on facts 19/38

Information Extraction from Wikipedia

Combine knowledge from WordNet & Wikipedia. Additional Gazetteers (geonames.org). YAGO Knowledge Base

Searching & Ranking RDF Graphs in NAGA Queries with regular expressions: Discovery queries: Connectedness queries: Ling$xscientist type hasFirstName | hasLastName $yZhejiang locatedIn * worksFor Beng Chin Ooi (coAuthor | advisor) * Kiel$xscientist type bornIn Ranking based on confidence, compactness and relevance $x Nobel prize hasWon $a diedOn $y hasSon $b diedOn > Thomas MannGoethe * German novelist type

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

YAGO Server: UI & API

YAGO-UI – Interactive online demo – RDF with time, space & provenance annotations – SPARQL + keywords YAGO-API Two basic WebServices: – processQuery (String query) – getYagoEntitiesByNames (String[] names)

Browse through the YAGO knowledge base. – sb.mpg.de/webyagospotlx/Browser sb.mpg.de/webyagospotlx/Browser Ask queries on YAGO using SPOTLX patterns. View the results on a map and timeline. – sb.mpg.de/webyagospotlx/WebInterface sb.mpg.de/webyagospotlx/WebInterface YAGO

Agenda What is YAGO-NAGA? Why YAGO-NAGA? How YAGO-NAGA Works? Demonstration YAGO-NAGA Sub-Projects

More than 13 sub-projects of YAGO-NAGA. AIDA: is a method, implemented in an online tool, for disambiguating mentions of named entities that occur in natural-language text or Web tables. –

Names, Surface Patterns & Paraphrases Which chemist was born in London? (I) Named entity disambiguation – chemist  wordnet_chemist, wordnet_pharmacist – born  Bertran_de_Born, Born_Identity_(Movie), Born_(Album) – London  London_UK, London_Arkansas, Antonio_London (II) Mapping surface patterns onto semantic relations – was_born_in  bornIn(, ) – was_born_in  bornOn(, ) (III) Paraphrases of questions [was] born in -born NNVBDVBNINNNP/LOC  bornIn(, )

References YAGO-NAGA Project: – YAGO: – NAGA: – naga/naga/demo.html naga/naga/demo.html