Aiming at prize for brilliant idea the world is not ready for.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Understanding Tables on the Web Jingjing Wang. Problem to Solve A wealth of information in the World Wide Web Not easy to access or process by machine.
Arnd Christian König Venkatesh Ganti Rares Vernica Microsoft Research Entity Categorization Over Large Document Collections.
YAGO: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Max-Planck-Institute for Computer Science, Saarbruecken,
Gerhard Weikum Max Planck Institute for Informatics & Saarland University Semantic Search: from Names and Phrases to.
YAGO-NAGA Project Presented By: Mohammad Dwaikat To: Dr. Yuliya Lierler CSCI 8986 – Fall 2012.
joint work with Shady Elbassuoni, Georgiana Ifrim, Gjergji Kasneci,
“ Leveraging SharePoint 2010 Search Technologies ” With: Ivan Neganov.
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
Database and Information- Retrieval Methods for Knowledge Discovery Database and Information- Retrieval Methods for Knowledge Discovery Gerhard Weikum,
Open Information Extraction From The Web Rani Qumsiyeh.
Presented by Zeehasham Rasheed
Security in Databases. 2 Outline review of databases reliability & integrity protection of sensitive data protection against inference multi-level security.
Which Nobel laureate survived both world wars and all his four children? Tandem What‘s this? Question AnsweringPhoto AnnotationTimeline Analysis.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
SEEKING STATEMENT-SUPPORTING TOP-K WITNESSES Date: 2012/03/12 Source: Steffen Metzger (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling Koh 1.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
DBrev: Dreaming of a Database Revolution Gjergji Kasneci, Jurgen Van Gael, Thore Graepel Microsoft Research Cambridge, UK.
Real-time population of Knowledge Bases: Opportunities and Challenges Ndapa Nakashole Gerhard Weikum AKBC Workshop at NAACL 2012.
An RDF and XML Database John Snelson, Lead Engineer 23 rd October 2013.
Ontology-Based Information Extraction: Current Approaches.
Optimizing Complex Extraction Programs over Evolving Text Data Fei Chen 1, Byron Gao 2, AnHai Doan 1, Jun Yang 3, Raghu Ramakrishnan 4 1 University of.
Information Extraction Lecture 8 – Ontological and Open IE CIS, LMU München Winter Semester Dr. Alexander Fraser.
Fabian M. SuchanekLEILA - Learning to Extract Information by Linguistic Analysis 1 LEILA – Learning to Extract Information by Linguistic Analysis presented.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
Summary Knowledge Bases from Web are Real, Big & Useful: Entities, Classes & Relations Key Asset for Intelligent Applications: Semantic Search, Question.
1 NAGA: Searching and Ranking Knowledge Gjergji Kasneci Joint work with: Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum.
Tutorial: Knowledge Bases for Web Content Analytics
Gaby Nativ, SDBI  Motivation  Other Ontologies  System overview  YAGO Dive IN  LEILA  NAGA  Conclusion.
Content Analytics – Uncovering Critical Insight YellowBrix 3/2/20161.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
A Review of Relational Machine Learning for Knowledge Graphs CVML Reading Group Xiao Lin.
Fundamental of Database Systems
Automatically Labeled Data Generation for Large Scale Event Extraction
An Introduction to Markov Logic Networks in Knowledge Bases
Data Mining, Data Science, Big Data
Automatic Video Shot Detection from MPEG Bit Stream
Blaz Novak, Mitja Trampus, Blaz Fortuna, Marko Grobelnik JSI
The reasons for marital breakdowns
David Shepherd, Zachary P. Fry, Emily Hill, Lori Pollock, and K
Database and Information - Retrieval Methods for Knowledge Discovery
DSS: Decision Support Systems and AI: Artificial Intelligence
Reading Report: Open QA Systems
Information Extraction from Wikipedia: Moving Down the Long Tail
Kbv Research | +1 (646) | Global Text Analytics Market Knowledge Based Value (KBV) Research Full report -
Web IR: Recent Trends; Future of Web Search
Lieneke Hoeksma Eurostat Dissemination Working Group, 1 October 2015
Author: dr. Martin Rusnák
SmaRT Visualization of Legal Rules for Compliance
Is BPM the killer app for agents?
Huayi Zhagn and Haiyan Liang
Extracting Semantic Concept Relations
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
Introduction Task: extracting relational facts from text
TRANSPORTATION SYSTEM DESIGN CHALLENGE
DBpedia 2014 Liang Zheng 9.22.
Introduction to Information Retrieval
Pei Lee, ICDE 2014, Chicago, IL, USA
MIS2502: Data Analytics Things You Can Do With Data & Information Architecture Zhe (Joe) Deng
ProBase: common Sense Concept KB and Short Text Understanding
Hierarchical, Perceptron-like Learning for OBIE
The Matrix Individually:
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
CS 239 – Big Data Systems Fall 2018
KnowItAll and TextRunner
Presentation transcript:

aiming at prize for brilliant idea the world is not ready for

Sex http://www.ofai.at/rascalli/

Drugs http://www-tsujii.is.s.u-tokyo.ac.jp/medie/

What? Who? Why? Information Extraction (Text2Record) & Fact Gathering by Rules, Patterns, Learning, Linguistics, Statistics Cimple / DBlife (U Wisconsin) KnowItAll / TextRunner (U Washington) Avatar (IBM) YAGO-NAGA (MPI) Google, Microsoft, Yahoo, ... ... Most new knowledge first produced in NL text (news, blogs, scientific/business articles, dossiers, wikipedia) Lift text-borne content into value-added „semantic DB“ (entities & relations, RDF)

Sex: Scale Matters Can we correctly extract all worthy facts at the scale and rate by which news, blogs, gossips are produced ? Benchmark Challenge: for all people in Wikipedia (100,000‘s) gather all spouses, incl. divorced & widowed, and corresponding time periods! >95% accuracy, >95% coverage, in one night redundancy of sources helps, stresses scalability even more consistency constraints are potentially helpful: FD‘s: {husband, time}  wife, {wife, time}  husband inclusion dependencies: marriedPerson  adultPerson age/time/gender restrictions: birthdate +  < marriage < divorce

Databases (Know-How) scalability and system-oriented approach coping with trade-offs: precision vs. recall, speed vs. quality, freshness vs. accuracy quality boosting and efficiency gains by consistency constraints capturing & tracking confidence and lineage showcase for prob DB (more tractable than general case) killer app for RDF graph data

Conclusion Information extraction (text2data) is a major research avenue. It‘s sexy, and should get you hooked! DB-system know-how is vital!