HANDLING UNCERTAINTY IN INFORMATION EXTRACTION Maurice van Keulen and Mena Badieh Habib URSW 23 Oct 2011.

Slides:



Advertisements
Similar presentations
Slide 1 of 18 Uncertainty Representation and Reasoning with MEBN/PR-OWL Kathryn Blackmond Laskey Paulo C. G. da Costa The Volgenau School of Information.
Advertisements

Google News Personalization: Scalable Online Collaborative Filtering
Bayesian Network and Influence Diagram A Guide to Construction And Analysis.
Representing and Querying Correlated Tuples in Probabilistic Databases
5/17/20151 Probabilistic Reasoning CIS 479/579 Bruce R. Maxim UM-Dearborn.
 Andisheh Keikha Ryerson University Ebrahim Bagheri Ryerson University May 7 th
Causal and Bayesian Network (Chapter 2) Book: Bayesian Networks and Decision Graphs Author: Finn V. Jensen, Thomas D. Nielsen CSE 655 Probabilistic Reasoning.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
CPSC 322, Lecture 26Slide 1 Reasoning Under Uncertainty: Belief Networks Computer Science cpsc322, Lecture 27 (Textbook Chpt 6.3) March, 16, 2009.
Erasmus University Rotterdam Frederik HogenboomEconometric Institute School of Economics Flavius Frasincar.
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
Self-Tuning and Self-Configuring Systems Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems March 16, 2005.
Bayesian Networks. Motivation The conditional independence assumption made by naïve Bayes classifiers may seem to rigid, especially for classification.
Bayesian networks Chapter 14 Section 1 – 2.
Bayesian Belief Networks
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
1 Bayesian Reasoning Chapter 13 CMSC 471 Adapted from slides by Tim Finin and Marie desJardins.
Representing Uncertainty CSE 473. © Daniel S. Weld 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one.
Pattern Classification, Chapter 1 1 Basic Probability.
Methodology Conceptual Database Design
CIS 410/510 Probabilistic Methods for Artificial Intelligence Instructor: Daniel Lowd.
Bayesian Reasoning. Tax Data – Naive Bayes Classify: (_, No, Married, 95K, ?)
Cramer-Shoup is Plaintext Aware in the Standard Model Alexander W. Dent Information Security Group Royal Holloway, University of London.
Efficient Query Evaluation over Temporally Correlated Probabilistic Streams Bhargav Kanagal, Amol Deshpande ΗΥ-562 Advanced Topics on Databases Αλέκα Σεληνιωτάκη.
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach Wenjie Zhang, Xuemin Lin The University of New South Wales & NICTA Ming Hua,
Value of information Marko Tainio Decision analysis and Risk Management course in Kuopio
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Bayesian networks. Motivation We saw that the full joint probability can be used to answer any question about the domain, but can become intractable as.
An Introduction to Artificial Intelligence Chapter 13 & : Uncertainty & Bayesian Networks Ramin Halavati
CS774. Markov Random Field : Theory and Application Lecture 19 Kyomin Jung KAIST Nov
Computing & Information Sciences Kansas State University Wednesday, 22 Oct 2008CIS 530 / 730: Artificial Intelligence Lecture 22 of 42 Wednesday, 22 October.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Characterizing the Uncertainty of Web Data: Models and Experiences Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti Università degli Studi.
Facilitating Document Annotation using Content and Querying Value.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu & Gagan Agrawal Enabling.
Uncertainty Management in Rule-based Expert Systems
Uncertainty. Assumptions Inherent in Deductive Logic-based Systems All the assertions we wish to make and use are universally true. Observations of the.
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
Uncertainty Chapter 13. Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes' Rule.
1 Information Retrieval LECTURE 1 : Introduction.
LO: I will evaluate Hume’s argument against Miracles. Hmk – Prepare presentations for Tuesday’s lesson.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Query Suggestions in the Absence of Query Logs Sumit Bhatia, Debapriyo Majumdar,Prasenjit Mitra SIGIR’11, July 24–28, 2011, Beijing, China.
Textbook Basics of an Expert System: – “Expert systems: Design and Development,” by: John Durkin, 1994, Chapters 1-4. Uncertainty (Probability, Certainty.
1 Scalable Probabilistic Databases with Factor Graphs and MCMC Michael Wick, Andrew McCallum, and Gerome Miklau VLDB 2010.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
A Unified Approach to Ranking in Probabilistic Databases Jian Li, Barna Saha, Amol Deshpande University of Maryland, College Park, USA VLDB
CSE 473 Uncertainty. © UW CSE AI Faculty 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one has stood.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu and Gagan Agrawal Enabling.
Facilitating Document Annotation Using Content and Querying Value.
Chapter 12. Probability Reasoning Fall 2013 Comp3710 Artificial Intelligence Computing Science Thompson Rivers University.
Business Process Modeling What is a process model? – A formal way of representing how a business system operates. – Illustrates the activities that are.
REASONING UNDER UNCERTAINTY: CERTAINTY THEORY
Learning Bayesian Networks for Complex Relational Data
Business Process Modeling
Probabilistic Data Management
Probabilistic Data Management
Probabilistic Data Management
Lecture 16: Probabilistic Databases
Mutually exclusive nothing in common.
Batyr Charyyev.
Class #21 – Monday, November 10
Probabilistic Databases
28th September 2005 Dr Bogdan L. Vrusias
Probabilistic Reasoning
Presentation transcript:

HANDLING UNCERTAINTY IN INFORMATION EXTRACTION Maurice van Keulen and Mena Badieh Habib URSW 23 Oct 2011

Uncertainty Reasoning for the Semantic Web, Bonn, Germany 2 INFORMATION EXTRACTION Unstructured Text Web of Data Information extraction Inherently imperfect process Word “Paris” First name? City? … City => over 60 cities “Paris” Toponyms: 46% >2 refs Word “Paris” First name? City? … City => over 60 cities “Paris” Toponyms: 46% >2 refs source: GeoNames Goal: Technology to support the development of domain specific information extractors “We humans happily deal with doubt and misinterpretation every day; Why shouldn’t computers?”

 Annotations are uncertain Maintain alternatives + probabilities throughout process (incl. result)  Unconventional starting point Not “no annotations”, but “no knowledge, hence anything is possible”  Developer interactively defines information extractor until “good enough” Iterations: Add knowledge, apply to sample texts, evaluate result  Scalability for storage, querying, manipulation of annotations From my own field (databases): Probabilistic databases? 23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 3 SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION “when you have eliminated the impossible, whatever remains, however improbable, must be the truth” Information extraction is about gathering enough evidence to decide upon a certain combination of annotations among many possible ones Evidence comes from ML + developer (generic) + end user (instances) Information extraction is about gathering enough evidence to decide upon a certain combination of annotations among many possible ones Evidence comes from ML + developer (generic) + end user (instances)

23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 4 SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION EXAMPLE: NAMED ENTITY RECOGNITION (NER) “when you have eliminated the impossible, whatever remains, however improbable, must be the truth” Person City Toponym dnc isa inter-actively defined

 |A|=O(klt) linear?!? k: length of string l: maximum length phrases considered t: number of entity types  Here: 28 * 3 = 84 possible annotations  URSW call for papers about 1300 words say 20 types say max length 6 (I saw one with 5) = roughly 1300 * 20 * 6 = roughly 156,000 possible annotations 23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 5 SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION EXAMPLE: NAMED ENTITY RECOGNITION (NER) Although conceptual/theoretical, it doesn’t seem to be a severe challenge for a probabilistic database The problem is not in the amount of alternative annotations!

23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 6 ADDING KNOWLEDGE = CONDITIONING Paris Hilton stayed in the Paris Hilton Person --- dnc --- City x 1 2 (“Paris” is a City)[a] x 8 1 (“Paris Hilton” is a Person)[b] become mutually exclusive a and b independent P(a)=0.6 P(b)=0.8 a and b mutually exclusive (a ∧ b is not possible)

23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 7 ADDING KNOWLEDGE CREATES DEPENDENCIES NUMBER OF DEPS MAGNITUDES IN SIZE SMALLER THAN POSSIBLE COMBINATIONS Paris Hilton stayed Person City Person City dnc neq

I’m looking for a scalable approach to reason and redistribute probability mass considering all these dependencies to find the remaining possible interpretations and their probabilities  Feasibility approach hinges on efficient representation and conditioning of probabilistic dependencies  Solution directions (in my own field):  Koch etal VLDB 2008 (Conditioning in MayBMS)  Getoor etal VLDB 2008 (Shared correlations)  This is not about only learning a joint probability distribution. Here I’d like to estimate a joint probability distribution based on initial independent observations and then batch-by-batch add constraints/dependencies and recalculate  Techniques out there that fit this problem? 23 Oct 2011Uncertainty Reasoning for the Semantic Web, Bonn, Germany 8 PROBLEM AND SOLUTION DIRECTIONS Questions / Suggestions?