PLUIE: Probability and Logic Unified for Information Extraction Stuart Russell Patrick Gallinari, Patrice Perny.

Slides:



Advertisements
Similar presentations
Slide 1 of 18 Uncertainty Representation and Reasoning with MEBN/PR-OWL Kathryn Blackmond Laskey Paulo C. G. da Costa The Volgenau School of Information.
Advertisements

Dimensional Modeling.
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
PROBABILITY. Uncertainty  Let action A t = leave for airport t minutes before flight from Logan Airport  Will A t get me there on time ? Problems :
Flipping A Biased Coin Suppose you have a coin with an unknown bias, θ ≡ P(head). You flip the coin multiple times and observe the outcome. From observations,
Plan Recognition with Multi- Entity Bayesian Networks Kathryn Blackmond Laskey Department of Systems Engineering and Operations Research George Mason University.
Cloud Computing Lecture #3 More MapReduce Jimmy Lin The iSchool University of Maryland Wednesday, September 10, 2008 This work is licensed under a Creative.
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
Introduction  Bayesian methods are becoming very important in the cognitive sciences  Bayesian statistics is a framework for doing inference, in a principled.
CSE 574 – Artificial Intelligence II Statistical Relational Learning Instructor: Pedro Domingos.
16722 Sensing and Sensors Mel Siegel )
KI2 - 2 Kunstmatige Intelligentie / RuG Probabilities Revisited AIMA, Chapter 13.
Semantic (Language) Models: Robustness, Structure & Beyond Thomas Hofmann Department of Computer Science Brown University Chief Scientist.
EG1204: Earth Systems: an introduction Meteorology and Climate Lecture 7 Climate: prediction & change.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
1 Department of Computer Science and Engineering, University of South Carolina Issues for Discussion and Work Jan 2007  Choose meeting time.
Ai in game programming it university of copenhagen Welcome to... the Crash Course Probability Theory Marco Loog.
1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Tues 4-5 TA: Yves Petinot 719 CEPSR,
Ch. 3.1 – Measurements and Their Uncertainty
Bayesian Analysis for Extreme Events Pao-Shin Chu and Xin Zhao Department of Meteorology School of Ocean & Earth Science & Technology University of Hawaii-
Probabilistic Reasoning
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Mean Field Inference in Dependency Networks: An Empirical Study Daniel Lowd and Arash Shamaei University of Oregon.
CMPS 3223 Theory of Computation Automata, Computability, & Complexity by Elaine Rich ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Slides provided.
Author: William Tunstall-Pedoe Presenter: Bahareh Sarrafzadeh CS 886 Spring 2015.
Today’s Topics Chapter 2 in One Slide Chapter 18: Machine Learning (ML) Creating an ML Dataset –“Fixed-length feature vectors” –Relational/graph-based.
Probability and Statistics Required!. 2 Review Outline  Connection to simulation.  Concepts to review.  Assess your understanding.  Addressing knowledge.
Understanding PML Paulo Pinheiro da Silva. PML PML is a provenance language (a language used to encode provenance knowledge) that has been proudly derived.
Dr. Matthew Iklé Department of Mathematics and Computer Science Adams State College Probabilistic Quantifier Logic for General Intelligence: An Indefinite.
Visibility Graph. Voronoi Diagram Control is easy: stay equidistant away from closest obstacles.
Flexible Text Mining using Interactive Information Extraction David Milward
Representational and inferential foundations for possible large-scale information extraction and question-answering from the web Stuart Russell Computer.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Grade 8 – Writing Standards Text Types and Purposes (1b) Write arguments to support claims with clear reasons and relevant evidence. Support claim(s) with.
1 Knowledge Representation CS 171/CS How to represent reality? Use an ontology (a formal representation of reality) General/abstract domain Specific.
Artificial Intelligence Chapter 18. Representing Commonsense Knowledge.
Intro to Scientific Research Methods in Geography Chapter 2: Fundamental Research Concepts.
Company LOGO Digital Infrastructure of RPI Personal Library Qi Pan Digital Infrastructure of RPI Personal Library Qi Pan.
Physical Science Methods and Math Describing Matter The Scientific Method Measurements and Calculations 1.
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
Computing & Information Sciences Kansas State University Monday, 25 Sep 2006CIS 490 / 730: Artificial Intelligence Lecture 14 of 42 Monday, 25 September.
Uncertainty Chapter 13. Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes' Rule.
Mining the Biomedical Research Literature Ken Baclawski.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Inference: Probabilities and Distributions Feb , 2012.
Writing Informative Grades College and Career Readiness Standards for Writing Text Types and Purposes arguments 1.Write arguments to support a substantive.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
The Unreasonable Effectiveness of Data
SATMathVideos.Net A set S consists of all multiples of 4. Which of the following sets are contained within set S? A) S2 only B) S4 only C) S2 and S4 D)
Representational and inferential foundations for possible large-scale information extraction and question-answering from the web Stuart Russell Computer.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Uncertainty Let action A t = leave for airport t minutes before flight Will A t get me there on time? Problems: 1.partial observability (road state, other.
1 Scalable Probabilistic Databases with Factor Graphs and MCMC Michael Wick, Andrew McCallum, and Gerome Miklau VLDB 2010.
Review of statistical modeling and probability theory Alan Moses ML4bio.
Graphing Notes Graphs – diagrams telling how two variables or factors are related Graphs – diagrams telling how two variables or factors are related Show.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 24 of 41 Monday, 18 October.
N EVER -E NDING L ANGUAGE L EARNING (NELL) Jacqueline DeLorie.
Definition and Technologies Knowledge Representation.
Chapter 3 Language Acquisition: A Linguistic Treatment Jang, HaYoung Biointelligence Laborotary Seoul National University.
Measurement Systems for Sustainability Arrow’10 Inclusive wealth – one particular metric Parris & Kates Review 12 indicator initiatives  How do we choose.
Learning Bayesian Networks for Complex Relational Data
View Controllers In the Model-View-Controller (MVC) design pattern, a controller object provides the custom logic needed to bridge the application’s data.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Social Knowledge Mining
Ontology Reuse In MBSE Henson Graves Abstract January 2011
cs540 - Fall 2016 (Shavlik©), Lecture 20, Week 11
Statistical Relational AI
Presentation transcript:

PLUIE: Probability and Logic Unified for Information Extraction Stuart Russell Patrick Gallinari, Patrice Perny

Project Goals “Open” information extraction – Construct knowledge bases from the web – Learn new classes, relations, linguistic patterns – Learn new predictive regularities – Integrate facts, entities across multiple documents – Support question answering Accuracy, consistency, integration, and utility; not scale for its own sake

Approach Probabilistic inference with the Web as evidence Generative models when available WorldWeb

Approach, contd. Open-universe probability models (e.g., BLOG) – First-order expressive power (objects, relations, functions, quantifiers, equality, etc.) – Allow for uncertainty about existence, identity of objects Generative model consists of – What might be true in the world – Who might choose to say what – How they might choose to say it

Approach contd. Rigorous ontological framework – Standard taxonomic hierarchy that supports distinctions needed for language E.g., mass nouns (water) vs count nouns (lake) – Proper treatment of events and time; avoid deficient “facts” such as Man Utd beat Chelsea; Chelsea beat Man Utd (PowerSet) Hank Paulson is the CEO of Goldman Sachs (NELL)

Open questions Efficient inference – What is extracted? Posterior over possible worlds? How to identify new categories and relations HCI: Presenting infinite heterogeneous posterior distributions: Who wrote what when when “who,” “what” and “when” vary across worlds? Making use of partially extracted or unextracted information – “data spaces” (Franklin, Halevy) Adversarial data: game-theoretic analysis?

Plan Reading group – Weekly meeting (day and time?) – Participants take turns presenting – Reading list at Formal project (ANR) runs 1/1/13 to 8/31/14 – Will continue indefinitely – Hiring two postdocs Possible collaborations – Tom Mitchell’s NELL project (CMU) – Andrew McCallum (UMass) – Kevin Murphy (Google’s Knowledge Graph project)