Oregon State University – CS539 PRMs Learning Probabilistic Models of Link Structure Getoor, Friedman, Koller, Taskar.

Slides:



Advertisements
Similar presentations
Day Topic: Lab Write Up Objective: I will graph data from table and compare XT graphs of uniform and accelerated motion. Starter: Acceleration.
Advertisements

Learning Relational Probability Trees Jennifer Neville David Jensen Lisa Friedland Michael Hay Presented by Andrew Tjang.
Learning Probabilistic Relational Models Daphne Koller Stanford University Nir Friedman Hebrew University Lise.
Identifying Conditional Independencies in Bayes Nets Lecture 4.
For Monday Read chapter 18, sections 1-2 Homework: –Chapter 14, exercise 8 a-d.
An introduction to Bayesian networks Stochastic Processes Course Hossein Amirkhani Spring 2011.
CPSC 422, Lecture 33Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 33 Apr, 8, 2015 Slide source: from David Page (MIT) (which were.
Modelling Relational Statistics With Bayes Nets School of Computing Science Simon Fraser University Vancouver, Canada Tianxiang Gao Yuke Zhu.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Hierarchical Probabilistic Relational Models for Collaborative Filtering Jack Newton and Russ Greiner
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Testing an individual module
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Lecture 10 Comparison and Evaluation of Alternative System Designs.
Journal Status* Using the PageRank Algorithm to Rank Journals * J. Bollen, M. Rodriguez, H. Van de Sompel Scientometrics, Volume 69, n3, pp , 2006.
Online Stacked Graphical Learning Zhenzhen Kou +, Vitor R. Carvalho *, and William W. Cohen + Machine Learning Department + / Language Technologies Institute.
Software Testing and QA Theory and Practice (Chapter 4: Control Flow Testing) © Naik & Tripathy 1 Software Testing and Quality Assurance Theory and Practice.
Chapter 2 Modeling and Finding Abnormal Nodes. How to define abnormal nodes ? One plausible answer is : –A node is abnormal if there are no or very few.
A Logical Formulation of PRMs. Example Institute(InstId,Type) Researcher(RID,Area,Salary,InstID) Paper(PaperId,Topic) Author(RId,PaperID) Cites(PaperId1,PaperId2)
4.4 Equations as Relations
Undirected Models: Markov Networks David Page, Fall 2009 CS 731: Advanced Methods in Artificial Intelligence, with Biomedical Applications.
No. 1 Classification and clustering methods by probabilistic latent semantic indexing model A Short Course at Tamkang University Taipei, Taiwan, R.O.C.,
Collective Classification A brief overview and possible connections to -acts classification Vitor R. Carvalho Text Learning Group Meetings, Carnegie.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Max-Margin Markov Networks by Ben Taskar, Carlos Guestrin, and Daphne Koller Presented by Michael Cafarella CSE574 May 25, 2005.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
UIUC CS 497: Section EA Lecture #8 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004 (Based on slides by Lise Getoor (UMD))
Probabilistic Models of Object-Relational Domains
No. 1 Knowledge Acquisition from Documents with both Fixed and Free Formats* Shigeich Hirasawa Department of Industrial and Management Systems Engineering.
Algorithmic Detection of Semantic Similarity WWW 2005.
Daphne Koller Message Passing Belief Propagation Algorithm Probabilistic Graphical Models Inference.
Probabilistic Models for Relational Data Seminar Data Mining (SS 2005) Prof. Dr. Thomas Hofmann Dipl. Inform. Steffen Hartmann Xin Dong 05,07,2005.
Advisor : Prof. Sing Ling Lee Student : Chao Chih Wang Date :
Advisor : Prof. Sing Ling Lee Student : Chao Chih Wang Date :
CPSC 322, Lecture 33Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 33 Nov, 30, 2015 Slide source: from David Page (MIT) (which were.
CPSC 422, Lecture 33Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 34 Dec, 2, 2015 Slide source: from David Page (MIT) (which were.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Bibliography Scientific Paper ? ? ? Document Collection.
Copyright © 2013 Pearson Education, Inc. Section 3.2 Linear Equations in Two Variables.
A Bayesian Perspective to Semantic Web – Uncertainty modeling in OWL Jyotishman Pathak 04/28/2005.
Learning Statistical Models From Relational Data Lise Getoor University of Maryland, College Park Includes work done by: Nir Friedman, Hebrew U. Daphne.
3.1 Solving Systems Using Tables and Graphs When you have two or more related unknowns, you may be able to represent their relationship with a system of.
Probabilistic Equational Reasoning Arthur Kantor
1 Scalable Probabilistic Databases with Factor Graphs and MCMC Michael Wick, Andrew McCallum, and Gerome Miklau VLDB 2010.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
“The Monkey’s Paw” By W. W. Jacobs. Partner Extension Activity 1. With your partner, choose one extension activity to complete. 2. Look at your Mood and.
Introduction on Graphic Models
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
CIS750 – Seminar in Advanced Topics in Computer Science Advanced topics in databases – Multimedia Databases V. Megalooikonomou Link mining ( based on slides.
Probabilistic Reasoning Inference and Relational Bayesian Networks.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Algebra 1 Section 4.2 Graph linear equation using tables The solution to an equation in two variables is a set of ordered pairs that makes it true. Is.
No. 1 Classification Methods for Documents with both Fixed and Free Formats by PLSI Model* 2004International Conference in Management Sciences and Decision.
Control Flow Testing Handouts
Handouts Software Testing and Quality Assurance Theory and Practice Chapter 4 Control Flow Testing
Sofus A. Macskassy Fetch Technologies
Outline of the Chapter Basic Idea Outline of Control Flow Testing
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
CAP 5636 – Advanced Artificial Intelligence
RESPONSE.
Understanding the Features of a Web Site
CS 188: Artificial Intelligence
Hierarchical Relational Models for Document Networks
Lesson 1.4 Page 25 Ordered Pairs and Relations.
Discriminative Probabilistic Models for Relational Data
Label and Link Prediction in Relational Data
Probabilistic Databases with MarkoViews
Presentation transcript:

Oregon State University – CS539 PRMs Learning Probabilistic Models of Link Structure Getoor, Friedman, Koller, Taskar

Oregon State University – CS539 PRMs Example Application: WebKB  Classify web page as course, student, professor, project, none using… Words on the web page Links from other web pages (and the class of those pages, recursively) Words in the “anchor text” from the other page anchor text.  Web pages obtained from Cornell, Texas, Washington, and Wisconsin

Oregon State University – CS539 PRMs Example Application: CORA  Classify documents according to topic (7 levels) using… words in the document papers cited by the document papers citing the document

Oregon State University – CS539 PRMs Standard PRM  parents(Doc.class) = {MODE(Doc.citers.class),MODE(Doc.cited.clas s)} Document class words Document class words Document class words Document class words Document class words Document class words Document class words Document class words citers cited MODE

Oregon State University – CS539 PRMs Problem: The Citation Structure is Fixed  The existence (or non-existence) of a link cannot serve as evidence  Individually-linked papers only influence the class through the MODE.

Oregon State University – CS539 PRMs Possible Solution: Link Uncertainty  Model the existence of links as random variables  Create a Link instance for each pair of possibly-linked objects

Oregon State University – CS539 PRMs Unrolled Network Document class words Document class words Document class words Cites Exists Cites Exists Cites Exists

Oregon State University – CS539 PRMs Getoor’s Diagram  Entity classes (Paper)  Relation classes (Cites)  Technically, every instance has an Exists variable which is true for all Entity instances.

Oregon State University – CS539 PRMs Semantics  P is the basic CPT  P* will be the equivalent unrolled CPT  Require that an object does not exist if any of the objects it points to do not exist

Oregon State University – CS539 PRMs WebKB Network

Oregon State University – CS539 PRMs Experimental Results  Cora and WebKB

Oregon State University – CS539 PRMs WebKB with various features

Oregon State University – CS539 PRMs A Second Approach: Reference Uncertainty  Treat reference attributes as random variables Each reference attribute takes as value an object of the indicated class  Citation Citing: reference attribute, value is a Paper Cited: reference attribute, value is a Paper

Oregon State University – CS539 PRMs Problems  How many citation objects exist? Consequently, how many reference random variables exist?  How do we represent P(Citation.cites | …)? Citation.cites could take on thousands of possible values. Huge conditional probability table Costly inference at run time

Oregon State University – CS539 PRMs Solutions Problem 1: How many citations?  Fix the number of Citation objects  This gives the “object skeleton”

Oregon State University – CS539 PRMs Problem 2: Too many potential values for a reference attribute  Attach to each reference attribute a set of partition attributes The reference attribute chooses a partition A Paper is then chosen uniformly at random from the partition Citation Citing Cited Paper Theory Paper Graphics Paper Learning

Oregon State University – CS539 PRMs Representing Constraints Between Citing and Cited Papers Parents(Cites.Cited) = {Cites.Citing.Topic}

Oregon State University – CS539 PRMs Details  Each reference attribute  has a selector attribute S  that chooses the partition. Citation Paper Learning Paper Theory Paper Graphics Paper S citing Citing S cited Cited

Oregon State University – CS539 PRMs Class-level Dependency Graph  Five types of edges Type I: edges within a single object Type II: edges between objects Type III: edges from every reference attribute along any reference paths Type IV: edges from every partition attribute to the selector attributes that use those partition attributes to choose a partition Type V: edge from selector attributes to their corresponding reference attributes

Oregon State University – CS539 PRMs Movie Theater Example  Type I: Genre  Popularity  Type II: Shows.Movie.Genre  Shows.Profit Shows.Theater.Type  S Movie  Type III: Move  Profit; Theater  S movie  Type IV: Genre  S Movie  Type V: S Theater  Theater; S Movie  Movie

Oregon State University – CS539 PRMs Unrolled Graph?  The Unrolled Graph can have a huge number of edges  Is learning and inference really feasible?

Oregon State University – CS539 PRMs Homework Exercise  Construct the dependency graph for the citation example  Construct an unrolled network for a reference uncertainty example