Bayesian Networks (Directed Acyclic Graphical Models)

Slides:

Advertisements

Similar presentations

Markov Networks Alan Ritter.

Advertisements

Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.

Graphical Models BRML Chapter 4 1. the zoo of graphical models Markov networks Belief networks Chain graphs (Belief and Markov ) Factor graphs =>they.

BAYESIAN NETWORKS CHAPTER#4 Book: Modeling and Reasoning with Bayesian Networks Author : Adnan Darwiche Publisher: CambridgeUniversity Press 2009.

Identifying Conditional Independencies in Bayes Nets Lecture 4.

Bayesian Networks VISA Hyoungjune Yi. BN – Intro. Introduced by Pearl (1986 ) Resembles human reasoning Causal relationship Decision support system/ Expert.

EE462 MLCV Lecture Introduction of Graphical Models Markov Random Fields Segmentation Tae-Kyun Kim 1.

Chapter 8-3 Markov Random Fields 1. Topics 1. Introduction 1. Undirected Graphical Models 2. Terminology 2. Conditional Independence 3. Factorization.

From Variable Elimination to Junction Trees

Bayesian Networks A causal probabilistic network, or Bayesian network,

Bayesian Networks Chapter 2 (Duda et al.) – Section 2.11

PGM 2003/04 Tirgul 3-4 The Bayesian Network Representation.

Bayesian Belief Networks

Bayesian Network Representation Continued

. Bayesian Networks Lecture 9 Edited from Nir Friedman’s slides by Dan Geiger from Nir Friedman’s slides.

. Inference I Introduction, Hardness, and Variable Elimination Slides by Nir Friedman.

Bayesian Networks Alan Ritter.

PGM 2002/03 Tirgul5 Clique/Junction Tree Inference.

. DAGs, I-Maps, Factorization, d-Separation, Minimal I-Maps, Bayesian Networks Slides by Nir Friedman.

Made by: Maor Levy, Temple University  Probability expresses uncertainty.  Pervasive in all of Artificial Intelligence  Machine learning 

Undirected Models: Markov Networks David Page, Fall 2009 CS 731: Advanced Methods in Artificial Intelligence, with Biomedical Applications.

Introduction to Bayesian Networks

1 COROLLARY 4: D is an I-map of P iff each variable X is conditionally independent in P of all its non-descendants, given its parents. Proof  : Each variable.

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS.

Computing & Information Sciences Kansas State University Data Sciences Summer Institute Multimodal Information Access and Synthesis Learning and Reasoning.

Bayesian Networks Aldi Kraja Division of Statistical Genomics.

1 Bayesian Networks (Directed Acyclic Graphical Models) The situation of a bell that rings whenever the outcome of two coins are equal can not be well.

1 Use graphs and not pure logic Variables represented by nodes and dependencies by edges. Common in our language: “threads of thoughts”, “lines of reasoning”,

Christopher M. Bishop, Pattern Recognition and Machine Learning 1.

Reasoning Under Uncertainty: Independence and Inference CPSC 322 – Uncertainty 5 Textbook §6.3.1 (and for HMMs) March 25, 2011.

Introduction on Graphic Models

1 BN Semantics 2 – Representation Theorem The revenge of d-separation Graphical Models – Carlos Guestrin Carnegie Mellon University September 17.

1 BN Semantics 1 Graphical Models – Carlos Guestrin Carnegie Mellon University September 15 th, 2006 Readings: K&F: 3.1, 3.2, 3.3.

. Bayesian Networks Some slides have been edited from Nir Friedman’s lectures which is available at Changes made by Dan Geiger.

CS 2750: Machine Learning Bayesian Networks Prof. Adriana Kovashka University of Pittsburgh March 14, 2016.

Knowledge Representation & Reasoning Lecture #5 UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2005 (Based on slides by Lise Getoor and Alvaro.

Bayesian Networks Chapter 2 (Duda et al.) – Section 2.11 CS479/679 Pattern Recognition Dr. George Bebis.

Instructor: Eyal Amir Grad TAs: Wen Pu, Yonatan Bisk

CS 2750: Machine Learning Directed Graphical Models

Bayesian networks Chapter 14 Section 1 – 2.

Presented By S.Yamuna AP/CSE

Exam Preparation Class

Exact Inference ..

Bayesian Networks Background Readings: An Introduction to Bayesian Networks, Finn Jensen, UCL Press, Some slides have been edited from Nir Friedman’s.

Bell & Coins Example Coin1 Bell Coin2

The set  of all independence statements defined by (3

Dependency Models – abstraction of Probability distributions

General Gibbs Distribution

Bayesian Networks Based on

Independence in Markov Networks

Markov Networks Independencies Representation Probabilistic Graphical

CAP 5636 – Advanced Artificial Intelligence

Bayesian Networks Independencies Representation Probabilistic

Markov Networks.

Exact Inference ..

Independence in Markov Networks

Markov Random Fields Presented by: Vladan Radosavljevic.

CS 188: Artificial Intelligence

Exact Inference Continued

CS 188: Artificial Intelligence Spring 2007

Factorization & Independence

Factorization & Independence

Markov Networks Independencies Representation Probabilistic Graphical

Bayesian networks Chapter 14 Section 1 – 2.

Independence in Markov Networks

Markov Networks Independencies Representation Probabilistic Graphical

CS 188: Artificial Intelligence Spring 2006

Bayesian networks (2) Lirong Xia. Bayesian networks (2) Lirong Xia.

CS 188: Artificial Intelligence Fall 2008

Bayesian networks (2) Lirong Xia.

Presentation transcript:

Bayesian Networks (Directed Acyclic Graphical Models) Coin1 Coin2 Bell The situation of a bell that rings whenever the outcome of two coins are equal can not be well represented by undirected graphical models. A clique will be formed because of induced dependency of the two coins given the bell.

Bayesian Networks (Directed Graphical Models)

Example I X1 X2 X3 X4 ID( X3 ; X1 | X2) ID( X4 ; {X1, X2}| X3)

Example II In the order V,S,T,L,B,A,X,D, we have: ID( S; V ) ID( T; S | V ) ID( l; {T, V} | S ) … ID( X; {V,S,T,L,B,D} | A) Does ID( {X, D} ; V | A ) also hold ? To answer this question one needs to analyze the types of paths that connect {X, D} and V.

Paths Intuition: dependency must “flow” along paths in the graph A path is a sequence of neighboring variables Examples: X  A  D  B A  L  S  B V S L T A B X D

Path blockage Every path is classified given the evidence: active -- creates a dependency between the end nodes blocked – does not create a dependency between the end nodes Evidence means the assignment of a value to a subset of nodes.

Path Blockage Three cases: Common cause Blocked Blocked Active S L B S

Path Blockage Three cases: Common cause Intermediate cause Blocked Active Blocked S A L

Path Blockage Three cases: Common cause Intermediate cause Common Effect Blocked Active Blocked T L X A T L X A

Definition of Path Blockage Definition: A path is active, given evidence Z, if Whenever we have the configuration then either A or one of its descendents is in Z No other nodes in the path are in Z. Definition: A path is blocked, given evidence Z, if it is not active. T L A Definition: X is d-separated from Y, given Z, if all paths from a node in X and a node in Y are blocked, given Z.

Example ID(T,S|) = yes V S L T A B X D

Example ID (T,S |) = yes ID(T,S|D) = no V S L T A B X D

Example ID (T,S |) = yes ID(T,S|D) = no ID(T,S|{D,L,B}) = yes V S L T

d-separation The definition of ID (X; Y | Z) is such that: Soundness [Theorem 9]: ID (X; Y | Z) = yes implies IP(X;Y|Z) follows from Basis(G) Completeness [Theorem 10]: ID (X; Y | Z) = no implies IP(X;Y|Z) does not follow from Basis(G)

Revisiting Example II So does IP( {X, D} ; V | A ) hold ? V S L T A B

Extension of the Markov Chain Property

How Expressive are Bayesian Networks

Quantifying the links of Bayesian Networks p(v) p(s) V S L T A B X D p(t|v) p(l|s) p(b|s) p(a|t,l) p(d|a,b) p(x|a) Bayesian network = Directed Acyclic Graph (DAG), annotated with conditional probability distributions.

Bayesian Network (cont.) Each Directed Acyclic Graph defines a factorization of the form: p(t|v) V S L T A B X D p(x|a) p(d|a,b) p(a|t,l) p(b|s) p(l|s) p(s) p(v)

Independence in Bayesian networks IP( Xi ; {X1,…,Xi-1}\Pai | Pai ) This set of independence assertions is denoted Basis(G) . All other independence assertions that are entailed by (*) are derivable using the semi-graphoid axioms.

Local distributions- Asymmetric independence Lung Cancer (Yes/No) Tuberculosis Abnormality in Chest (Yes/no) p(A|T,L) Table: p(A=y|L=n, T=n) = 0.02 p(A=y|L=n, T=y) = 0.60 p(A=y|L=y, T=n) = 0.99 p(A=y|L=y, T=y) = 0.99