1 Exact Inference Algorithms Bucket-elimination and more COMPSCI 179, Spring 2010 Set 8: Rina Dechter (Reading: chapter 14, Russell and Norvig.

Slides:



Advertisements
Similar presentations
Exact Inference. Inference Basic task for inference: – Compute a posterior distribution for some query variables given some observed evidence – Sum out.
Advertisements

Variational Methods for Graphical Models Micheal I. Jordan Zoubin Ghahramani Tommi S. Jaakkola Lawrence K. Saul Presented by: Afsaneh Shirazi.
CS498-EA Reasoning in AI Lecture #15 Instructor: Eyal Amir Fall Semester 2011.
Lauritzen-Spiegelhalter Algorithm
Join-graph based cost-shifting Alexander Ihler, Natalia Flerova, Rina Dechter and Lars Otten University of California Irvine Introduction Mini-Bucket Elimination.
Bucket Elimination: A unifying framework for Probabilistic inference Rina Dechter presented by Anton Bezuglov, Hrishikesh Goradia CSCE 582 Fall02 Instructor:
Bayesian Networks Bucket Elimination Algorithm 主講人:虞台文 大同大學資工所 智慧型多媒體研究室.
MURI Progress Report, June 2001 Advances in Approximate and Hybrid Reasoning for Decision Making Under Uncertainty Rina Dechter UC- Irvine Collaborators:
CSCE 582 Computation of the Most Probable Explanation in Bayesian Networks using Bucket Elimination -Hareesh Lingareddy University of South Carolina.
Constraint Optimization Presentation by Nathan Stender Chapter 13 of Constraint Processing by Rina Dechter 3/25/20131Constraint Optimization.
Exact Inference in Bayes Nets
Junction Trees And Belief Propagation. Junction Trees: Motivation What if we want to compute all marginals, not just one? Doing variable elimination for.
MPE, MAP AND APPROXIMATIONS Lecture 10: Statistical Methods in AI/ML Vibhav Gogate The University of Texas at Dallas Readings: AD Chapter 10.
Pearl’s Belief Propagation Algorithm Exact answers from tree-structured Bayesian networks Heavily based on slides by: Tomas Singliar,
GS 540 week 6. HMM basics Given a sequence, and state parameters: – Each possible path through the states has a certain probability of emitting the sequence.
1 Directional consistency Chapter 4 ICS-179 Spring 2010 ICS Graphical models.
Graphical Models - Inference - Wolfram Burgard, Luc De Raedt, Kristian Kersting, Bernhard Nebel Albert-Ludwigs University Freiburg, Germany PCWP CO HRBP.
1 Directional consistency Chapter 4 ICS-275 Spring 2007.
1 Exact Inference Algorithms for Probabilistic Reasoning; COMPSCI 276 Fall 2007.
Belief Propagation, Junction Trees, and Factor Graphs
SampleSearch: A scheme that searches for Consistent Samples Vibhav Gogate and Rina Dechter University of California, Irvine USA.
M. HardojoFriday, February 14, 2003 Directional Consistency Dechter, Chapter 4 1.Section 4.4: Width vs. Local Consistency Width-1 problems: DAC Width-2.
Tutorial #9 by Ma’ayan Fishelson
1 Hybrid of search and inference: time- space tradeoffs chapter 10 ICS-275A Fall 2003.
On the Power of Belief Propagation: A Constraint Propagation Perspective Rina Dechter Bozhena Bidyuk Robert Mateescu Emma Rollon.
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
AND/OR Search for Mixed Networks #CSP Robert Mateescu ICS280 Spring Current Topics in Graphical Models Professor Rina Dechter.
Inference in Gaussian and Hybrid Bayesian Networks ICS 275B.
Computing & Information Sciences Kansas State University Lecture 28 of 42 CIS 530 / 730 Artificial Intelligence Lecture 28 of 42 William H. Hsu Department.
Importance Sampling ICS 276 Fall 2007 Rina Dechter.
Ahsanul Haque *, Swarup Chandra *, Latifur Khan * and Charu Aggarwal + * Department of Computer Science, University of Texas at Dallas + IBM T. J. Watson.
Computing & Information Sciences Kansas State University Lecture 27 of 42 CIS 530 / 730 Artificial Intelligence Lecture 27 of 42 William H. Hsu Department.
Ahsanul Haque *, Swarup Chandra *, Latifur Khan * and Michael Baron + * Department of Computer Science, University of Texas at Dallas + Department of Mathematical.
Approximation Techniques for Automated Reasoning Irina Rish IBM T.J.Watson Research Center Rina Dechter University of California, Irvine.
Introduction to Bayesian Networks
1 Variable Elimination Graphical Models – Carlos Guestrin Carnegie Mellon University October 11 th, 2006 Readings: K&F: 8.1, 8.2, 8.3,
Generalizing Variable Elimination in Bayesian Networks 서울 시립대학원 전자 전기 컴퓨터 공학과 G 박민규.
Probabilistic Networks Chapter 14 of Dechter’s CP textbook Speaker: Daniel Geschwender April 1, 2013 April 1&3, 2013DanielG--Probabilistic Networks1.
Two Approximate Algorithms for Belief Updating Mini-Clustering - MC Robert Mateescu, Rina Dechter, Kalev Kask. "Tree Approximation for Belief Updating",
1 CMSC 671 Fall 2001 Class #21 – Tuesday, November 13.
1 Directional consistency Chapter 4 ICS-275 Spring 2009 ICS Constraint Networks.
Wednesday, January 29, 2003CSCE Spring 2003 B.Y. Choueiry Directional Consistency Chapter 4.
CPSC 422, Lecture 11Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Oct, 2, 2015.
Exact Inference in Bayes Nets. Notation U: set of nodes in a graph X i : random variable associated with node i π i : parents of node i Joint probability:
Advances in Bayesian Learning Learning and Inference in Bayesian Networks Irina Rish IBM T.J.Watson Research Center
Foundations of Constraint Processing, Spring 2009 Structure-Based Methods: An Introduction 1 Foundations of Constraint Processing CSCE421/821, Spring 2009.
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
Pattern Recognition and Machine Learning
1 Tutorial #9 by Ma’ayan Fishelson. 2 Bucket Elimination Algorithm An algorithm for performing inference in a Bayesian network. Similar algorithms can.
Daphne Koller Overview Conditional Probability Queries Probabilistic Graphical Models Inference.
Today Graphical Models Representing conditional dependence graphically
1 Structure Learning (The Good), The Bad, The Ugly Inference Graphical Models – Carlos Guestrin Carnegie Mellon University October 13 th, 2008 Readings:
Daphne Koller Overview Maximum a posteriori (MAP) Probabilistic Graphical Models Inference.
1 Variable Elimination Graphical Models – Carlos Guestrin Carnegie Mellon University October 15 th, 2008 Readings: K&F: 8.1, 8.2, 8.3,
Exact Inference 275b.
Instructor: Eyal Amir Grad TAs: Wen Pu, Yonatan Bisk
Inference in Bayesian Networks
Constraint Optimization And counting, and enumeration 275 class
Irina Rish IBM T.J.Watson Research Center
Exact Inference Continued
CSCI 5822 Probabilistic Models of Human and Machine Learning
Exact Inference ..
Class #19 – Tuesday, November 3
Expectation-Maximization & Belief Propagation
Class #16 – Tuesday, October 26
Lecture 3: Exact Inference in GMs
Clique Tree Algorithm: Computation
Variable Elimination Graphical Models – Carlos Guestrin
presented by Anton Bezuglov, Hrishikesh Goradia CSCE 582 Fall02
Iterative Join Graph Propagation
Presentation transcript:

1 Exact Inference Algorithms Bucket-elimination and more COMPSCI 179, Spring 2010 Set 8: Rina Dechter (Reading: chapter 14, Russell and Norvig

Counting How many people? SUM operator CHAINstructure

Maximization What is the maximum? MAX operator TREEstructure

12”14”15” S IIIIII P 60G80G H 6C9C B Min-Cost Assignment What is minimum cost configuration? 6C9C I3050 II4055 III∞60 IIIIII 12”45∞∞ 14” ”∞ G80G 12” ” ”50∞ IIIIII 12”75∞∞ 14” ”∞ ”14”15” II 30 I 60 III + MIN-SUMoperators CHAINstructure ” 75 12” ” ” 30 12” 50 15” +=

Belief Updating B uzz sound M echanical problem H igh temperature F aulty head R ead delays HP(H) FP(F) HFMP(M|H,F) FRP(R|F) P(F | B=1) = ? Mh 1 (M) HFMBel(M,H,F) Hh 2 (H) Fh 3 (F) Fh 4 (F) HFMP(M|H,F) ** = MBP(B|M) ** = FP(F,B=1) P(B=1) = Probability of evidence P(F=1|B=1) =.3725 Updated belief SUM-PRODoperators POLY-TREEstructure P(h,f,r,m,b) = P(h) P(f) P(m|h,f) P(r|f) P(b|m)

X YZ TRLM Belief updating (sum-prod)

X YZ TRLM MPE (max-prod)

CSP – consistency (projection-join) X YZ TRLM

X YZ TRLM #CSP (sum-prod)

X YZ TRLM Tree-solving Belief updating (sum-prod) MPE (max-prod) CSP – consistency (projection- join) #CSP (sum-prod)

Belief Propagation Instances of tree message passing algorithm Exact for trees Linear in the input size Importance: – One of the first algorithms for inference in Bayesian networks – Gives a cognitive dimension to its computations – Basis for conditioning algorithms for arbitrary Bayesian network – Basis for Loopy Belief Propagation (approximate algorithms) [Pearl, 1988]

21 Exact Inference Algorithms Bucket-elimination COMPSCI 179, Spring 2010 Set 8: Rina Dechter (Reading: chapter 14, Russell and Norvig

26 Belief Updating lung Cancer Smoking X-ray Bronchitis Dyspnoea P (lung cancer=yes | smoking=no, dyspnoea=yes ) = ?

27 Belief updating: P(X|evidence)=? “Moral” graph A D E C B P(a|e=0) P(a,e=0)= P(a)P(b|a)P(c|a)P(d|b,a)P(e|b,c)= P(a) P(b|a)P(d|b,a)P(e|b,c) BC ED Variable Elimination P(c|a)

28 Bucket elimination Algorithm BE-bel (Dechter 1996) Elimination operator P(a|e=0) W*=4 ”induced width” (max clique size) bucket B: P(a) P(c|a) P(b|a) P(d|b,a) P(e|b,c) bucket C: bucket D: bucket E: bucket A: e=0 B C D E A

29 “Moral” graph A D E C B

30 BE-BEL

IntelligenceDifficulty Grade Letter SAT Job Apply Student Network example P(J)?

36 E D C B A B C D E A

Fall 2003 ICS 275A - Constraint Networks 37 The induced-width width: is the max number of parents in the ordered graph Induced-width: width of induced graph: recursively connecting parents going from last node to first. Induced-width w*(d) = the max induced-width over all nodes Induced-width of a graph: max w*(d) over all d

38 Complexity of elimination The effect of the ordering: “Moral” graph A D E C B B C D E A E D C B A

39 More accurately: O(r exp(w*(d)) where r is the number of cpts. For Bayesian networks r=n. For Markov networks? BE-BEL

40

41 The impact of observations Moral graph Induced Moral graph Adjusted Graph for evidence in B Induced- adjusted.

42 Probabilistic Inference Tasks  Belief updating:  Finding most probable explanation (MPE)

43 Elimination operator MPE W*=4 ”induced width” (max clique size) bucket B: P(a) P(c|a) P(b|a) P(d|b,a) P(e|b,c) bucket C: bucket D: bucket E: bucket A: e=0 B C D E A Algorithm elim-mpe (Dechter 1996)

44 Generating the MPE-tuple C: E: P(b|a) P(d|b,a) P(e|b,c)B: D: A: P(a) P(c|a) e=0

12”14”15” S IIIIII P 60G80G H 6C9C B Min-Cost Assignment What is minimum cost configuration? 6C9C I3050 II4055 III∞60 IIIIII 12”45∞∞ 14” ”∞ G80G 12” ” ”50∞ IIIIII 12”75∞∞ 14” ”∞ ”14”15” II 30 I 60 III + MIN-SUMoperators CHAINstructure ” 75 12” ” ” 30 12” 50 15” +=

46

47 BE-MPE

48 Finding small induced-width NP-complete A tree has induced-width of ? Greedy algorithms: – Min width – Min induced-width – Max-cardinality – Fill-in (thought as the best) – See anytime min-width (Gogate and Dechter)