Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,

Slides:



Advertisements
Similar presentations
CS188: Computational Models of Human Behavior
Advertisements

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Bayesian Networks CSE 473. © Daniel S. Weld 2 Last Time Basic notions Atomic events Probabilities Joint distribution Inference by enumeration Independence.
For Monday Read chapter 18, sections 1-2 Homework: –Chapter 14, exercise 8 a-d.
Bayesian Networks VISA Hyoungjune Yi. BN – Intro. Introduced by Pearl (1986 ) Resembles human reasoning Causal relationship Decision support system/ Expert.
EE462 MLCV Lecture Introduction of Graphical Models Markov Random Fields Segmentation Tae-Kyun Kim 1.
CPSC 322, Lecture 26Slide 1 Reasoning Under Uncertainty: Belief Networks Computer Science cpsc322, Lecture 27 (Textbook Chpt 6.3) March, 16, 2009.
Bayesian Networks Chapter 2 (Duda et al.) – Section 2.11
Lec 5: April 11th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 12: May 4th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 11: May 2nd, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 2: March 30th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
PGM 2003/04 Tirgul 3-4 The Bayesian Network Representation.
Lec 6: April 13th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 3: April 4th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 9: April 25th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 16: May 25th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Graphical Models Lei Tang. Review of Graphical Models Directed Graph (DAG, Bayesian Network, Belief Network) Typically used to represent causal relationship.
Goal: Reconstruct Cellular Networks Biocarta. Conditions Genes.
5/25/2005EE562 EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 16, 6/1/2005 University of Washington, Department of Electrical Engineering Spring 2005.
Lec 10: April 26th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 8: April 20th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 4: April 6th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 7: April 18th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Today Logistic Regression Decision Trees Redux Graphical Models
Bayesian Networks Alan Ritter.
CPSC 422, Lecture 18Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18 Feb, 25, 2015 Slide Sources Raymond J. Mooney University of.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Lec 8: April 20th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
. DAGs, I-Maps, Factorization, d-Separation, Minimal I-Maps, Bayesian Networks Slides by Nir Friedman.
Lec 14: May 18th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
Lec 18: May 31st, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Lec 15: May 22nd, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring,
Advanced Artificial Intelligence
Machine Learning CUNY Graduate Center Lecture 21: Graphical Models.
Bayes’ Nets  A Bayes’ net is an efficient encoding of a probabilistic model of a domain  Questions we can ask:  Inference: given a fixed BN, what is.
A Brief Introduction to Graphical Models
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
第十讲 概率图模型导论 Chapter 10 Introduction to Probabilistic Graphical Models
For Wednesday Read Chapter 11, sections 1-2 Program 2 due.
Introduction to Bayesian Networks
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Announcements Project 4: Ghostbusters Homework 7
1 CMSC 671 Fall 2001 Class #21 – Tuesday, November 13.
Overview of the final test for CSC Overview PART A: 7 easy questions –You should answer 5 of them. If you answer more we will select 5 at random.
Lecture 2: Statistical learning primer for biologists
Exact Inference in Bayes Nets. Notation U: set of nodes in a graph X i : random variable associated with node i π i : parents of node i Joint probability:
Christopher M. Bishop, Pattern Recognition and Machine Learning 1.
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
1 BN Semantics 2 – Representation Theorem The revenge of d-separation Graphical Models – Carlos Guestrin Carnegie Mellon University September 17.
CPSC 322, Lecture 26Slide 1 Reasoning Under Uncertainty: Belief Networks Computer Science cpsc322, Lecture 27 (Textbook Chpt 6.3) Nov, 13, 2013.
1 BN Semantics 1 Graphical Models – Carlos Guestrin Carnegie Mellon University September 15 th, 2006 Readings: K&F: 3.1, 3.2, 3.3.
Chapter 12. Probability Reasoning Fall 2013 Comp3710 Artificial Intelligence Computing Science Thompson Rivers University.
CS 2750: Machine Learning Bayesian Networks Prof. Adriana Kovashka University of Pittsburgh March 14, 2016.
Bayesian Networks Chapter 2 (Duda et al.) – Section 2.11 CS479/679 Pattern Recognition Dr. George Bebis.
A Brief Introduction to Bayesian networks
Reasoning Under Uncertainty: Belief Networks
CS 2750: Machine Learning Directed Graphical Models
Qian Liu CSE spring University of Pennsylvania
Read R&N Ch Next lecture: Read R&N
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
Read R&N Ch Next lecture: Read R&N
CAP 5636 – Advanced Artificial Intelligence
CS 188: Artificial Intelligence
CS 188: Artificial Intelligence Spring 2007
CS 188: Artificial Intelligence Spring 2006
Warm-up as you walk in Each node in a Bayes net represents a conditional probability distribution. What distribution do you get when you multiply all of.
Chapter 14 February 26, 2004.
CS 188: Artificial Intelligence Fall 2008
Presentation transcript:

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 1 Jeff A. Bilmes University of Washington Department of Electrical Engineering EE512 Spring, 2006 Graphical Models Jeff A. Bilmes Lecture 1 Slides March 28 th, 2006

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 2 Class overview What are graphical models Semantics of Bayesian networks Outline of Today’s Lecture

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 3 Books and Sources for Today Jordan: Chapters 1 and 2

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 4 L1: Tues, 3/28: Overview, GMs, Intro BNs. L2: Thur, 3/30 L3: Tues, 4/4 L4: Thur, 4/6 L5: Tue, 4/11 L6: Thur, 4/13 L7: Tues, 4/18 L8: Thur, 4/20 L9: Tue, 4/25 L10: Thur, 4/27 L11: Tues, 5/2 L12: Thur, 5/4 L13: Tues, 5/9 L14: Thur, 5/11 L15: Tue, 5/16 L16: Thur, 5/18 L17: Tues, 5/23 L18: Thur, 5/25 L19: Tue, 5/30 L20: Thur, 6/1: final presentations Class Road Map

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 5 READING: Chapter 1,2 in Jordan’s book (pick up book from basement of communications copy center). List handout: name, department, and List handout: regular makeup slot, and discussion section Syllabus Course web page: Goal: powerpoint slides this quarter, they will be on the web page after lecture (but not before as you are getting them hot off the press). Announcements

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 6 A graphical model is a visual, abstract, and mathematically formal description of properties of families of probability distributions (densities, mass functions) There are many different types of Graphical model, ex: –Bayesian Networks –Markov Random Fields –Factor Graph –Chain Graph Graphical Models

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 7 Graphical Models Chain Graphs Causal Models DGMs UGMs Bayesian Networks MRFs Gibbs/Boltzman Distributions DBNs Mixture Models Decision Trees Simple Models PCA LDA HMM Factorial HMM/Mixed Memory Markov Models BMMs Kalman Other Semantics FST Dependency Networks Segment Models AR ZMs Factor Graphs GMs cover many well-known methods

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 8 Structure Algorithms Language Approximations Data-Bases Graphical Models Provide:

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 9 GMs give us: I.Structure: A method to explore the structure of “natural” phenomena (causal vs. correlated relations, properties of natural signals and scenes) II.Algorithms: A set of algorithms that provide “efficient” probabilistic inference and statistical decision making III.Language: A mathematically formal, abstract, visual language with which to efficiently discuss families of probabilistic models and their properties. Graphical Models Provide

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 10 GMs give us (cont): IV.Approximation: Methods to explore systems of approximation and their implications. E.g., what are the consequences of a (perhaps known to be) wrong assumption? V.Data-base: Provide a probabilistic “data-base” and corresponding “search algorithms” for making queries about properties in such model families. GMs Provide

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 11 GMs There are many different types of GM. Each GM has its semantics A GM (under the current semantics) is really a set of constraints. The GM represents all probability distributions that obey these constraints, including those that obey additional constraints (but not including those that obey fewer constraints). Most often, the constraints are some form of factorization property, e.g., f() factorizes (is composed of a product of factors of subsets of arguments).

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 12 Types of Queries Several types of queries we may be interested in: –Compute: p(one subset of vars) –Compute: p(one subset of vars| another subset of vars) –Find the N most probable configurations of one subset of variables given assignments of values to some other sets –Q: Is one subset independent of another subset? –Q: Is one subset independent of another given a third? How efficiently can we do this? Can this question be answered? What if it is too costly, can we approximate, and if so, how well? These are questions we will answer this term. GMs are like a probabilistic data-base (or data structure), a system that can be queried to provide answers to these sorts of questions.

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 13 Example Typical goal of pattern recognition: –training (say, EM or gradient descent), need query of form: In this form, we need to compute p(o,h) efficiently. –Bayes decision rule, need to find best class for a given unknown pattern: –but this is yet another query on a probability distribution. –We can train, and perform Bayes decision theory quickly if we can compute with probabilities quickly. Graphical models provide a way to reason about, and understand when this is possible, and if not, how to reasonably approximate.

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 14 Some Notation Random variables X i, Y i, X, Y (scalar or vector) Distributions: Subsets:

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 15 Main types of Graphical Models Markov Random Fields –a form of undirected graphical model –relatively simple to understand their semantics –also, log-linear models, Gibbs distributions, Boltzman distributions, many “exponential models”, conditional random fields (CRFs), etc. Bayesian networks –a form of directed graphical model –originally developed to represent a form of causality, but not ideal for that (they still represent factorization) –Semantics more interesting (but trickier) than MRFs Factor Graphs –pure, the assembly language models for factorization properties –came out of coding theory community (LDPC, Turbo codes)

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 16 Main types of Graphical Models Chain graphs: –Hybrid between Bayesian networks and MRFs –A set of clusters of undirected nodes connected as directed links –Not as widely used, but very powerful. Ancestral graphs –we probably won’t cover these.

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 17 C X Mixture models Q1Q1 Q2Q2 Q3Q3 Q4Q4 Markov Chains Q1Q1 Q2Q2 Q3Q3 Q4Q4 Bayesian Network Examples

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 18 X Y Other generalizations possible E.g., Q = gen. diagonal, or capture using general A since  Q R PCA: Q = , R  0, A = ortho FA: Q = I, R = diagonal GMs: PCA and Factor Analysis

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 19 X1X1 X2X2 X3X3 X4X4 I1I1 I2I2 The data X 1:4 is explained by the two (marginally) independent causes. Independent Component Analysis

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 20 C X   Class conditional data has diff. mean but common covariance matrix. Fisher’s formulation: project onto space spanned by the means. Linear Discriminant Analysis

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 21 C X   C X   M C X   M HDA/QDA MDA HMDA Extensions to LDA Heteroschedastic Discriminant Analysis/Quadratic Discriminant Analysis Mixture Discriminant Analsysis Both

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 22 Generalized Probabilistic Decision Trees Hierarchical Mixtures of Experts (Jordan) O I D1D1 D2D2 D3D3 Generalized Decision Trees

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 23 Example: Printer Troubleshooting Dechter

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 24 Classifier Combination X S C Mixture of Experts (Sum rule) X1X1 C Naive Bayes (prod. rule) X2X2 XNXN... Other Combination Schemes C C1C1 C2C2 X1X1 X2X2

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 25 C X C X Generative Model Discriminative Model Discriminative and Generative Models

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 26 HMMs/Kalman Filter Q1Q1 Q2Q2 Q3Q3 Q4Q4 X1X1 X2X2 X3X3 X4X4 Q1Q1 Q2Q2 Q3Q3 Q4Q4 X1X1 X2X2 X3X3 X4X4 HMM Autoregressive HMM

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 27 Switching Kalman Filter S1S1 S2S2 S3S3 S4S4 X1X1 X2X2 X3X3 X4X4 Q1Q1 Q2Q2 Q3Q3 Q4Q4

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 28 Factorial HMM Q1Q1 Q2Q2 Q3Q3 Q4Q4 X1X1 X2X2 X3X3 X4X4 Q’ 1 Q’ 2 Q’ 3 Q’ 4

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 29 Example: standard 4-gram Standard Language Modeling

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 30 Nothing gets zero probability Interpolated Uni-,Bi-,Tri-Grams

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 31 Conditional mixture tri-grams

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 32 Bayesian Networks … and so on We need to be more formal about what BNs mean. In the rest of this, and in the next, lecture, we start with basic semantics of BNs, move on to undirected models, and then come back to BNs again to clean up …

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 33 Bayesian Networks Has nothing to do with “Bayesian statistical models” (there are Bayesian and non-Bayesian Bayesian networks). valid:invalid:

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 34 Sub-family specification: Directed acyclic graph (DAG) Nodes - random variables Edges - direct “influence” Instantiation: Set of conditional probability distributions e b e be b b e BE P(A | E,B) Earthquake Radio Burglary Alarm Call Compact representation: factors of probabilities of children given parents (J. Pearl, 1988) Together (graph and inst.): Defines a unique distribution in a factored form BN: Alarm Network Example

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 35 Bayesian Networks Reminder: chain rule of probability. For any order 

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 36 Bayesian Networks

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 37 Conditional Independence

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 38 Conditional Independence & BNs GMs can help answer CI queries: Y D CE BZX A No!!

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 39 Bayesian Networks & CI

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 40 Bayesian Networks & CI

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 41 Bayesian Networks & CI

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 42 Bayesian Networks & CI

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 43 Pictorial d-separation: blocked/unblocked paths Blocked Paths Unblocked Paths

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 44 Three 3-node examples of BNs and their conditional independence statements. V1V1 V2V2 V3V3 V1V1 V2V2 V3V3 V1V1 V2V2 V3V3 Three Canonical Cases

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 45 Case 1 Markov Chain

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 46 Case 2 Still a Markov Chain

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 47 Case 3 NOT a Markov Chain

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 48 SUVs Greenhouse Gasses Global Warming Lung Cancer Smoking Bad Breath GeneticsCancerSmoking Examples of the three cases

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 49 Bayes Ball simple algorithm, ball bouncing along path in graph, to tell if path is blocked or not (more details in text).

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 50 What are implied conditional independences?

Lec 1: March 28th, 2006EE512 - Graphical Models - J. BilmesPage 51 Two Views of a Family