Download presentation
Presentation is loading. Please wait.
Published bySuparman Dharmawijaya Modified over 6 years ago
1
Dependence Dependence = NOT Independent Only 1 way to be independent
VS Fix this
2
Estimating Dependency and Significance for High-Dimensional Data
Michael R. Siracusa* Kinh Tieu*, Alexander T. Ihler §, John W. Fisher *§, Alan S. Willsky § * Computer Science and Artificial Intelligence Laboratory § Laboratory for Information and Decision Systems
3
Applications
4
Problem Statement Given N i.i.d. observations for K sources
Determine if the K sources are independent or not by Calculating some dependency measure Estimating the significance of this measurement
5
Hypothesis Test Two Hypotheses: Assuming we know the distributions:
Given N i.i.d. observations:
6
Factorization Test Two Factorizations:
But we don’t we know the distributions: Our best approximation (like GLR): Given N i.i.d. observations:
7
Factorization Test (cont)
Given N i.i.d. observations: True Joint Dist True Independent Dist True Independent Dist True Independent Dist
8
Factorization Test (cont)
For 2 variable case For 2 variable Gaussian case In general: Questions: How do we do density estimation? Can we compute this value when x is high dimensional How do we make our decision between F1 and F0
9
Sample Based Density Estimates
10
High Dimensional Data From the Data Processing Inequality: VS
11
High Dimensional Data (cont)
Sufficiency: For High dimensional data Maximize left side of bound Gaussian w/ Linear Projections Close form solution (Eigenvalue problem): Kullback 68 Nonparametric Gradient descent : Ihler and Fisher 03
12
Swiss Roll PCA 2D Projection MaxKL 2D Optimization 3D Data
13
Significance
14
More significance
15
Low Dim Latent Var Dependency via
Synthetic data Noise in High Dim Space High Dim Obs Distracter Low Dim Latent Var Dependency via M: Controls that number of dimensions dependency info is uniformly distributed over D: Controls the total dimensionality of our K observations
16
Experiments 100 Trial w/ Samples of Dependent Data
100 Trials w/ Samples of Independent Data Each trial gives a statistic and significance pf
17
Gaussian Data
18
Gaussian rho = .75
19
Three D Ball Data
21
Significance: Permutations good
22
Multi-camera
23
Conclusions Nice General Framework
Permutations allow us to draw independent samples Have shown cases where Gaussian assumptions will fail, and PCA is no good for dimensionality reduction
24
Future Work More experiments on real data
Better optimization procedure
27
Applications What Vision Problems Can We Solve w/ Accurate Measures of Dependency? Data Association, Correspondence Feature Selection Learning Structure More specifically we explicit address the following problems . The following have dependency has their main focus… We will specifically discuss: Correspondence (for multi-camera tracking) Audio-visual Association
28
Audio-Visual Association
Useful For: Speaker Localization - Help improve Human-Computer Interaction - Help Source Separation Automatic Transcription of Archival Video - Who is speaking? - Are they seen by the camera? I have been interested in Audio-visual association… Take alook at this video… who is speaking… now focus on the first person.. And raise your hand when he is speaking… So we see that even this a simple problem, but is also not so easy.. .but at it’s core is measure whether or not a single audio stream belongs to any of the video segements.. There are lots of complex things going on.. But how much work do we have to do to answer this simple question.. Hi.. I’m michael and I’m interested in multimodal data association.. Specifically for my masters I wroked on audio-visual data assocation. Take for example this toy problem… if we had audio, our task would be to identify which, if any of these videos lips is associated with the audio. This task is not so hard for humans and we would like the computer to be able to do it.. We have some basic questions, like how we should measure this assocaition and how well we can do with and without a model of human speech.. Ie. Treating it as a generic data association problem or using domain specific knowledge.
29
Multi-camera Tracking
30
Hypotheses Camera X Camera Y VS
31
Maximal Correspondence
32
Distributions of Transition Times
33
Discussion and Future Work
Dependence underlies various vision related problems. We studied a framework for measuring dependence. Measure significance (how confident are you) Make it more robust.
36
Math (oh no!) For 2 variable case
37
Outline Applications: (for computer vision)
Problem Formulation: (Hypothesis Testing) Computation: (Non-parametric entropy estimation) Curse of Dimensionality: (Informative Statistics) Correspondence: (Markov Chain Monte Carlo)
39
Question is not how to measure it .. It is that you should measure it.
What does all this mean.. 1 quesiton Are there principle ways of assessing dependency without explicitly choosing a model.
40
Previous Talks Greg: Model dependence between features and class
Kristen: Model dependence between features and a scene Ariadna: Model dependency between intra-class features Wanmei: Dependency between protocol signal and voxel response Chris: Audio and video dependence with events Antonio: Contextual Dependence Corey: “Inferring Dependencies” We Should understand the tools before we use them. Right Everyone? Certain things come up, KL Divergence, Measruing Correlation, Details about Density Estimation, some people throw some information theory at you. Devil is in the details.. .seems like everyone is worrying more about the specific details so we are going to explore the more general problem formuation. … our stuff is more directly related. (clustering, classification are other tools everyone uses) Some of these have a precise definition of dependence and a particular model, while others it’s a little more fuzzy and measuring dependence may just be some preprocessing step to setup the problem. The point is dependcy comes up over and over again and we would like a precise way to discuss it and some well understood tools to use to characterize dependence. Most people are comfortable with discusisng tools for classification or clustering… we want to be just as comfortable with discussing dependency.. .and particularly for our problems where charactierizing dependcy is the focus. Fundametnal question.. What does it mean to assess dependency We need to define it.. And learn how to compute At the end.. What is the strength and the nature of dependence We are dealing with problems with measurements that are high dimensional… probablistic models don’t fit into nice parameteric families..
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.