1
Lecture 9 Inexact Theories
2
Syllabus
Lecture 01: Describing Inverse Problems
Lecture 02: Probability and Measurement Error, Part 1
Lecture 03: Probability and Measurement Error, Part 2
Lecture 04: The L2 Norm and Simple Least Squares
Lecture 05: A Priori Information and Weighted Least Squares
Lecture 06: Resolution and Generalized Inverses
Lecture 07: Backus-Gilbert Inverse and the Trade-off of Resolution and Variance
Lecture 08: The Principle of Maximum Likelihood
Lecture 09: Inexact Theories
Lecture 10: Nonuniqueness and Localized Averages
Lecture 11: Vector Spaces and Singular Value Decomposition
Lecture 12: Equality and Inequality Constraints
Lecture 13: L1, L∞ Norm Problems and Linear Programming
Lecture 14: Nonlinear Problems: Grid and Monte Carlo Searches
Lecture 15: Nonlinear Problems: Newton's Method
Lecture 16: Nonlinear Problems: Simulated Annealing and Bootstrap Confidence Intervals
Lecture 17: Factor Analysis
Lecture 18: Varimax Factors, Empirical Orthogonal Functions
Lecture 19: Backus-Gilbert Theory for Continuous Problems; Radon's Problem
Lecture 20: Linear Operators and Their Adjoints
Lecture 21: Fréchet Derivatives
Lecture 22: Exemplary Inverse Problems, incl. Filter Design
Lecture 23: Exemplary Inverse Problems, incl. Earthquake Location
Lecture 24: Exemplary Inverse Problems, incl. Vibrational Problems
3
Purpose of the Lecture
- Discuss how an inexact theory can be represented
- Solve the inexact, linear Gaussian inverse problem
- Use maximization of relative entropy as a guiding principle for solving inverse problems
- Introduce the F-test as a way to determine whether one solution is "better" than another
4
Part 1: How Inexact Theories Can Be Represented
5
How do we generalize the case of an exact theory to one that is inexact?
6
[Figure: the exact theory case. In the (model m, datum d) plane, the theory d = g(m) is a curve; the a priori model m_ap and observed datum d_obs map onto the estimate m_est and prediction d_pre on the curve.]
7
[Figure: the same (m, d) construction as above.]
To make the theory inexact... we must make the theory probabilistic, or fuzzy.
8
[Figure: three panels in the (model m, datum d) plane: the a priori p.d.f. centered on (m_ap, d_obs), the fuzzy theory, and their combination, whose peak gives (m_est, d_pre).]
9
how do you combine two probability density functions?
10
so that the information in them is combined...
11
Desirable properties:
- order shouldn't matter
- combining something with the null distribution should leave it unchanged
- the combination should be invariant under change of variables
12
Answer: multiply them, dividing by the null p.d.f. The combined p.d.f. is proportional to the product of the individual p.d.f.s divided by p_N, which satisfies all three properties.
13
[Figure: panels (D) the a priori p.d.f. p_A, (E) the theory p.d.f. p_g, and (F) the total p.d.f. p_T, each in the (model m, datum d) plane; the peak of p_T gives (m_est, d_pre).]
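As a concrete illustration, here is a minimal MATLAB sketch that combines two one-dimensional Gaussian p.d.f.s by this rule; the means and widths are assumptions, and the null p.d.f. is taken to be uniform, so dividing by it only rescales:

% Combine two p.d.f.s: multiply, then renormalize. With a uniform
% null p.d.f., dividing by p_N only changes the normalization.
m = linspace(-5, 5, 1001)';
p1 = exp(-0.5*((m - 1.0)/0.8).^2);    % first p.d.f. (unnormalized)
p2 = exp(-0.5*((m + 0.5)/1.2).^2);    % second p.d.f. (unnormalized)
pT = p1 .* p2;                        % combine: order doesn't matter
pT = pT / trapz(m, pT);               % renormalize to unit area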
14
“Solution to the inverse problem”: the maximum likelihood point of p_T(m, d) (with p_N ∝ constant) simultaneously gives m_est and d_pre.
15
p_T(m, d): the probability that the estimated model parameters are near m and the predicted data are near d.
p_T(m) = ∫ p_T(m, d) dd: the probability that the estimated model parameters are near m, irrespective of the value of the predicted data.
16
Conceptual problem: p_T(m, d) and p_T(m) do not necessarily have maximum likelihood points at the same value of m.
17
[Figure: the joint p.d.f. in the (model m, datum d) plane peaks at (m_est, d_pre), while the marginal p(m) peaks at a different point, m_est′.]
18
This illustrates the problem of defining a definitive solution to an inverse problem.
19
Fortunately, if all the distributions are Gaussian, the two points are the same.
20
Part 2: Solution of the Inexact, Linear Gaussian Inverse Problem
21
Gaussian a priori information
22
p_A(m) ∝ exp[ −½ (m − ⟨m⟩)^T [cov m]_A^{−1} (m − ⟨m⟩) ], where ⟨m⟩ holds the a priori values of the model parameters and [cov m]_A their uncertainty.
23
Gaussian observations
24
p_A(d) ∝ exp[ −½ (d − d_obs)^T [cov d]_A^{−1} (d − d_obs) ], where d_obs is the observed data and [cov d]_A the measurement error.
25
Gaussian theory
26
p_g(m, d) ∝ exp[ −½ (d − Gm)^T [cov g]^{−1} (d − Gm) ], where d = Gm is the linear theory and [cov g] the uncertainty in the theory.
27
Mathematical statement of the problem: find (m, d) that maximizes p_T(m, d) = p_A(m) p_A(d) p_g(m, d), and, along the way, work out the form of p_T(m, d).
28
Notational simplification:
- group m and d into a single vector x = [d^T, m^T]^T
- group [cov m]_A and [cov d]_A into a single block-diagonal covariance matrix
- write d − Gm = 0 as Fx = 0, with F = [I, −G]
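A minimal MATLAB sketch of this grouping; the sizes N and M and all numerical values here are placeholder assumptions:

% Minimal sketch of the grouping, with toy sizes and placeholder values.
N = 3; M = 2;                   % numbers of data and model parameters
G = randn(N, M);                % linear theory matrix
dobs = randn(N, 1);             % observed data
mpri = zeros(M, 1);             % a priori model parameters <m>
covd = 0.10 * eye(N);           % [cov d]_A: measurement error
covm = 1.00 * eye(M);           % [cov m]_A: a priori uncertainty

x = [dobs; mpri];               % single vector x = [d; m]
covx = blkdiag(covd, covm);     % single block-diagonal covariance
F = [eye(N), -G];               % d - G m = 0 becomes F x = 0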
29
After much algebra, we find that p_T(x) is a Gaussian distribution with mean
x* = ⟨x⟩ − [cov x]_A F^T ( F [cov x]_A F^T + [cov g] )^{−1} F ⟨x⟩
and variance
[cov x]_A − [cov x]_A F^T ( F [cov x]_A F^T + [cov g] )^{−1} F [cov x]_A,
where ⟨x⟩ = [d_obs^T, ⟨m⟩^T]^T.
30
The mean x* of this Gaussian distribution is the solution to the inverse problem.
31
Pulling m_est out of x* gives
m_est = ⟨m⟩ + [cov m]_A G^T ( G [cov m]_A G^T + [cov d]_A + [cov g] )^{−1} ( d_obs − G⟨m⟩ )
32
reminiscent of the minimum length solution, G^T(GG^T)^{−1}
33
Note that the error in the theory adds to the error in the data: [cov d]_A + [cov g] appears where [cov d]_A alone would appear for an exact theory.
34
The solution depends on the values of the prior information only to the extent that the model resolution matrix differs from the identity matrix.
35
After further algebraic manipulation, this also equals
m_est = ⟨m⟩ + [ G^T ( [cov d]_A + [cov g] )^{−1} G + [cov m]_A^{−1} ]^{−1} G^T ( [cov d]_A + [cov g] )^{−1} ( d_obs − G⟨m⟩ ),
a form reminiscent of the least squares solution, (G^T G)^{−1} G^T.
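As a check on the equivalence, here is a MATLAB sketch that evaluates both forms on toy quantities; all sizes, values, and the theory covariance covg are assumptions:

% Sketch: the two equivalent forms of the inexact-theory estimate.
N = 3; M = 2;
G = randn(N, M);                 % linear theory matrix (toy values)
dobs = randn(N, 1);              % observed data (toy values)
mpri = zeros(M, 1);              % a priori model <m>
covd = 0.10 * eye(N);            % [cov d]_A: data covariance
covm = 1.00 * eye(M);            % [cov m]_A: a priori model covariance
covg = 0.05 * eye(N);            % [cov g]:   theory covariance
CD = covd + covg;                % error in theory adds to error in data

% covariance ("minimum length") form
mest1 = mpri + covm*G' / (G*covm*G' + CD) * (dobs - G*mpri);

% information ("least squares") form; equal by the matrix inversion lemma
mest2 = mpri + (G'/CD*G + inv(covm)) \ (G'/CD*(dobs - G*mpri));

disp(max(abs(mest1 - mest2)))    % agreement to numerical round-off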
36
Interesting aside: the weighted least squares solution is equal to the weighted minimum length solution.
37
What did we learn? For the linear Gaussian inverse problem, the inexactness of the theory just adds to the inexactness of the data.
38
Part 3: Maximization of Relative Entropy as a Guiding Principle for Solving Inverse Problems
39
from last lecture
40
Assessing the information content in p_A(m): do we know a little about m, or a lot about m?
41
Information gain, S = ∫ p_A(m) ln[ p_A(m) / p_N(m) ] dm; −S is called the relative entropy.
42
[Figure: (A) the a priori p.d.f. p_A(m), narrower than the null p.d.f. p_N(m); (B) the information gain S as a function of the width σ_A of p_A(m).]
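A minimal MATLAB sketch that evaluates the information gain numerically for a Gaussian p_A against a broader Gaussian null p_N; the widths and integration grid are assumptions:

% Sketch: information gain S = integral of pA ln(pA/pN) dm, evaluated
% numerically for a Gaussian pA against a broader Gaussian null pN.
m = linspace(-30, 30, 10001)';
sigA = 1; sigN = 10;                               % assumed widths
pA = exp(-0.5*(m/sigA).^2) / (sqrt(2*pi)*sigA);
pN = exp(-0.5*(m/sigN).^2) / (sqrt(2*pi)*sigN);
S = trapz(m, pA .* log(pA ./ pN));                 % information gain, >= 0
fprintf('information gain S = %.4f\n', S);

As panel (B) suggests, S shrinks toward zero as σ_A grows toward the width of the null distribution.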
43
Principle of Maximum Relative Entropy, or, if you prefer, Principle of Minimum Information Gain
44
Find the solution p.d.f. p_T(m) that has the smallest possible new information compared to the a priori p.d.f. p_A(m); or, if you prefer, find the solution p.d.f. p_T(m) that has the largest relative entropy compared to the a priori p.d.f. p_A(m).
46
Constraints: p_T(m) must be a properly normalized p.d.f., and the data must be satisfied in the mean, i.e. the expected value of the error must be zero.
47
After the minimization using the Lagrange multiplier process, p_T(m) is Gaussian, with a maximum likelihood point m_est.
48
This m_est is just the weighted minimum length solution.
49
What did we learn? Only that the Principle of Maximum Entropy is yet another way of deriving the inverse problem solutions we are already familiar with
50
Part 4: The F-test as a Way to Determine Whether One Solution Is “Better” Than Another
51
Common scenario: two different theories.
Theory A: solution m_est^A, with M_A model parameters and prediction error E_A.
Theory B: solution m_est^B, with M_B model parameters and prediction error E_B.
52
Suppose E_B < E_A. Is B really better than A?
53
What if B has many more model parameters than A, i.e. M_B >> M_A? Is it any surprise that B fits better?
54
We need to test against the Null Hypothesis: the difference in error is due to random variation.
55
Suppose the error e has a Gaussian p.d.f., uncorrelated, with uniform variance σ_d².
56
Estimate the variance from the prediction error: σ_est² = E / ν, with ν = N − M degrees of freedom.
57
We want to know the probability density function of the ratio of the two estimated variances.
58
Actually, we'll use the quantity F_est = (E_A / ν_A) / (E_B / ν_B), which is the same, as long as the two theories that we're testing are applied to the same data.
59
The p.d.f. of F is known.
[Figure: panels show p(F_{N,2}), p(F_{N,5}), p(F_{N,25}), and p(F_{N,50}) versus F, for N ranging from 2 to 50.]
60
as are its mean and variance
61
Example: the same dataset fit with a straight line and with a cubic polynomial.
62
[Figure: (A) linear fit to d_i(z_i), N − M = 9, E = 0.030; (B) cubic fit, N − M = 7, E = 0.006.]
63
[Figure: the same fits as above; the estimated ratio is F_{7,9}^est = 4.1.]
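A MATLAB sketch of the workflow behind this example; the dataset here is synthetic (the z grid, true coefficients, and noise level are assumptions), so the resulting F will generally differ from the slide's F_est = 4.1:

% Fit the same dataset with a straight line (M=2) and a cubic (M=4),
% then form the ratio F = (EA/vA)/(EB/vB). Data are synthetic.
N = 11;
z = linspace(0, 1, N)';
d = 0.5 + z + 0.3*z.^3 + 0.05*randn(N, 1);   % assumed synthetic data

GA = [ones(N,1), z];                   % linear theory, M_A = 2
GB = [ones(N,1), z, z.^2, z.^3];       % cubic theory,  M_B = 4
mLin = GA \ d;  EA = sum((d - GA*mLin).^2);  % prediction errors
mCub = GB \ d;  EB = sum((d - GB*mCub).^2);

vA = N - 2;  vB = N - 4;               % degrees of freedom: 9 and 7
Fobs = (EA/vA) / (EB/vB);              % estimated F ratio

The Fobs, vA, and vB computed here are what the fcdf expression on the later slide consumes.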
64
We want the probability that F > F_est (the cubic fit seems better than the linear fit) by random chance alone, or that F < 1/F_est (the linear fit seems better than the cubic fit) by random chance alone.
65
In MATLAB, with fcdf the F cumulative distribution function:
P = 1 - (fcdf(Fobs,vA,vB) - fcdf(1/Fobs,vA,vB));  % P(F > Fobs) + P(F < 1/Fobs)
66
Answer: 6%. The Null Hypothesis, that the difference is due to random variation, cannot be rejected at 95% confidence.