T. Mhamdi, O. Hasan and S. Tahar, HVG, Concordia University Montreal, Canada July 2010 On the Formalization of the Lebesgue Integration Theory in HOL On.

Slides:



Advertisements
Similar presentations
Random Processes Introduction (2)
Advertisements

Advanced topics in Financial Econometrics Bas Werker Tilburg University, SAMSI fellow.
B ETTI NUMBERS OF RANDOM SIMPLICIAL COMPLEXES MATTHEW KAHLE & ELIZABETH MECKE Presented by Ariel Szapiro.
Basic Structures: Sets, Functions, Sequences, Sums, and Matrices
Basic Structures: Sets, Functions, Sequences, Sums, and Matrices
Random Variable A random variable X is a function that assign a real number, X(ζ), to each outcome ζ in the sample space of a random experiment. Domain.
Week 21 Basic Set Theory A set is a collection of elements. Use capital letters, A, B, C to denotes sets and small letters a 1, a 2, … to denote the elements.
Lecture 6 Hyperreal Numbers (Nonstandard Analysis)
Independence of random variables
An Lp theory for outer measures. Application to singular integrals. Christoph Thiele Santander, September 2014.
1. Introduction Consistency of learning processes To explain when a learning machine that minimizes empirical risk can achieve a small value of actual.
By : L. Pour Mohammad Bagher Author : Vladimir N. Vapnik
Continuous Random Variables. For discrete random variables, we required that Y was limited to a finite (or countably infinite) set of values. Now, for.
13. The Weak Law and the Strong Law of Large Numbers
Math Camp 2: Probability Theory Sasha Rakhlin. Introduction  -algebra Measure Lebesgue measure Probability measure Expectation and variance Convergence.
Probability theory 2011 Convergence concepts in probability theory  Definitions and relations between convergence concepts  Sufficient conditions for.
The moment generating function of random variable X is given by Moment generating function.
1 Foundations of Interval Computation Trong Wu Phone: Department of Computer Science Southern Illinois University Edwardsville.
Lecture III. Uniform Probability Measure I think that Bieren’s discussion of the uniform probability measure provides a firm basis for the concept of.
Joint Distribution of two or More Random Variables
Hamid R. Rabiee Fall 2009 Stochastic Processes Review of Elementary Probability Lecture I.
Sets.
Theory of Integration in Statistics
Title: Functions, Limits and Continuity Prof. Dr. Nasima Akhter And Md. Masum Murshed Lecturer Department of Mathematics, R.U. 29 July, 2011, Friday Time:
Chapter 1 Probability and Distributions Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
A Brief Summary for Exam 1 Subject Topics Propositional Logic (sections 1.1, 1.2) –Propositions Statement, Truth value, Proposition, Propositional symbol,
IRDM WS Chapter 2: Basics from Probability Theory and Statistics 2.1 Probability Theory Events, Probabilities, Random Variables, Distributions,
2.4 Sequences and Summations
استاد : دکتر گلبابایی In detail this means three conditions:  1. f has to be defined at c.  2. the limit on the left hand side of that equation has.
Basic Concepts of Discrete Probability (Theory of Sets: Continuation) 1.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology The Weak Law and the Strong.
Random walk on Z: Simple random walk : Let be independent identically distributed random variables (iid) with only two possible values {1,-1},
Chapter 3 L p -space. Preliminaries on measure and integration.
Sets Defined A set is an object defined as a collection of other distinct objects, known as elements of the set The elements of a set can be anything:
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
MA5296 Lecture 1 Completion and Uniform Continuity Wayne M. Lawton Department of Mathematics National University of Singapore 2 Science Drive 2 Singapore.
Probability and Measure September 2, Nonparametric Bayesian Fundamental Problem: Estimating Distribution from a collection of Data E. ( X a distribution-valued.
One Random Variable Random Process.
CompSci 102 Discrete Math for Computer Science
Lecture V Probability theory. Lecture questions Classical definition of probability Frequency probability Discrete variable and probability distribution.
Section 2.1. Section Summary Definition of sets Describing Sets Roster Method Set-Builder Notation Some Important Sets in Mathematics Empty Set and Universal.
1 Introduction to Abstract Mathematics Sets Section 2.1 Basic Notions of Sets Section 2.2 Operations with sets Section 2.3 Indexed Sets Instructor: Hayk.
Basic Principles (continuation) 1. A Quantitative Measure of Information As we already have realized, when a statistical experiment has n eqiuprobable.
Sets 2/10/121. What is a Set? Informally, a collection of objects, determined by its members, treated as a single mathematical object Not a real definition:
Week 121 Law of Large Numbers Toss a coin n times. Suppose X i ’s are Bernoulli random variables with p = ½ and E(X i ) = ½. The proportion of heads is.
Chapter 2 With Question/Answer Animations. Section 2.1.
1.6 Change of Measure 1. Introduction We used a positive random variable Z to change probability measures on a space Ω. is risk-neutral probability measure.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
Random Variables. Numerical Outcomes Consider associating a numerical value with each sample point in a sample space. (1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
Week 111 Some facts about Power Series Consider the power series with non-negative coefficients a k. If converges for any positive value of t, say for.
Chapter 6 Large Random Samples Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :
Chebyshev’s Inequality Markov’s Inequality Proposition 2.1.
Summary of the Last Lecture This is our second lecture. In our first lecture, we discussed The vector spaces briefly and proved some basic inequalities.
Sums of Random Variables and Long-Term Averages Sums of R.V. ‘s S n = X 1 + X X n of course.
Chapter 2 1. Chapter Summary Sets (This Slide) The Language of Sets - Sec 2.1 – Lecture 8 Set Operations and Set Identities - Sec 2.2 – Lecture 9 Functions.
Department of Mathematics
11.1 Sequences. A sequence is a list of numbers written in an explicit order. n th term Any real-valued function with domain a subset of the positive.
Primbs, MS&E345 1 Measure Theory in a Lecture. Primbs, MS&E345 2 Perspective  -Algebras Measurable Functions Measure and Integration Radon-Nikodym Theorem.
Department of Mathematics
What is Probability? Quantification of uncertainty.
CONTINUOUS RANDOM VARIABLES
Lebesgue measure: Lebesgue measure m0 is a measure on i.e., 1. 2.
Chapter 3 The Real Numbers.
Introduction to Real Analysis
A Brief Summary for Exam 1
13. The Weak Law and the Strong Law of Large Numbers
Chapter 3 The Real Numbers.
Chapter 5 Limits and Continuity.
Experiments, Outcomes, Events and Random Variables: A Revisit
13. The Weak Law and the Strong Law of Large Numbers
Presentation transcript:

T. Mhamdi, O. Hasan and S. Tahar, HVG, Concordia University Montreal, Canada July 2010 On the Formalization of the Lebesgue Integration Theory in HOL On the Formalization of the Lebesgue Integration Theory in HOL

2 Introduction Related Work Measure Theory Borel Sigma Algebra Lebesgue Integration Applications Conclusion and Future work Outline

3 Fundamental concept in many mathematical theories –Real Analysis –Probability theory –Information theory Model and reason about continuous and unpredictable components of physical systems. Lebesgue Integral Probability theory Expectation Variance Density functions Probability theory Expectation Variance Density functions Information theory Entropy Relative Entropy KL divergence Information theory Entropy Relative Entropy KL divergence Physical systems Introduction

4 Advantages of the Lebesgue integral –Handles a larger class of functions –Better behavior for interchanging limits and integrals –Unified definition for discrete and continuous cases –Defined for arbitrary measures Lebesgue Integral: partition the range Reimann Integral: partition the domain Introduction

5 Hurd ( PhD Thesis, Cambridge UK ) –Formalization of measure theory in HOL –A measure space is a pair (A,µ) –The space is implicitly the universal set of the appropriate type. –Restriction of the measure spaces that can be constructed. –To apply the formalization for an arbitrary space X, a new HOL type should be defined as well as operations on this type and their properties must be proven. –Considerable effort that needs to be done for every space of interest. Related Work

6 Richter ( TPHOLs'08 ) –Formalized measure theory in Isabel/HOL based on Hurd’s. –Same limitation as the work of Hurd. –Defined the Borel sigma algebra based on the intervals and used it to formalize the Lebesgue integral. –We provide a formalization that is based on the open sets: Can be used for any topological space: real numbers, complex numbers, n-dimensional Euclidian space,… Related Work

7 Hasan (Mathematical Methods in the Applied Sciences ) –Builds on top of Hurd’s work –Formalized continuous random variables –Formalized statistical properties for discrete random variables. –Valid for specific distributions. –Case studies (coupon collector's problem, stop-and-wait protocol). Related Work

8 Coble (PhD Thesis, Cambridge UK) –Generalized the measure theory of Hurd –A measure space is a triplet (X,A,µ) –Formalized the Lebesgue integral –Proved properties of the Lebesgue integral only for positive simple functions. –Formalization does not include the Borel sigma algebra. –No properties of measurable functions. –No properties of the Lebesgue integral for arbitrary functions. Related Work

9 Formalize of the measure theory based on the work of Coble. Extend it by formalizing the Borel sigma algebra. Borel Sigma algebra is generated by the open sets. We prove that our definition is equivalent to the definition of Richter in the special case where the space is the set of real numbers. Using the formalization of the Borel sigma algebra, we prove –properties of real-valued measurable functions –properties of the Lebesgue integral Proposed Solution

10 Basic definitions Measure Theory Sigma Algebra: A collection A of subsets of a space X defines a sigma algebra on X iff its contains the empty set and is closed under countable unions and complementation within the space X Measurable Space (X,A) Measure Function Measure Space (X,A,µ) Measurable Functions Generated Sigma Algebra σ(X,G)

11 Borel Sigma Algebra Definition The Borel sigma algebra on X is the sigma algebra generated by the open sets of X. Example: Borel sigma algebra on the real line B(R). Theories needed to formalize B(R) Topology: to define the notions of neighborhoods, open sets and prove necessary theorems. Rational Numbers: to prove the properties of real-valued measurable functions.

12 Topology –A formalization of some concepts of topology is available in HOL –Developed by John Harrison. –It does not use the set theory operations (SUBSET, UNION, INTER, …) –It does not include needed theorems. –A much more complete theory was developed by John Harrison for Hol Light. –We formalized the notions of neighborhoods, open sets and proved some of their properties. Borel Sigma Algebra

13 Borel Sigma Algebra The inverse image of an open set by a continuous function is open Any open set of R can be written as a countable union of open intervals. Neighborhood: Open set of R: The formalization is not complete, we developed only the concepts we needed.

14 Rational Numbers –The main results that we need are: The set of rational numbers is countable, infinite The set of rational numbers is dense is R: For all x,y in R such that x < y, there exists a rational number r such that x < r < y The set of open intervals with rational end-points is countable. –Q as a subset of R. –None of this is available in the existing rational theory in HOL. Borel Sigma Algebra

15 Rational Numbers Borel Sigma Algebra The formalization is fairly rich. Set of rational numbers: Q is dense in R: Properties of rational numbers: -x, x+y, x*y, 1/y, x/y

16 Theorem: B(R) is generated by the open intervals ]c,d[. It is also generated by any of the following classes of intervals ]-∞,c[, [c,+∞[, ]c,+∞[, ]-∞,c], [c,d[, ]c,d], [c,d]. Borel Sigma Algebra Theorem (real-valued measurable functions): Let (X,A) be a measurable space, A function f: X→R is measurable with respect to (A,B(R)) iff f-1(]-∞,c[) A, for all c R.

17 Properties of real-valued measurable functions If f and g are measurable, then the following functions are also measurable: cf, |f|, f n, f+g, fg, max(f,g) - Most challenging property: sum of measurable functions - Main reason we formalized the rational numbers. Borel Sigma Algebra

18 Borel Sigma Algebra Every continuous function g: R → R is (B(R),B(R)) measurable. If g: R → R is continuous and f is (A,B(R)) measurable then so is g o f. If (f n ) is a monotonically increasing sequence of real-valued measurable functions with respect to (A,B(R)) and point-wise convergent to a function f then f is also (A,B(R)) measurable. Properties of real-valued measurable functions

19 Lebesgue Integral Positive Simple Functions Lebesgue Integral of Positive Simple Functions Lebesgue Integral of non-negative Functions Lebesgue Integral of Arbitrary Functions where, and

20 Lebesgue Integral Problem: Not possible to prove some of the Lebesgue integral properties using the definition. Solution: Prove the properties for positive simple functions. If g is a non-negative function, find a sequence of positive simple functions that is pointwise convergent to g. Extend the properties to non-negative functions. Extend the properties for the non-negative functions f + and f - to arbitrary functions. Definition: Lebesgue Integrable Functions Let (X,A,µ) be a measure space, a measurable function f is integrable iff Equivalently,

21 Lebesgue Integral Theorem: Integrability Theorem For any non-negative integrable function f, there exists a monotonically increasing sequence of positive simple functions (f n ) that is pointwise convergent to f. Besides, Proof: Upper bound Pointwise convergence Monotonicity

22 Lebesgue Integral Lebesgue Integral Properties Let f and g be integrable functions and c R, then Issue: Some properties does not require integrable functions.

23 Lebesgue Integral Almost Everywhere Definition: A property is true almost everywhere relative to the measure µ, iff the subset where the property does not hold is a null set. Theorems: Lebesgue Monotone Convergence Let f be an integrable function and (f n ) be a monotonically increasing sequence of non-negative functions, point-wise convergent to f then For example: f = g a.e. when the set is a null set. A is a null set f = g a.e. f ≤ g a.e.

24 Applications We prove important properties from the theory of probability, namely, the Chebyshev and Markov inequalities and the Weak Law of Large Numbers. Provide estimates of tail probabilities. The Chebyshev inequality guarantees that nearly all values are close to the mean. The Markov inequality provides loose yet useful bounds for the CDF. The WLLN is used in a multitude of fields. It is used, for instance, to prove the Asymptotic Equipartition Property, a fundamental concept in information theory.

25 Applications Let X be a random variable with expectation m and finite variance σ 2. Then for all k > 0, Chebyshev Inequality Markov Inequality Theorem: Let (S,A,µ) be a measure space and f a measurable function defined on S. Then for any non-negative function g, non-decreasing on the range of f, then for all t>0,

26 Applications Weak Law of Large Numbers Let X 1,X 2,… be a sequence of iid random variables with finite expectation m. Let, then for any, Chebyshev Inequality t = k σ f = |X-m| g(t) = t 2 if t ≥ 0 and 0 otherwise. Markov Inequality t = k f = |X| g(t) = t if t ≥ 0 and 0 otherwise. Proof: Chebyshev inequality, Properties of the Expectation and Variance

27 Conclusion Measure Theory Borel Sigma Algebra Properties of real-valued measurable functions Lebesgue Integration Properties of Lebesgue Integral Classical inequalities Weak Law of Large Numbers Formalization required more than 7000 lines of code. Only 250 lines to verify the properties of the applications section ~400 man hours.

28 Ongoing and Future Work Radon Nikodym Theorem Asymptotic Equipartition Property Source Coding Theorem

29 Thank you!