G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Statistical Data Analysis: Lecture 4 1Probability, Bayes’ theorem 2Random variables and.

Slides:



Advertisements
Similar presentations
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Advertisements

Probability Densities
G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1 Statistical Data Analysis: Lecture 2 1Probability, Bayes’ theorem 2Random variables and.
Estimation of parameters. Maximum likelihood What has happened was most likely.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistics.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
K. Desch – Statistical methods of data analysis SS10
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
2. Random variables  Introduction  Distribution of a random variable  Distribution function properties  Discrete random variables  Point mass  Discrete.
G. Cowan Lectures on Statistical Data Analysis Lecture 14 page 1 Statistical Data Analysis: Lecture 14 1Probability, Bayes’ theorem 2Random variables and.
Probability Distributions and Frequentist Statistics “A single death is a tragedy, a million deaths is a statistic” Joseph Stalin.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Introduction to Statistics − Day 2
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
Some standard univariate probability distributions
Lecture II-2: Probability Review
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
1 Probability and Statistics  What is probability?  What is statistics?
DATA ANALYSIS Module Code: CA660 Lecture Block 3.
Chapter 5 Statistical Models in Simulation
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis YETI IPPP Durham Young Experimentalists and.
Uniovi1 Some distributions Distribution/pdfExample use in HEP BinomialBranching ratio MultinomialHistogram with fixed N PoissonNumber of events found UniformMonte.
Overview course in Statistics (usually given in 26h, but now in 2h)  introduction of basic concepts of probability  concepts of parameter estimation.
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Moment Generating Functions
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
880.P20 Winter 2006 Richard Kass Binomial Probability Distribution For the binomial distribution P is the probability of m successes out of N trials. Here.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 1: Introduction 清华大学高能物理研究中心 2010 年 4 月 12—16 日 Glen Cowan.
G. Cowan Computing and Statistical Data Analysis / Stat 2 1 Computing and Statistical Data Analysis Stat 2: Catalogue of pdfs London Postgraduate Lectures.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
G. Cowan Statistical Data Analysis / Stat 1 1 Statistical Data Analysis 2015/16 London Postgraduate Lectures on Particle Physics; University of London.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
B AD 6243: Applied Univariate Statistics Data Distributions and Sampling Professor Laku Chidambaram Price College of Business University of Oklahoma.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis RWTH Aachen Graduiertenkolleg February, 2007.
G. Cowan Weizmann Statistics Workshop, 2015 / GDC Lecture 11 Statistical Methods for Particle Physics Lecture 1: introduction to frequentist statistics.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Aachen 2014 / Statistics for Particle Physics, Lecture 11 Statistical Methods for Particle Physics Lecture 1: probability, random variables, MC.
Sums of Random Variables and Long-Term Averages Sums of R.V. ‘s S n = X 1 + X X n of course.
1 Introduction to Statistical Methods for High Energy Physics Glen Cowan 2005 CERN Summer Student Lectures CERN Summer Student Lectures on Statistics Glen.
ASV Chapters 1 - Sample Spaces and Probabilities
Introduction to Statistics − Day 2
Appendix A: Probability Theory
Lectures on Statistical Data Analysis
Chapter 7: Sampling Distributions
Lecture 4 1 Probability (90 min.)
Lecture 3 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Statistics for Particle Physics Lecture 1: Fundamentals
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Statistical Data Analysis: Lecture 4 1Probability, Bayes’ theorem 2Random variables and probability densities 3Expectation values, error propagation 4Catalogue of pdfs 5The Monte Carlo method 6Statistical tests: general concepts 7Test statistics, multivariate methods 8Goodness-of-fit tests 9Parameter estimation, maximum likelihood 10More maximum likelihood 11Method of least squares 12Interval estimation, setting limits 13Nuisance parameters, systematic uncertainties 14Examples of Bayesian approach

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 2 Some distributions Distribution/pdfExample use in HEP BinomialBranching ratio MultinomialHistogram with fixed N PoissonNumber of events found UniformMonte Carlo method ExponentialDecay time GaussianMeasurement error Chi-squareGoodness-of-fit Cauchy Mass of resonance LandauIonization energy loss

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 3 Binomial distribution Consider N independent experiments (Bernoulli trials): outcome of each is ‘success’ or ‘failure’, probability of success on any given trial is p. Define discrete r.v. n = number of successes (0 ≤ n ≤ N). Probability of a specific outcome (in order), e.g. ‘ssfsf’ is But order not important; there are ways (permutations) to get n successes in N trials, total probability for n is sum of probabilities for each permutation.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 4 Binomial distribution (2) The binomial distribution is therefore random variable parameters For the expectation value and variance we find:

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 5 Binomial distribution (3) Binomial distribution for several values of the parameters: Example: observe N decays of W ±, the number n of which are W→  is a binomial r.v., p = branching ratio.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 6 Multinomial distribution Like binomial but now m outcomes instead of two, probabilities are For N trials we want the probability to obtain: n 1 of outcome 1, n 2 of outcome 2,  n m of outcome m. This is the multinomial distribution for

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 7 Multinomial distribution (2) Now consider outcome i as ‘success’, all others as ‘failure’. → all n i individually binomial with parameters N, p i for all i One can also find the covariance to be Example: represents a histogram with m bins, N total entries, all entries independent.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 8 Poisson distribution Consider binomial n in the limit → n follows the Poisson distribution: Example: number of scattering events n with cross section  found for a fixed integrated luminosity, with

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 9 Uniform distribution Consider a continuous r.v. x with  ∞ < x < ∞. Uniform pdf is: N.B. For any r.v. x with cumulative distribution F(x), y = F(x) is uniform in [0,1]. Example: for  0 → , E  is uniform in [E min, E max ], with

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 10 Exponential distribution The exponential pdf for the continuous r.v. x is defined by: Example: proper decay time t of an unstable particle (  = mean lifetime) Lack of memory (unique to exponential):

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 11 Gaussian distribution The Gaussian (normal) pdf for a continuous r.v. x is defined by: Special case:  = 0,  2 = 1 (‘standard Gaussian’): (N.B. often ,  2 denote mean, variance of any r.v., not only Gaussian.) If y ~ Gaussian with ,  2, then x = (y   ) /  follows  (x).

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 12 Gaussian pdf and the Central Limit Theorem The Gaussian pdf is so useful because almost any random variable that is a sum of a large number of small contributions follows it. This follows from the Central Limit Theorem: For n independent r.v.s x i with finite variances  i 2, otherwise arbitrary pdfs, consider the sum Measurement errors are often the sum of many contributions, so frequently measured values can be treated as Gaussian r.v.s. In the limit n → ∞, y is a Gaussian r.v. with

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 13 Central Limit Theorem (2) The CLT can be proved using characteristic functions (Fourier transforms), see, e.g., SDA Chapter 10. Good example: velocity component v x of air molecules. OK example: total deflection due to multiple Coulomb scattering. (Rare large angle deflections give non-Gaussian tail.) Bad example: energy loss of charged particle traversing thin gas layer. (Rare collisions make up large fraction of energy loss, cf. Landau pdf.) For finite n, the theorem is approximately valid to the extent that the fluctuation of the sum is not dominated by one (or few) terms. Beware of measurement errors with non-Gaussian tails.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 14 Multivariate Gaussian distribution Multivariate Gaussian pdf for the vector are column vectors, are transpose (row) vectors, For n = 2 this is where  = cov[x 1, x 2 ]/(  1  2 ) is the correlation coefficient.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 15 Chi-square (  2 ) distribution The chi-square pdf for the continuous r.v. z (z ≥ 0) is defined by n = 1, 2,... = number of ‘degrees of freedom’ (dof) For independent Gaussian x i, i = 1,..., n, means  i, variances  i 2, follows  2 pdf with n dof. Example: goodness-of-fit test variable especially in conjunction with method of least squares.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 16 Cauchy (Breit-Wigner) distribution The Breit-Wigner pdf for the continuous r.v. x is defined by  = 2, x 0 = 0 is the Cauchy pdf.) E[x] not well defined, V[x] →∞. x 0 = mode (most probable value)  = full width at half maximum Example: mass of resonance particle, e.g. , K *,  0,...  = decay rate (inverse of mean lifetime)

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 17 Landau distribution For a charged particle with  = v /c traversing a layer of matter of thickness d, the energy loss  follows the Landau pdf: L. Landau, J. Phys. USSR 8 (1944) 201; see also W. Allison and J. Cobb, Ann. Rev. Nucl. Part. Sci. 30 (1980) 253.    d 

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 18 Landau distribution (2) Long ‘Landau tail’ → all moments ∞ Mode (most probable value) sensitive to , → particle i.d.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 19 Beta distribution Often used to represent pdf of continuous r.v. nonzero only between finite limits.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 20 Gamma distribution Often used to represent pdf of continuous r.v. nonzero only in [0,∞]. Also e.g. sum of n exponential r.v.s or time until nth event in Poisson process ~ Gamma

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 21 Student's t distribution = number of degrees of freedom (not necessarily integer) = 1 gives Cauchy, → ∞ gives Gaussian.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 22 Student's t distribution (2) If x ~ Gaussian with  = 0,  2 = 1, and z ~  2 with n degrees of freedom, then t = x / (z/n) 1/2 follows Student's t with = n. This arises in problems where one forms the ratio of a sample mean to the sample standard deviation of Gaussian r.v.s. The Student's t provides a bell-shaped pdf with adjustable tails, ranging from those of a Gaussian, which fall off very quickly, (  → ∞, but in fact already very Gauss-like for  = two dozen), to the very long-tailed Cauchy ( = 1). Developed in 1908 by William Gosset, who worked under the pseudonym "Student" for the Guinness Brewery.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 23 Wrapping up lecture 4 We’ve looked at a number of important distributions: Binomial, Multinomial, Poisson, Uniform, Exponential Gaussian, Chi-square, Cauchy, Landau, Beta, Gamma, Student's t and we’ve seen the important Central Limit Theorem: explains why Gaussian r.v.s come up so often For a more complete catalogue see e.g. the handbook on statistical distributions by Christian Walck from