Probability distributions and likelihood

Slides:



Advertisements
Similar presentations
Chapter 3 Some Special Distributions Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
Advertisements

458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
Discrete Probability Distributions
Distributions Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
The Triangle of Statistical Inference: Likelihoood
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions 
Binomial Probability Distribution
The Triangle of Statistical Inference: Likelihoood Data Scientific Model Probability Model Inference.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont’d) Instructor: Prof. Johnny Luo
1 Keep Life Simple! We live and work and dream, Each has his little scheme, Sometimes we laugh; sometimes we cry, And thus the days go by.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Review. Common probability distributions Discrete: binomial, Poisson, negative binomial, multinomial Continuous: normal, lognormal, beta, gamma, (negative.
Nonlinear function minimization (review). Newton’s minimization method Ecological detective p. 267 Isaac Newton We want to find the minimum value of f(x)
Theoretical distributions: the other distributions.
Chapter 6: Probability. Probability Probability is a method for measuring and quantifying the likelihood of obtaining a specific sample from a specific.
Probability Distributions  A variable (A, B, x, y, etc.) can take any of a specified set of values.  When the value of a variable is the outcome of a.
INTRODUCTION TO ECONOMIC STATISTICS Topic 5 Discrete Random Variables These slides are copyright © 2010 by Tavis Barr. This work is licensed under a Creative.
12. Discrete probability distributions
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 4 Applied Statistics and Probability for Engineers
Modeling and Simulation CS 313
Introduction to Probability - III John Rundle Econophysics PHYS 250
Introductory Statistics and Data Analysis
CHAPTER 6 Random Variables
Chapter 14 Fitting Probability Distributions
Lecture Slides Elementary Statistics Eleventh Edition
Continuous Probability Distributions Part 2
Logistic Regression APKC – STATS AFAC (2016).
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Probability and Statistics for Particle Physics
Hypothesis Testing: One Sample Cases
CHAPTER 14: Binomial Distributions*
Chapter 4 Probability Distributions
Modeling and Simulation CS 313
Discrete random variable X Examples: shoe size, dosage (mg), # cells,…
Binomial Distribution
Virtual University of Pakistan
Chapter 3 Discrete Random Variables and Probability Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Multinomial Distribution
Maximum Likelihood Find the parameters of a model that best fit the data… Forms the foundation of Bayesian inference Slide 1.
Chapter 3 Discrete Random Variables and Probability Distributions
Chapter 5 Some Important Discrete Probability Distributions
Discrete random variable X Examples: shoe size, dosage (mg), # cells,…
Chapter 5 Sampling Distributions
Discrete Probability Distributions
Probability distributions
Multinomial Experiments
Example Human males have one X-chromosome and one Y-chromosome,
Probability Theory and Specific Distributions (Moore Ch5 and Guan Ch6)
Some Discrete Probability Distributions Part 2
Lecture Slides Elementary Statistics Twelfth Edition
Discrete Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions
Lecture Slides Elementary Statistics Twelfth Edition
Discrete Probability Distributions
Introduction to Estimation
Discrete Probability Distributions
Continuous Probability Distributions Part 2
Multinomial Experiments
Multinomial Experiments
Multinomial Experiments
Discrete Probability Distributions
Review of the Binomial Distribution
Presentation transcript:

Probability distributions and likelihood

Readings Ecological detective: Wikipedia (seriously!) Chapter 3 Probability distributions Wikipedia (seriously!) e.g. Beta distribution, lognormal distribution, etc.

Overview (three lectures) Probability vs. likelihood Probability distributions: binomial, poisson, normal, lognormal, negative binomial, beta, gamma, multinomial Likelihood profile The concept of support Model selection, likelihood ratio, AIC Robustness Contradictory data

Probability Likelihood If I flip a fair coin 10 times, what is the probability of it landing heads up every time? Given the fixed parameter (p = 0.5), what is the probability of different outcomes? Probabilities add up to 1. I flipped a coin 10 times and obtained 10 heads. What is the likelihood that the coin is fair? Given the fixed outcomes (data), what is the likelihood of different parameter values? Likelihoods do not add up to 1. Hypotheses (parameter values) are compared using likelihood values (higher = better).

Probability Likelihood Area under curve between 5 and 10 Height of curve at x = 10 Height of curve at x = 14 What is the probability that 5 ≤ x ≤ 10 given a normal distribution with µ = 13 and σ = 4? Answer: 0.204 What is the probability that –1000 ≤ x ≤ 1000 given a normal distribution with µ = 13 and σ = 4? Answer: 1.000 What is the likelihood that µ = 13 and σ = 4 if you observed a value of x = 10 (answer: the likelihood is 0.075) x = 14 (answer: the likelihood is 0.097) Conclusion: if the observed value was 14, it is more likely that the parameters are µ = 13 and σ = 4, because 0.097 is higher than 0.075.

We use the same (in this case normal) probability distribution function for both probability and likelihood! Likelihood is the height, probability is the area

Common probability distributions Discrete: binomial, Poisson, negative binomial, multinomial Continuous: normal, lognormal, beta, gamma, (negative binomial) For all of these the Excel sheet gives the likelihood at each value of x, given the parameters of the distribution

7 Statistical distributions 7 Statistical distributions.xlsx Many distributions defined here, including Excel functions and functions defined directly in the spreadsheet

SD and CV (all distributions)

Binomial probability distribution Number of trials Number of successes Probability of success [0,1] The “factorial term” How many ways are there of selecting k objects from among N objects Example: probability of getting k = 5 heads when flipping a coin N = 10 times, if the coin is fair (p = 0.5). Note: known number of trials. 7 Statistical distributions.xlsx, sheet Binomial

Poisson probability distribution Expected number of events Number of events Example: On average there are λ = 9.4 fatal traffic accidents in Washington State every week. What is the probability that there would be k = 0 in a week? (Note: rare event out of large number of possible events.) 7 Statistical distributions.xlsx, sheet Poisson

Limitations of Poisson Has only one parameter, which is both the mean and the variance We often have discrete count data, but in real-life data the variance is often larger than predicted by the Poisson

Thus we often use the negative binomial Closely related to the Poisson and binomial One extra parameter related to the variance VERY useful

Standard negative binomial Number of failures Probability of a success Number of successes Squint a lot and this looks kind of like a binomial Example: a factory makes widgets successfully with probability p. How many successful widgets have been made when r = 3 failed widgets have been made? The distribution predicts the probability of k = 0, 1, 2, … successful widgets being made. 7 Statistical distributions.xlsx, sheet Neg binomial (std)

Ecological usefulness? Great for factories, but almost no ecological problems can be thought of as successes or failures in this way Great for factory production problems But we want a function with parameters for Mean Overdispersion (increased variance = increased chance of extreme events) Integer events are rare in nature, we want to deal with real numbers

Practitioner’s negative binomial Gamma function (factorial that accepts non-integers, see later) Overdispersion parameter Predicted mean As θ increases, variance increases, hence “overdispersion” As θ → , var(Z) →  As θ → 0, var(Z) = λ, just like a Poisson! Example: our data contain observations k, with mean λ and variance greater than λ. Find the value of overdispersion θ that best accounts for this increased variance. 7 Statistical distributions.xlsx, sheet Neg binomial (practitioner)

Weird facts about the practitioner’s negative binomial When θ → 0 this doesn’t just smell like a Poisson, and act like a Poisson, it is the Poisson By replacing the factorials with gamma functions, the r and k can be real numbers not just integers What on earth is a gamma function???

Gamma function Γ() For background information only: this is a generalized factorial function that accepts real numbers not just integers Excel: does not have a gamma function but has a ln of gamma function (GAMMALN)

Multinomial probability distribution Observed number in category k Total number observed Model-predicted proportion in category k Example: fitting a model to proportions at age (or proportions at length) data. Model produces predicted proportions pi and data gives observed numbers xi in each category. Total numbers sampled = n = x1 + x2+ … + xk 7 Statistical distributions.xlsx, sheet Multinomial

7 Statistical distributions.xlsx, sheet Multinomial

Unrealism of multinomial (and other distributions too!) Assumes every sampling event is completely independent But there is much correlation in reality Same trawl, area, time of day, day of year, gender, etc. Real data never ever fit a multinomial this well Later lectures will introduce the concept of “effective sample size” neff, which will be smaller than reported sample size n.

Normal distribution 7 Statistical distributions.xlsx, sheet Normal

Lognormal distribution 7 Statistical distributions.xlsx, sheet Lognormal

Lognormal: key notes 0 < x <  Mean(x) is not µ If we want the mean to be µ, then replace the model parameter with: Used widely for abundance and biomass

Beta distribution 0.5,0.5 1,1 1.3,1.3 4,4 0.5,2 2,6 50,50 7 Statistical distributions.xlsx, sheet Beta

Beta: key notes Values confined to be 0 < x < 1 Can mimic almost any shape within those bounds Although bounded, can change the bounds by multiplying / dividing x values E.g. survival parameters

Gamma distribution 4, 1 4, 2 1.1, 0.5 0.9, 0.0001 60, 5 7 Statistical distributions.xlsx, sheet Gamma

Gamma: key notes 0 ≤ x <  Somewhat like an exponential, lognormal, or normal Flexibility without being bounded like the beta distribution E.g. salmon arrival numbers plotted over time Excel function beta.dist() assumes parameters α* = α and β* =1/β