Download presentation
Presentation is loading. Please wait.
Published byDominic Ross Modified over 8 years ago
1
Review
2
Common probability distributions Discrete: binomial, Poisson, negative binomial, multinomial Continuous: normal, lognormal, beta, gamma, (negative binomial) Experiment with the distributions in Excel sheet “7 distributions.xlsx”
3
Beta distribution 0.5,0.5 1,1 1.3,1.3 4,4 50,50 2,60.5,2 Beta function
4
Beta: key notes Values confined to be 0 < x < 1 Can mimic almost any shape within those bounds Although bounded, can change the bounds by multiplying / dividing x values E.g. survival parameters
5
Gamma distribution 4, 1 4, 2 1.1, 0.5 0.9, 0.0001 60, 5
6
Gamma: key notes 0 ≤ x < ∞ Somewhat like an exponential, lognormal, or normal Flexibility without being bounded like the beta distribution E.g. salmon arrival numbers plotted over time Excel function beta.dist() assumes parameters α* = α and β* =1/β
7
Likelihood case studies
8
Reading Ecological detective: – Chapter 7 Likelihood and maximum likelihood
9
Probability If I flip a fair coin 10 times, what is the probability of it landing heads up every time? Given the fixed parameter (p = 0.5), what is the probability of different outcomes? Probabilities add up to 1. I flipped a coin 10 times and obtained 10 heads. What is the likelihood that the coin is fair? Given the fixed outcomes (data), what is the likelihood of different parameter values? Likelihoods do not add up to 1. Hypotheses (parameter values) are compared using likelihood values (higher = better). Likelihood
10
Probability What is the probability that 5 ≤ x ≤ 10 given a normal distribution with µ = 13 and σ = 4? Answer: 0.204 What is the probability that –1000 ≤ x ≤ 1000 given a normal distribution with µ = 13 and σ = 4? Answer: 1.000 What is the likelihood that µ = 13 and σ = 4 if you observed a value of (a) x = 10 (answer: the likelihood is 0.075) (b) x = 14 (answer: the likelihood is 0.097) Conclusion: if the observed value was 14, it is more likely that the parameters are µ = 13 and σ = 4, because 0.097 is higher than 0.075. Likelihood Area under curve between 5 and 10 Height of curve at x = 14 Height of curve at x = 10
11
ProbabilityLikelihood Notation What is the probability of observing a variety of data Y i given fixed parameter value p? What is the likelihood of different hypotheses about a variety of parameter values p m given that you observed fixed data Y
12
Probability (binomial) You are studying birth rates in a sea otter population with N = 30 females. During a study season each female either gives birth or does not give birth. Probability: it is (somehow) known that birth rates are p = 0.7. What is the probability that ≤10 of the females give birth? Answer: sum the probabilities for x ≤ 10; giving 0.000037. 8 Examples.xlsx, sheet Sea otter prob
13
Likelihood (binomial) You are studying birth rates in a sea otter population with N = 30 females. During a study season each female either gives birth or does not give birth. Likelihood: 10 females give birth; 20 do not. What is the likelihood that p = 0.3? 0.5? 0.7? Answer: p = 0.3: L = 0.141 p = 0.5: L = 0.028 p = 0.7: L = 0.00003 The MLE is p = 10/30 = 0.3333 L = 0.153 8 Examples.xlsx, sheet Sea otter like
14
Review of logarithms
15
Negative log likelihood (normal) Minimizing sum of squares, weighted by the variance Model-predicted mean Model-predicted standard deviation Data
16
Omit constants Include constants x at same minimum 8 Examples.xlsx, sheet NLL vs. Like Guess the likelihood that x = 25 Answer: 6.6×10 -10 Guess the likelihood that x = 51 Answer: 6.8×10 -82
17
Multiple observations If observations (data points) are independent then we can multiply the likelihoods together Or add up the negative log likelihoods
18
Multiple observations The sea otter study has been extended to three field seasons, based on the same N = 30 females. In these three seasons, 10, 20, and 13 females gave birth respectively. Assuming independence between years, what is the overall likelihood that p = 0.3? 0.5? 0.7? YearObservedp = 0.3p = 0.5p = 0.7p = 0.46 1100.141560.027980.000030.06057 2180.000460.080550.074850.04190 3130.044420.111540.001500.14144 L1×L2×L3L1×L2×L3 0.00000290.00025140.00000000.0003589 Total NLL12.758.2919.527.93 Scaled L0.008130.700450.000011.00000 MLE Binomial likelihood for p in that year given the observed data Product of likelihood Likelihood divided by MLE likelihood 8 Examples.xlsx, sheet Sea otter 3 seasons
19
Mark-recapture example We tagged 100 fish Went back a few days later (after mixing etc.) And recaptured 100 fish λ=5 recaptures were tagged We use Poisson distribution to explore the likelihood of different population sizes (N) The question is: what is N?
20
What we need Data: number marked, number recaptured, tags recaptured If the population size is N Prop. tagged = num. marked / N Recoveries = prop. tagged × num. recaptured Predicted tags recovered is λ of the Poisson, i.e. if λ = 1,2,…,100 then what is the likelihood of observing 5 recaptures?
21
Population size Num tagged Prop. tagged Num recapture Pred. tags recoveredLikelihood 5001000.20010020.000.000 10001000.10010010.000.038 15001000.0671006.670.140 20001000.0501005.000.175 25001000.0401004.000.156 30001000.0331003.330.122 35001000.0291002.860.091 40001000.0251002.500.067 45001000.0221002.220.049 50001000.0201002.000.036 55001000.0181001.820.027 60001000.0171001.670.020 65001000.0151001.540.015 70001000.0141001.430.012 75001000.0131001.330.009 80001000.0131001.250.007 85001000.0121001.180.006 90001000.0111001.110.005 95001000.0111001.050.004 100001000.0101001.000.003 8 Examples.xlsx, sheet Mark-recapture Observed data: k = 5
22
8 Examples.xlsx, sheet Mark-recapture
23
Multiple observations We go out twice more, capture 100 animals each time, and find 3 of these are tagged the second time, and 4 are tagged the third time
24
8 Examples.xlsx, sheet Mark-recapture
25
Multiply the three likelihoods together The result is much narrower 8 Examples.xlsx, sheet Mark-recapture MLE
26
The likelihood profile Mark-recapture example only had one parameter, so the likelihood profile is just the likelihood For problems with more than one parameter, fix the parameter of interest at discrete values and find the maximum likelihood by searching over all other parameters You can use the likelihood profile to calculate a confidence interval – find the two parameter values where the negative log- likelihood is 1.92 units higher than the MLE value
27
1.92 units from the MLE Confidence interval for x
28
Likelihood ratio test and 1.92 When the NLL for a model parameter is more than 1.92 units from the MLE, that is the 95% confidence interval (asymptotically for large sample size, when well behaved, etc.)
29
8 Examples.xlsx, sheet MR CI calcs 1.92 units higher MLE Mark-recapture example MLE: 2500 95% CI: 1491-4668
30
The concept of support The relative likelihood is the amount of support the data offer for one hypothesis compared to another hypothesis The absolute likelihood has no particular meaning without reference to other hypotheses
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.