Chronologies from radiocarbon dates to age-depth models Richard Telford Bio 351 Quantitative Palaeoecology Lecture Plan Calibration of single dates 14.

Slides:



Advertisements
Similar presentations
Introduction to Monte Carlo Markov chain (MCMC) methods
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Design of Experiments Lecture I
CS479/679 Pattern Recognition Dr. George Bebis
Cost of surrogates In linear regression, the process of fitting involves solving a set of linear equations once. For moving least squares, we need to form.
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Markov-Chain Monte Carlo
Bayesian statistics – MCMC techniques
Computing the Posterior Probability The posterior probability distribution contains the complete information concerning the parameters, but need often.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Curve-Fitting Regression
Particle filters (continued…). Recall Particle filters –Track state sequence x i given the measurements ( y 0, y 1, …., y i ) –Non-linear dynamics –Non-linear.
End of Chapter 8 Neil Weisenfeld March 28, 2005.
1 lBayesian Estimation (BE) l Bayesian Parameter Estimation: Gaussian Case l Bayesian Parameter Estimation: General Estimation l Problems of Dimensionality.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Bayesian Estimation (BE) Bayesian Parameter Estimation: Gaussian Case
Maximum likelihood (ML)
Objectives of Multiple Regression
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
0 Pattern Classification, Chapter 3 0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda,
Statistical Decision Theory
PARAMETRIC STATISTICAL INFERENCE
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
Bayesian Inversion of Stokes Profiles A.Asensio Ramos (IAC) M. J. Martínez González (LERMA) J. A. Rubiño Martín (IAC) Beaulieu Workshop ( Beaulieu sur.
10.1: Confidence Intervals – The Basics. Review Question!!! If the mean and the standard deviation of a continuous random variable that is normally distributed.
Curve-Fitting Regression
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Fast Simulators for Assessment and Propagation of Model Uncertainty* Jim Berger, M.J. Bayarri, German Molina June 20, 2001 SAMO 2001, Madrid *Project of.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Latent Class Regression Model Graphical Diagnostics Using an MCMC Estimation Procedure Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University
Radiocarbon Calibration By Jonathan Chacon ESS November2007.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Calibrating Radiocarbon Ages Matt Baillie 4/1/04 HWR696T.
Machine Learning 5. Parametric Methods.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Bayesian Modelling Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
CSC321: Lecture 8: The Bayesian way to fit models Geoffrey Hinton.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Hierarchical Models. Conceptual: What are we talking about? – What makes a statistical model hierarchical? – How does that fit into population analysis?
Markov Chain Monte Carlo in R
Stats Methods at IC Lecture 3: Regression.
Chapter 7. Classification and Prediction
Goodness of Fit x² -Test
Data Mining Lecture 11.
Pattern Classification, Chapter 3
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Statistical Methods For Engineers
Introduction to Instrumentation Engineering
Modelling data and curve fitting
Filtering and State Estimation: Basic Concepts
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Statistical NLP: Lecture 4
Ch13 Empirical Methods.
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Presentation transcript:

Chronologies from radiocarbon dates to age-depth models Richard Telford Bio 351 Quantitative Palaeoecology Lecture Plan Calibration of single dates 14 C years  cal years Bayesian statistics Calibration of multiple dates in a series at the same event Age-depth models

14 C half-life is 5730 years Suitable for organic material and carbonates Useful for sediments years old The most widely used dating tool for late- Quaternary studies Unique amongst absolute-dating methods in not giving a date in calendar years Radiocarbon Dating

Radioactive Decay 14 C→ 14 N+  Random process Atom has 50% chance of decaying in 5730 yrs Exponential decay

Radioactive Decay equations What is λ?

Express measured 14 C as %modern A=A initial e -ln(2)*age/halflife ln(A/A initial )=ln(2)*age/halflife Use Libby halflife 5568 age= ln(A/A initial ) assume A initial = A modern age= ln(A/A modern ) Using Radioactive Decay equations Assumes atmospheric 14 C constant

Causes of Non-Constant Atmospheric 14 C 1) Changes in production - Variations in solar activity solar minimum weak magnetic shield maximum 14 C production solar maximum strong magnetic shield minimum 14 C production 2) Changes in distribution - rate of ocean turnover - global vegetation changes - Variations in earth magnetic field strength

Dendrochronological Evidence Find 14 C date of tree rings of known age

INTCAL04

14 C Calibration Curves Atmospheric Marine

Calibration: from 14 C Age to Calibrated Age The intercept method –quick, easy and entirely inappropriate Classical calibration (CALIB) –fast and simple Bayesian calibration –allows use of prior information

Calibration of marine dates Use either classical or Bayesian calibration Use the marine calibration curve Set ΔR – the local reservoir affect offset Set σΔR – the uncertainty Do not subtract R

The Intercept Method: Multiple Intercepts Calibrated years BP Radiocarbon years BP 4540± ±50

The Intercept Method: Missing Probabilities

C lassical Calibration  Unknown calendar date  (  ) is the true radiocarbon age  but cannot be measured precisely Radiocarbon date y is a realisation of Y =  (  ) + noise Noise is assumed to have a Normal distribution with mean 0, and standard deviation  Thus Y~N(  (  ),  2 ).

C lassical Calibration Normal Distribution The probability distribution p(Y) of the 14 C ages Y around the 14 C date y with a total standard deviation  is: Total standard deviation  is, where  s and  c are the standard deviations of the 14 C date and calibration curve respectively: The calibration curve can be defined as: Replacing Y with  (  ), p(Y) is: To obtain P(  ),  (  ) is determined for each calendar year and the corresponding probability is transferred to the  axis. Y =  (  )

C lassical Calibration Quick and simple Fine if we just have one date But difficult to include any a priori knowledge e.g. dates in a sequence To do this we need to use the Bayesian paradigm

The Bayesian Paradigm ( ) Bayes, T.R. (1763) An essay towards solving a problem in the Doctrine of Chances. Philosophical Transactions of the Royal Society, 53: Can utilise information outside of the data. This prior information and its related uncertainty must be encoded into probabilities. Then it can be combined with data to assess the total value of the combined information. Bayes' Theorem provides a structure for doing this. Simple in theory, but computationally difficult.

The Bayesian Paradigm The Likelihood - “How likely are the values of the data observed, given some specific values of the unknown parameters?” The Prior – “How much belief do I attach to possible values of the unknown parameters before observing the data?” The Posterior - “How much belief do I attach to possible values of the unknown parameters after observing the data?” The Posterior The Likelihood The Prior

The Likelihood  Unknown calendar date  (  ) is the true radiocarbon age  but cannot be measured precisely Radiocarbon date y is a realisation of Y =  (  ) + noise Noise is assumed to have a Normal distribution with mean 0, and standard deviation  Thus Y~N(  (  ),  2 ). With the calibration curve, we have an estimate of  (  ), and can formalise the relationship between  and y±  This is the likelihood.

The Prior For a single date with weak (or no) a priori information we can use an non- informative prior e.g. for a date  known to be post-glacial P prior (  )= Often we know more than this. Perhaps there is stratigraphic information: e.g. dates  1,  2 &  3 are taken from a sediment core and are in chronological order P prior (  1 <  2 <  3 )= The Bayesian paradigm offers the greatest advantage over classical methods when there is a strong prior and overlapping data. constant for -50<  < otherwise constant for  1 <  2 <  3 <  0 otherwise

Computation of the Posterior Analytically calculation is impossible for all but the simplest cases So instead Produce many simulations from the posterior and use as estimate Markov Chain Monte Carlo does this to give approximate solution Markov Chain? - each simulation depends only on the previous one - selected from range of possible values - the state space Areas with higher probability will be sampled more frequently

Markov Chain Continued 1.Start with an initial guess 2.Select the next sample 3.Repeat step 2 until convergence is reached Gibbs sampler - one of the simplest MCMC methods

theta[1] chains 1:2 iteration Convergence Easier to diagnose that it hasn’t converged, than prove that it has.

Reproducibility MCMC does not yield an exact answer It is the outcome of random process Repeated runs can give different results Calibrate multiple times & verify results are similar Report just one run Acknowledge level of variability

Outlier Detection Outliers can have a large impact on the age estimates Extreme but “correct” dates Contamination Erroneous assumptions? Need a method to detect them and reduce their influence Outliers can only be defined based on calibrated dates Christen (1994) Radiocarbon determinations dating the same event should come from N(  (  ),  2 ) An outlier is a determination that needs a shift  j Given the a priori probability that a date is an outlier, posteriori probabilities can be calculated Calibration and outlier detection done together Automatic down-weighting of outliers

Dates in Stratigraphic Order

Wiggle Matching In material with annual increments (tree-rings & varves) Time between two dates precisely known 20 years 11 22 This additional information can be used in the prior

Wiggle Matching 2 Buck et al. (1996) Bayesian approach to interpreting archaeological data. Wiley: Chichester. p

Wiggle Matching in Unlaminated Sediments  x 12 11 33 If the sedimentation rate is assumed to be constant: (  1-  2)/(  2-  3)  =  x 12 /  x 23 This information can be used in the prior 22  x 23

Wiggle Matching in Unlaminated Sediments Wiggle matching has greatest impact when the calibration curve is very wiggly there is a high density of dates But may be sensitive to the assumption of linear sedimentation Christen et al. (1995) Radiocarbon

Sensitivity Tests Bayesian radiocarbon calibration is very flexible and sensitive Apparently small changes in prior information can have a large effect on the results Need to carefully consider the specific representations you choose And investigate what happens when you vary them Report the findings

Software Oxcal Download fromhttp:// Fast & easy for simple models BCAL Online at Automatic outlier detection WinBugs If you want to implement a novel model Remember to enter your samples oldest first!

From Dates to Chronologies Not every level dated –too expensive –insufficient material Fit age-depth to find undated levels –Linear interpolation –Linear regression models –Splines –Mixed-effect models (Heegaard et al. (2005)) Age-depth models based on uncalibrated dates are meaningless

Linear Interpolation What assumptions does this make? Lake Tilo

Linear Interpolation – Join the Dots Which dots? 285 BP

Linear regression models Lake Tilo What assumptions does this make? Also weighted- least squares Assess by  2 Polynomial order

Is Sedimentation a Polynomial Function? Holzmaar varve sequence

Conclusions Bayesian calibration of 14 C dates - allows inclusion of prior knowledge - produces more precise calibrations - but, if the priors are invalid, lower accuracy Age-depth modelling - lots of different methods - some are worse than others - no currently implemented method properly incorporates the full uncertainties