Fair-Balance Paradox, Star- tree Paradox, and Bayesian Phylogenetics Ziheng Yang, Mol. Biol. Evol. 2007 Presented by Caroline Uhler and Anna-Sapfo Malaspinas.

Slides:



Advertisements
Similar presentations
The multispecies coalescent: implications for inferring species trees
Advertisements

Bayes rule, priors and maximum a posteriori
Simulation Examples in EXCEL Montana Going Green 2010.
Basics of Statistical Estimation
Review of Probability. Definitions (1) Quiz 1.Let’s say I have a random variable X for a coin, with event space {H, T}. If the probability P(X=H) is.
Chapter 4 Probability and Probability Distributions
Flipping A Biased Coin Suppose you have a coin with an unknown bias, θ ≡ P(head). You flip the coin multiple times and observe the outcome. From observations,
Binomial Distribution & Bayes’ Theorem. Questions What is a probability? What is the probability of obtaining 2 heads in 4 coin tosses? What is the probability.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
Bayesian Wrap-Up (probably). 5 minutes of math... Marginal probabilities If you have a joint PDF:... and want to know about the probability of just one.
The Structure of a Hypothesis Test. Hypothesis Testing Hypothesis Test Statistic P-value Conclusion.
Final Exam: May 10 Thursday. If event E occurs, then the probability that event H will occur is p ( H | E ) IF E ( evidence ) is true THEN H ( hypothesis.
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
Assuming normally distributed data! Naïve Bayes Classifier.
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
Visual Recognition Tutorial
Bayesian learning finalized (with high probability)
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
INFM 718A / LBSC 705 Information For Decision Making Lecture 6.
LARGE SAMPLE TESTS ON PROPORTIONS
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
End of Chapter 8 Neil Weisenfeld March 28, 2005.
INFM 718A / LBSC 705 Information For Decision Making Lecture 7.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
. Approximate Inference Slides by Nir Friedman. When can we hope to approximate? Two situations: u Highly stochastic distributions “Far” evidence is discarded.
Language Modeling Approaches for Information Retrieval Rong Jin.
Lecture 9: p-value functions and intro to Bayesian thinking Matthew Fox Advanced Epidemiology.
P(Y≤ 0.4) P(Y < 0.4) P(0.1 < Y ≤ 0.15)
Crash Course on Machine Learning
1 Bernoulli trial and binomial distribution Bernoulli trialBinomial distribution x (# H) 01 P(x)P(x)P(x)P(x)(1 – p)p ?
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Choosing Sample Size for Knowledge Tracing Models DERRICK COETZEE.
Math 15 – Elementary Statistics Sections 7.1 – 7.3 Probability – Who are the Frequentists?
IID Samples In supervised learning, we usually assume that data points are sampled independently and from the same distribution IID assumption: data are.
Hierarchical Topic Models and the Nested Chinese Restaurant Process Blei, Griffiths, Jordan, Tenenbaum presented by Rodrigo de Salvo Braz.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Created by Tom Wegleitner, Centreville, Virginia Section 3-6 Probabilities Through Simulations.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Likelihood function and Bayes Theorem In simplest case P(B|A) = P(A|B) P(B)/P(A) and we consider the likelihood function in which we view the conditional.
The star-tree paradox in Bayesian phylogenetics Bengt Autzen Department of Philosophy, Logic and Scientific Method LSE.
Graph of a Binomial Distribution Binomial distribution for n = 4, p = 0.4:
WOOD 492 MODELLING FOR DECISION SUPPORT Lecture 25 Simulation.
Making sense of data We got to deal with some Math here folks.
Bayesian Phylogenetics. Bayes Theorem Pr(Tree|Data) = Pr(Data|Tree) x Pr(Tree) Pr(Data)
Simulating Probabilistic Behaviour
CHAPTER 6 Naive Bayes Models for Classification. QUESTION????
1 BA 555 Practical Business Analysis Linear Programming (LP) Sensitivity Analysis Simulation Agenda.
Objective  Find the experimental probability that an event will occur.
A shared random effects transition model for longitudinal count data with informative missingness Jinhui Li Joint work with Yingnian Wu, Xiaowei Yang.
Bootstrap ? See herehere. Maximum Likelihood and Model Choice The maximum Likelihood Ratio Test (LRT) allows to compare two nested models given a dataset.Likelihood.
Bayesian Approach Jake Blanchard Fall Introduction This is a methodology for combining observed data with expert judgment Treats all parameters.
16-3 The Binomial Probability Theorem. Let’s roll a die 3 times Look at the probability of getting a 6 or NOT getting a 6. Let’s make a tree diagram.
The Phylogenetic Indian Buffet Process : A Non- Exchangeable Nonparametric Prior for Latent Features By: Kurt T. Miller, Thomas L. Griffiths and Michael.
Bayesian statistics named after the Reverend Mr Bayes based on the concept that you can estimate the statistical properties of a system after measuting.
Chapter 14 Week 5, Monday. Introductory Example Consider a fair coin: Question: If I flip this coin, what is the probability of observing heads? Answer:
Random Variables If  is an outcome space with a probability measure and X is a real-valued function defined over the elements of , then X is a random.
Math 1320 Chapter 7: Probability 7.3 Probability and Probability Models.
National Alliance for Medical Image Computing Bayesian Classifier Image Filter John Georgia Tech Luis Kitware.
Bayesian Estimation and Confidence Intervals Lecture XXII.
Bayesian Estimation and Confidence Intervals
Applied statistics Usman Roshan.
… a tricky task !? Tanja Gernhard, Klaas Hartmann
Probability Trees By Anthony Stones.
Advanced Artificial Intelligence
Introduction to Bayesian Model Comparison
Bayes for Beginners Luca Chech and Jolanda Malamud
Counting and Classification (Dr. Monticino)
MAS2317- Introduction to Bayesian Statistics
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

Fair-Balance Paradox, Star- tree Paradox, and Bayesian Phylogenetics Ziheng Yang, Mol. Biol. Evol Presented by Caroline Uhler and Anna-Sapfo Malaspinas

Outline What is the star-tree paradox? – Simulations Explanation: the fair-coin paradox. Solutions to the star-tree paradox: – Data size dependent prior – Degenerate-model prior Discussion

Paradoxes Star tree paradox 3 species rooted tree. If data is generated using a star tree the probability of each resolved tree does not approch 1/3 in large data sets. Fair coin paradox Assuming you flip a fair coin n times and observe y number of heads. The posterior P + =P(  > 1/2) does not approach ½ (but rather the uniform distribution).

Simulations P1P1 P2P2 P3P3 P2P2 P3P3 P1P1 P i : posterior probability of seeing tree topology  i

Solution to the paradox(es) Specification of the prior: Data Size-Dependant prior Degenerate-Model Prior (non zero probability of to the degenerate model)

Fair-coin paradox: Behavior of posterior with data size dependent prior

Star-tree paradox: Standard deviation  = 0  = 0.5  = 0.51  =  = 0.8

Fair-coin paradox: Effect of prior a:  0 = 0  = 2 b:  0 = 0.1  = 2 c:  0 = 0  0 = 1/3 d:  0 = 0.1  0 = 1/3

Discussion Does the star-tree occur in nature? Are there other ways of resolving the paradox in practice? Should priors in existing programs (e.g. Mr Bayes) be modified accordingly? Use (features of) the data to define the prior? Phylogenetics: is that prior appropriate in general? A different approach: Steel and Matsen (2007)