Statistical Approaches to Length of Reign Boyko Zlatev University of Alberta Canadian Young Researchers Conference in Mathematics and Statistics. Edmonton,

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

1 Chi-Square Test -- X 2 Test of Goodness of Fit.
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
N.D.GagunashviliUniversity of Akureyri, Iceland Pearson´s χ 2 Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted.
Sampling distributions of alleles under models of neutral evolution.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
4. Multiple Regression Analysis: Estimation -Most econometric regressions are motivated by a question -ie: Do Canadian Heritage commercials have a positive.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
How tired are students in the morning ? Jose Almanza Period
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Statistical inference: confidence intervals and hypothesis testing.
INTRODUCTORY MATHEMATICAL ANALYSIS For Business, Economics, and the Life and Social Sciences  2011 Pearson Education, Inc. Chapter 14 Integration.
Regression Analysis (2)
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Topic 5 Statistical inference: point and interval estimate
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Essential Statistics Chapter 131 Introduction to Inference.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
How many times a week are you late to school? Leslie Lariz Marisol Cipriano Period
INTRODUCTORY MATHEMATICAL ANALYSIS For Business, Economics, and the Life and Social Sciences  2007 Pearson Education Asia Chapter 14 Integration.
CHAPTER 17: Tests of Significance: The Basics
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
The Enlightenment Europe (a.k.a. the Age of Reason)
Confidence intervals and hypothesis testing Petter Mostad
What is you’re GPA? Cristian Espinoza Juan Gonzalez Period
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Copyright © 2010 Pearson Education, Inc. Slide
Introduction to Statistical Inference Jianan Hui 10/22/2014.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Sampling and estimation Petter Mostad
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Ricardo Garcia Bertin Pablo Eric Gomez Period:
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Name Mean of Sampling Distribution Standard Deviation/Error of Sampling Distribution 1 sample z-Interval for Proportions 1 sample z-interval for Means.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
AP Statistics Chapter 24 Notes “Comparing Two Sample Means”
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
The normal approximation for probability histograms.
Virtual University of Pakistan Lecture No. 34 of the course on Statistics and Probability by Miss Saleha Naghmi Habibullah.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
The Scientific Revolution
Chapter 4 Basic Estimation Techniques
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Goodness of Fit x² -Test
Chapter 25 Comparing Counts.
Statistical Methods For Engineers
CHAPTER 17: Tests of Significance: The Basics
Enlightenment – 1600 to 1700’s “Natural Laws” … Niccolo Machiavelli
Chapter 26 Comparing Counts.
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter 26 Comparing Counts.
Presentation transcript:

Statistical Approaches to Length of Reign Boyko Zlatev University of Alberta Canadian Young Researchers Conference in Mathematics and Statistics. Edmonton, May 2010

The Date of Founding of Rome Last Roman King deposed c. 503 BC. Traditional date of founding of Rome – c. 760 BC. Reigns of 7 Roman Kings cover 257 years in total. Isn’t it a very large value?

Sir Isaac Newton ( ) and “The Chronology of Ancient Kingdoms Amended” (1728) Newton was not much interested in probability (Sheynin, 1971). But he was interested in chronology! He reconstructed ancient chronology and made it “fit better with the course of Nature”. For that he analyzed some data, available in his time.

Newton’s Data and Results Average lengths of reign for different states concentrated in the interval 18÷20 years. In fact, this is roughly a 65% confidence interval, as shown by Stigler (JASA, 1977).

François-Marie Arouet de Voltaire ( ) Confirmed Newton’s results on the reigns on the German Emperors (elected!) – average is 18.4 years. Applied Newton’s rule to Chinese Kings. Noted that in states where revolutions are frequent, Newton’s rule shows too large a value.

Marie Jean Antoine Nicolas de Caritat, marquis de Condorcet ( ) Determined the chance that 7 kings shall reigned exactly 257 years. Applied “la probabilité propre” – “appropriate probability” (not clear what is it) to find the solution. Contributed to shortening the reign of Louis XVI – King of France.

Condorcet’s Solution Following De Moivre’s Law of Mortality with 90 years as limit of life, and the ages of (s)elected kings between 30 and 60 years, the probability seven kings reigning s years is the coefficient c s of x s in the expansion of:

Condorcet’s Solution (continued) Condorcet found P = c 257 = (may be it is an approximation only) Computation with R gives P = c 257 = , but it is also not very accurate: Mathematica gives P = c 257 =

Condorcet’s Solution (continued) “Appropriate probability” for 414 events possible: If P from R computation is used, the result is To find P(s≥257), one has to compute:

Karl Pearson “Biometry and Chronology” (Biometrika, 1928) “The frequency distribution of the reigns of souvereigns is given. What is the chance that a sample of seven selected at random will give a total length of reigns of 257 or more years?”

Karl Pearson “Biometry and Chronology” (Biometrika, 1928) (continued) The frequency distribution we have is not for the same country at the same time (further discussions are possible about which is more important – time or space). In reality averages for different countries and times are (according to Pearson) in surprising agreement with Newton’s results, i.e. concentrated between 18 and 20 years.

Karl Pearson “Biometry and Chronology” (Biometrika, 1928) (continued) Histogram for 250 European Monarchs

Karl Pearson “Biometry and Chronology” (Biometrika, 1928) (continued) Distribution of Means in Samples of Seven Reigns According to this distribution, Pearson found the probability for seven kings reigning at least 257 years to be This is about ½ of the result based on de Moivre’s Law of Mortality

Karl Pearson “Biometry and Chronology” (Biometrika, 1928) (continued) “Seven men are chosen at random between the ages of 30 and 60. Find the chance that a sample of seven selected at random will give a total length of reigns of 257 or more years”. The life-table chosen will not be from same time and same country. The choice of age interval is questionable.

Solution by Charles F. Trustam (Biometrika, 1928) H M life table is used (healthy males from England and Scotland, mid. XIX century). Linear approximation for λ between 0 and 45. P =

1943 E.Rubin. The Place of Statistical Methods in Modern Historiography. American Journal of Economics and Sociology. S.Ya.Lurie. Newton – Historian of the Antiquity. In Proceedings of Symposium (USSR), dedicated to 300-th anniversary from Newton’s birth In both papers Newton’s approach to the date of founding of Rome and similar problems is examined. As pointed by Lurie, in some cases Newton’s method gives surprisingly accurate results, confirmed by evidence found later. According to Rubin: “Statistical methods may only be applied if the historian knows something as to the reliability of the data. (…) There must be a clear comprehension of these techniques, which implies an understanding of their possibilities and limitations”.

Thomas R. Trautmann. Length of Generation and Reign in Ancient India. Journal of the American Oriental Society (1969) Hypotheses concerning ancient Indian chronology must be tested by comparing with a sample of kings from medieval India whose dates are better known. The problems of interregna and missing data are considered. Estimations based on the number of generations are more accurate then those based on the number of reigns.

E. Khmaladze, R. Browning, J. Haywood Brittle power: On Roman Emperors and exponential lengths of rule. Statistics & Probability Letters (2007) Hypothesis tested: a sample T 1, …, T n of n independent r.vs. follows exponential distribution with some a priori unspecified λ>0. One could consider an empirical distribution function of the sample and compare it to, with estimated by from the same data. Instead of taking the difference there is an advantage to take where K is called the “compensator” in the martingale theory of point processes. For the family of exponential distributions its form is simple. If the hypothesis of exponentiality is true, the version of the empirical process converges in distribution to the standard Brownian motion. So, the classical Kolmogorov-Smirnov statistic can be used.

E. Khmaladze et al. (continued) Results For the three chronological tables of Roman Emperors (those of Kienast, Parkin and Gibbon) the p-values are 0.26, 0.17 and 0.20, respectively. Hence, the hypothesis of exponentiality is not rejected. Exponentiality rejected for Chinese Emperors (p=0.028). Age of ascent dependence for British and Spanish monarchs.

Why? “…their [Roman Emperors’] lives were constantly exposed to turmoil and life threatening challenges, no matter in peace or war” (Khmaladze at al.) But the same must hold for other monarchs also! There are differences in the inheritance rules!

Topology of Data: Family tree (graph!) mapped on a time interval

Idea: to apply FDA Considering non-homogeneous Poisson process with intensity µ(t) = exp(W(t)), representing W(t) in terms of a set of basis functions (B-splines, Fourier, etc.) and minimizing the following: (Ramsay & Silverman, Functional Data Analysis, 2006).

THANK YOU!