Archimedean Copulas Theodore Charitos MSc. Student CROSS.

Slides:



Advertisements
Similar presentations
INTRODUCTION TO COPULAS
Advertisements

Chapter 10: Estimating with Confidence
Pair-copula constructions of multiple dependence Workshop on ''Copulae: Theory and Practice'' Weierstrass Institute for Applied Analysis and.
Models for construction of multivariate dependence Workshop on Copulae and Multivariate Probability distributions in Finance – Theory, Applications,
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Simulating Exchangeable Multivariate Archimedean Copulas and its Applications Authors: Florence Wu Emiliano A. Valdez Michael Sherris.
The Simple Linear Regression Model: Specification and Estimation
Chapter 7 Sampling and Sampling Distributions
Statistical Background
Continuous Random Variables and Probability Distributions
Experimental Evaluation
Properties of Random Numbers
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Chapter 10: Estimating with Confidence
M obile C omputing G roup A quick-and-dirty tutorial on the chi2 test for goodness-of-fit testing.
Nonparametric or Distribution-free Tests
Marketing Research, 2 nd Edition Alan T. Shao Copyright © 2002 by South-Western PPT-1 CHAPTER 17 BIVARIATE STATISTICS: NONPARAMETRIC TESTS.
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
 1  Outline  stages and topics in simulation  generation of random variates.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Random Sampling, Point Estimation and Maximum Likelihood.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Two Functions of Two Random.
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to.
Multiple Random Variables Two Discrete Random Variables –Joint pmf –Marginal pmf Two Continuous Random Variables –Joint Distribution (PDF) –Joint Density.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
Copyright © 2010 Pearson Education, Inc. Slide
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Two Random Variables.
Tip #1: Look at your data (1)
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Virtual University of Pakistan Lecture No. 44 of the course on Statistics and Probability by Miss Saleha Naghmi Habibullah.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Sampling and estimation Petter Mostad
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
STOCHASTIC HYDROLOGY Stochastic Simulation of Bivariate Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
Models for construction of multivariate dependence 2nd Vine Copula Workshop, Delft, 16. December 2008 Kjersti Aas, Norwegian Computing Center.
Continuous Random Variables and Probability Distributions
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Latvijas Aktuāru Asociācija AN INTRODUCTION TO COPULAS Gaida Pettere Professor, Dr. Math. Riga Technical University, Chairmen of Latvian Actuarial Association.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
1 Lecture Plan Modelling Profit Distribution from Wind Production (Excel Case: Danish Wind Production and Spot Prices) Reasons for copula.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Inference about the slope parameter and correlation
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Chapter 8: Estimating with Confidence
Presentation transcript:

Archimedean Copulas Theodore Charitos MSc. Student CROSS

Task-Goal Examination of relations between Archimedean copulas and diagonal band or minimum information copulas given correlation constraints Computation of relative information with respect to uniform distribution for each family.

Accomplishment of tasks Use of the algorithm provided in the paper of Christian Genest and Louis-Paul Rivest “Statistical Inference Procedures for Bivariate Archimedean Copulas”. Use of small program in Matlab for calculating numerically the relative information with respect to uniform distribution.

Structure of presentation Theoretical Background Explanation and description of the whole procedure proposed by Genest and Rivest. Analysis of datasets sampled from Unicorn software. Results Conclusions-Discussions

Theoretical Background Definition: A bivariate distribution function with marginals and is said to be generated by an Archimedean copula if it can be expressed in the form for some convex, decreasing function on in such a way that

Proposition: Let and be uniform random variates whose dependence function is of the form for some convex decreasing function defined on with the property that. Set, and. Then is uniformly distributed on, is distributed as and, are independent random variables.

Proposition: Let X and Y be uniform random variables with dependence function. For let and define The function is convex,decreasing and satisfies if and only if for all. It is obvious from the above propositions that is determined as long as can be determined from the dataset. This will be done in our case via a nonparametric estimation of the distribution of V based on a decomposition of Kendall’s tau.

A pair of random variables is concordant if large values of one tend to be associated with large values of the other and vice versa. More precisely, if we have two observations and from a vector of continuous random variables we say that and are concordant if and. Similarly, and are discordant if and or vice versa.

Definition The Kendall’s tau for the sample is defined as where c is the number of concordant pairs, d is the number of discordant pairs from n observations of a vector and is the number of distinct pairs of observations in the sample.

For Archimedean copulas the Kendall’s tau statistic can be conveniently computed via the identity Apparently, the problem now of estimating the bivariate dependence function relies on the estimation of. Genest and Rivest provide a nonparametric procedure for estimating and also.

Analysis of various datasets The algorithm proposed by Genest and Rivest uses the variables where the symbol # stands for the cardinality of a set. If denotes the distribution function of a point mass at the origin, then a nonparametric estimator of is given by Knowing that, a sample equivalent for the estimation of is

Family Clayton Frank Gumbel -

The next step of the analysis concerns the performance of a Pearson chi-squared goodness of fit test statistic for each family in order to assess the fit of the various models. This means that a classification of the dataset is made each time constituting the observed frequencies. However, since the chi-squared test requires predicted values for its computation, it is necessary to generate random variates whose joint distribution belongs to one of the mentioned Archimedean families

Algorithm for sampling from Archimedean families 1.Generate two independent uniform variates u and t. 2. Set 3. Set v = 4. The desired pair is

Archimedean Family Clayton Frank Gumbel

Clayton’s joint density with a=1.514 Frank’s joint density with a=4.604 Gumbel’s joint density with a=0.757

In general, the relative information with respect to uniform distribution for the bivariate case is computed as where is the joint density of and An approximation of the real solution in each case will be provided, which however is enough to indicate what should someone expect from each Archimedean family.

To illustrate the above procedure six datasets (n=1000) were at first sampled and thoroughly analyzed. The correlations were 0.2, 0.65 and 0.9 for both the diagonal band and the minimum information copulas. A classification of the frequencies was also decided. For the sake of completeness, six more datasets with similar correlations constraints but different size (n=5000) and classification were also analyzed in order to compare results.

Recapitulation-Steps Sample from diagonal band and minimum information copulas. Estimate Kendal’s tau and the empirical lambda function. Estimate the parameters for each family according to the previous results. Estimate the lambda functions for each family. Classify the dataset in categories and simulate values from each family according to their estimated parameters. Perform chi-square goodness of fit test and compare the resulting fits. Compute the relative information with respect to uniform distribution. Repeat the whole procedure for different correlations and size of the dataset.

Examples of classifications from diag.band with 0.2 Cross-Classification of X and Y (Observed values) X\Y Cross-Classification of X and Y (Observed values) X\Y

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

= Statistic df Clayton Frank Gumbel = Statistic df Clayton Frank Gumbel

Relative Information with respect to uniform distribution Clayton Frank Gumbel Diag.band 0.2 (n=1000) Min.inf.0.2 (n=1000) Diag.band 065 (n=1000) Min.inf.0.65 (n=1000) Diag.band 0.9 (n=1000) Min.inf.0.9 (n=1000) Diag.band0.2 (n=5000) Min.inf.0.2 (n=5000) Diag.band 0.65 (n=5000) Min.inf.0.65 (n=5000) Diag.band0.9 (n=5000) Min.inf.0.9 (n=5000)

Conclusions-Comments for correlation 0.2 all the three families seem to fit reasonably well when n=1000,but when n=5000 only Frank’s and also Gumbel’s for the min.information. for correlation 0.65 the results are quite promising only when n=1000 for correlation 0.9 Frank’s and Gumbel’s family seem to fit the data when n=1000 and only Frank’s family when n=5000. the results are more promising in the cases of minimum information copula and this is actually a fact that holds for all the datasets no matter what the correlation is. the results are much better when the size of the dataset is smaller. It is obvious that the chi-square test statistic is sensitive to the number of cells.For greater size n and 6x6 cross-classification the results in almost all cases are disappointing. A performance of another goodness of fit test might result in more encouraging conclusions. for correlation 0.2 Clayton’s family has the smallest values of relative information with respect to uniform distribution. Nonetheless, for greater correlations, Frank’s family has the smallest values.