Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Slides:



Advertisements
Similar presentations
Advanced topics in Financial Econometrics Bas Werker Tilburg University, SAMSI fellow.
Advertisements

One-way nonparametric ANOVA with trigonometric scores by Kravchuk, O.Y. School of Land and Food Sciences, University of Queensland.
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
Economics 105: Statistics Go over GH 11 & 12 GH 13 & 14 due Thursday.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Computing the ranks of data is only one of several possible so- called scoring methods that are in use... Section 2.7 reviews three of them – we’ll look.
Sampling Distributions (§ )
Introduction to Statistics
2008 SIAM Conference on Imaging Science July 7, 2008 Jason A. Palmer
ASYMPTOTIC PROPERTIES OF ESTIMATORS: PLIMS AND CONSISTENCY
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Maximum likelihood (ML) and likelihood ratio (LR) test
Maximum likelihood (ML)
Maximum likelihood (ML) and likelihood ratio (LR) test
Statistical Background
Chapter 11: Inference for Distributions
7. Nonparametric inference  Quantile function Q  Inference on F  Confidence bands for F  Goodness- of- fit tests 1.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Maximum likelihood (ML)
Statistical Theory; Why is the Gaussian Distribution so popular? Rob Nicholls MRC LMB Statistics Course 2014.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Lecture II-2: Probability Review
M obile C omputing G roup A quick-and-dirty tutorial on the chi2 test for goodness-of-fit testing.
 We cannot use a two-sample t-test for paired data because paired data come from samples that are not independently chosen. If we know the data are paired,
Comparison of data distributions: the power of Goodness-of-Fit Tests
Multiple testing in high- throughput biology Petter Mostad.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1 G Lect 6b G Lecture 6b Generalizing from tests of quantitative variables to tests of categorical variables Testing a hypothesis about a.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Nonparametric Statistical Methods: Overview and Examples ETM 568 ISE 468 Spring 2015 Dr. Joan Burtner.
Previous Lecture: Categorical Data Methods. Nonparametric Methods This Lecture Judy Zhong Ph.D.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Statistics in Food and Resource Economics Lecture I.
Estimating Incremental Cost- Effectiveness Ratios from Cluster Randomized Intervention Trials M. Ashraf Chaudhary & M. Shoukri.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
ASYMPTOTIC OPTIMALITY OF RANK TESTS OF LOCATION AND THE BROWNIAN BRIDGE Kravchuk, O.Y. Department of Mathematics, The University of Queensland.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Probability and Measure September 2, Nonparametric Bayesian Fundamental Problem: Estimating Distribution from a collection of Data E. ( X a distribution-valued.
Nonparametric Statistics
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Fourier series, Discrete Time Fourier Transform and Characteristic functions.
Analisis Non-Parametrik Antonius NW Pratama MK Metodologi Penelitian Bagian Farmasi Klinik dan Komunitas Fakultas Farmasi Universitas Jember.
1 Estimation of Gene-Specific Variance 2/17/2011 Copyright © 2011 Dan Nettleton.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Computing the ranks of data is only one of several possible so-called scoring methods that are in use... Section 2.7 reviews three of them – we’ll look.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 6.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.
Copyright © Cengage Learning. All rights reserved. 15 Distribution-Free Procedures.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Essential Statistics Chapter 191 Comparing Two Proportions.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 4. The Normality Assumption: CLassical Normal Linear Regression Model (CNLRM)
Virtual COMSATS Inferential Statistics Lecture-26
Chapter 7: Sampling Distributions
Lecture 17 Rank Correlation Coefficient
SA3202 Statistical Methods for Social Sciences
Trigonometric scores rank statistics
Some Nonparametric Methods
One-way nonparametric ANOVA with trigonometric scores by Kravchuk, O.Y. School of Land and Food Sciences, University of Queensland The trigonometric scores.
Sampling Distributions (§ )
Lecture Slides Elementary Statistics Twelfth Edition
Presentation transcript:

Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ

Olena Kravchuk Brownian bridge and nonparametric rank tests2 Lecture outline Definition and important characteristics of the Brownian bridge (BB) Interesting measurable events on the BB Asymptotic behaviour of rank statistics Cramer-von Mises statistic Small and large sample properties of rank statistics Some applications of rank procedures Useful references

Olena Kravchuk Brownian bridge and nonparametric rank tests3 Definition of Brownian bridge

Olena Kravchuk Brownian bridge and nonparametric rank tests4 Construction of the BB

Olena Kravchuk Brownian bridge and nonparametric rank tests5 Varying the coefficients of the bridge

Olena Kravchuk Brownian bridge and nonparametric rank tests6 Two useful properties

Olena Kravchuk Brownian bridge and nonparametric rank tests7 Ranks and anti-ranks First sampleSecond sample Index Data Rank Anti-rank354612

Olena Kravchuk Brownian bridge and nonparametric rank tests8 Simple linear rank statistic Any simple linear rank statistic is a linear combination of the scores, a’s, and the constants, c’s. When the constants are standardised, the first moment is zero and the second moment is expressed in terms of the scores. The limiting distribution is normal because of a CLT.

Olena Kravchuk Brownian bridge and nonparametric rank tests9 Constrained random walk on pooled data Combine all the observations from two samples into the pooled sample, N=m+n. Permute the vector of the constants according to the anti-ranks of the observations and walk on the permuted constants, linearly interpolating the walk Z between the steps. Pin down the walk by normalizing the constants. This random bridge Z converges in distribution to the Brownian Bridge as the smaller sample increases.

Olena Kravchuk Brownian bridge and nonparametric rank tests10 From real data to the random bridge First sampleSecond sample Index, i Data, X Constant, c Rank, R Anti-rank, D Bridge, Z

Olena Kravchuk Brownian bridge and nonparametric rank tests11 Symmetric distributions and the BB

Olena Kravchuk Brownian bridge and nonparametric rank tests12 Random walk model: no difference in distributions

Olena Kravchuk Brownian bridge and nonparametric rank tests13 Location and scale alternatives

Olena Kravchuk Brownian bridge and nonparametric rank tests14 Random walk: location and scale alternatives Shift = 2 Scale = 2

Olena Kravchuk Brownian bridge and nonparametric rank tests15 Simple linear rank statistic again The simple linear rank statistic is expressed in terms of the random bridge. Although the small sample properties are investigated in the usual manner, the large sample properties are governed by the properties of the Brownian Bridge. It is easy to visualise a linear rank statistic in such a way that the shape of the bridge suggests a particular type of statistic.

Olena Kravchuk Brownian bridge and nonparametric rank tests16 Trigonometric scores rank statistics The Cramer-von Mises statistic The first and second Fourier coefficients:

Olena Kravchuk Brownian bridge and nonparametric rank tests17 Combined trigonometric scores rank statistics The first and second coefficients are uncorrelated Fast convergence to the asymptotic distribution The Lepage test is a common test of the combined alternative (S W is the Wilcoxon statistic and S A-B is the Ansari-Bradley, adopted Wilcoxon, statistic)

Olena Kravchuk Brownian bridge and nonparametric rank tests18 Percentage points for the first component (one-sample) Durbin and Knott – Components of Cramer-von Mises Statistics

Olena Kravchuk Brownian bridge and nonparametric rank tests19 Percentage points for the first component (two-sample) Kravchuk – Rank test of location optimal for HSD

Olena Kravchuk Brownian bridge and nonparametric rank tests20 Some tests of location

Olena Kravchuk Brownian bridge and nonparametric rank tests21 Trigonometric scores rank estimators Location estimator of the HSD (Vaughan) Scale estimator of the Cauchy distribution (Rublik) Trigonometric scores rank estimator (Kravchuk)

Olena Kravchuk Brownian bridge and nonparametric rank tests22 Optimal linear rank test An optimal test of location may be found in the class of simple linear rank tests by an appropriate choice of the score function, a. Assume that the score function is differentiable. An optimal test statistic may be constructed by selecting the coefficients, b’s.

Olena Kravchuk Brownian bridge and nonparametric rank tests23 Functionals on the bridge When the score function is defined and differentiable, it is easy to derive the corresponding functional.

Olena Kravchuk Brownian bridge and nonparametric rank tests24 Result 4: trigonometric scores estimators Efficient location estimator for the HSD Efficient scale estimator for the Cauchy distribution Easy to establish exact confidence level Easy to encode into automatic procedures

Olena Kravchuk Brownian bridge and nonparametric rank tests25 Numerical examples: test of location t-testWilcoxonS1S1 p-value CI 95% (-172.4,28.6)(-185.0,25.0)(-183.0,25.0) 1.Normal, N(500,100 2 ) 2.Normal, N(580,100 2 )

Olena Kravchuk Brownian bridge and nonparametric rank tests26 Numerical examples: test of scale F-testSiegel-TukeyS2S2 p-value Normal, N(300,200 2 ) 2.Normal, N(300,100 2 )

Olena Kravchuk Brownian bridge and nonparametric rank tests27 Numerical examples: combined test F-testt-testS 1 2 +S 2 2 LepageCM p-value Normal, N(580,200 2 ) 2.Normal, N(500,100 2 )

Olena Kravchuk Brownian bridge and nonparametric rank tests28 When two colour histograms are compared, nonparametric tests are required as a priori knowledge about the colour probability distribution is generally not available. The difficulty arises when statistical tests are applied to colour images: whether one should treat colour distributions as continuous, discrete or categorical. Application: palette-based images

Olena Kravchuk Brownian bridge and nonparametric rank tests29 Application: grey-scale images

Olena Kravchuk Brownian bridge and nonparametric rank tests30 Application: grey-scale images, histograms

Olena Kravchuk Brownian bridge and nonparametric rank tests31 Application: colour images

Olena Kravchuk Brownian bridge and nonparametric rank tests32 Useful books 1.H. Cramer. Mathematical Methods of Statistics. Princeton University Press, Princeton, 19 th edition, G. Grimmett and D. Stirzaker. Probability and Random Processes. Oxford University Press, N.Y., J. Hajek, Z. Sidak and P.K. Sen. Theory of Rank Tests. Academic Press, San Diego, California, F. Knight. Essentials of Brownian Motion and Diffusion. AMS, Providence, R.I., K. Knight. Mathematical Statistics. Chapman & Hall, Boca Raton, J. Maritz. Distribution-free Statistical Methods. Monographs on Applied Probability and Statistics. Chapman & Hall, London, 1981.

Olena Kravchuk Brownian bridge and nonparametric rank tests33 Interesting papers 1.J. Durbin and M. Knott. Components of Cramer – von Mises statistics. Part 1. Journal of the Royal Statistical Society, Series B., K.M. Hanson and D.R. Wolf. Estimators for the Cauchy distribution. In G.R. Heidbreder, editor, Maximum entropy and Bayesian methods, Kluwer Academic Publisher, Netherlands, N. Henze and Ya.Yu. Nikitin. Two-sample tests based on the integrated empirical processes. Communications in Statistics – Theory and Methods, A. Janseen. Testing nonparametric statistical functionals with application to rank tests. Journal of Statistical Planning and Inference, F.Rublik. A quantile goodness-of-fit test for the Cauchy distribution, based on extreme order statistics. Applications of Mathematics, D.C. Vaughan. The generalized secant hyperbolic distribution and its properties. Communications in Statistics – Theory and Methods, 2002.