Clustering of financial time series This paper addresses the topic of classifying financial time series in a fuzzy framework proposing two fuzzy clustering.

Slides:



Advertisements
Similar presentations
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Advertisements

Discrimination and Classification. Discrimination Situation: We have two or more populations  1,  2, etc (possibly p-variate normal). The populations.
A CTION R ECOGNITION FROM V IDEO U SING F EATURE C OVARIANCE M ATRICES Kai Guo, Prakash Ishwar, Senior Member, IEEE, and Janusz Konrad, Fellow, IEEE.
12-1 Multiple Linear Regression Models Introduction Many applications of regression analysis involve situations in which there are more than.
Portfolio Diversity and Robustness. TOC  Markowitz Model  Diversification  Robustness Random returns Random covariance  Extensions  Conclusion.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Introduction Data and simula- tion methodology VaR models and estimation results Estimation perfor- mance analysis Conclusions Appendix Doctoral School.
ARCGICE WP 2.2 ERROR ESTIMATION OF NEW GEOID C.C.Tscherning, University of Copenhagen,
Ch 5.6: Series Solutions Near a Regular Singular Point, Part I
Ch 5.1: Review of Power Series
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Ch 5.5: Series Solutions Near a Regular Singular Point, Part I We now consider solving the general second order linear equation in the neighborhood of.
Segmentation Graph-Theoretic Clustering.
Ch 5.1: Review of Power Series Finding the general solution of a linear differential equation depends on determining a fundamental set of solutions of.
The Pinhole Camera Model
12 1 Variations on Backpropagation Variations Heuristic Modifications –Momentum –Variable Learning Rate Standard Numerical Optimization –Conjugate.
Linear Equations in Linear Algebra
EE462 MLCV 1 Lecture 3-4 Clustering (1hr) Gaussian Mixture and EM (1hr) Tae-Kyun Kim.
Dealing with Heteroscedasticity In some cases an appropriate scaling of the data is the best way to deal with heteroscedasticity. For example, in the model.
Arithmetic Operations on Matrices. 1. Definition of Matrix 2. Column, Row and Square Matrix 3. Addition and Subtraction of Matrices 4. Multiplying Row.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Computer Graphics: Programming, Problem Solving, and Visual Communication Steve Cunningham California State University Stanislaus and Grinnell College.
Math 5364 Notes Chapter 8: Cluster Analysis Jesse Crawford Department of Mathematics Tarleton State University.
Alternative Measures of Risk. The Optimal Risk Measure Desirable Properties for Risk Measure A risk measure maps the whole distribution of one dollar.
ME 1202: Linear Algebra & Ordinary Differential Equations (ODEs)
Segmentation Course web page: vision.cis.udel.edu/~cv May 7, 2003  Lecture 31.
Statistics and Linear Algebra (the real thing). Vector A vector is a rectangular arrangement of number in several rows and one column. A vector is denoted.
Blind Pattern Matching Attack on Watermark Systems D. Kirovski and F. A. P. Petitcolas IEEE Transactions on Signal Processing, VOL. 51, NO. 4, April 2003.
T – Biomedical Signal Processing Chapters
1 1.3 © 2012 Pearson Education, Inc. Linear Equations in Linear Algebra VECTOR EQUATIONS.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Paper: Large-Scale Clustering of cDNA-Fingerprinting Data Presented by: Srilatha Bhuvanapalli INFS 795 – Special Topics in Data Mining.
Intro. ANN & Fuzzy Systems Lecture 26 Modeling (1): Time Series Prediction.
Vector Norms and the related Matrix Norms. Properties of a Vector Norm: Euclidean Vector Norm: Riemannian metric:
Elements of Financial Risk Management Second Edition © 2012 by Peter Christoffersen 1 Simulating the Term Structure of Risk Elements of Financial Risk.
Monte-Carlo method for Two-Stage SLP Lecture 5 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working Group on Continuous.
HMM - Part 2 The EM algorithm Continuous density HMM.
Meeting 18 Matrix Operations. Matrix If A is an m x n matrix - that is, a matrix with m rows and n columns – then the scalar entry in the i th row and.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 29 Nov 11, 2005 Nanjing University of Science & Technology.
Feature extraction using fuzzy complete linear discriminant analysis The reporter : Cui Yan
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Prototype Classification Methods Fu Chang Institute of Information Science Academia Sinica ext. 1819
Estimation Method of Moments (MM) Methods of Moment estimation is a general method where equations for estimating parameters are found by equating population.
CS 376 Introduction to Computer Graphics 04 / 25 / 2007 Instructor: Michael Eckmann.
1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Section 17.2 Line Integrals.
1 A Fuzzy Logic Framework for Web Page Filtering Authors : Vrettos, S. and Stafylopatis, A. Source : Neural Network Applications in Electrical Engineering,
Variations on Backpropagation.
Mean and Variance Dynamics Between Agricultural Commodity Prices, Crude Oil Prices and Exchange Rates Ardian Harri Mississippi State University Darren.
Copyright © Cengage Learning. All rights reserved. 2 SYSTEMS OF LINEAR EQUATIONS AND MATRICES.
Stats & Summary. The Woodbury Theorem where the inverses.
Copyright © Cengage Learning. All rights reserved. 16 Vector Calculus.
CHAPTER 3 Selected Design and Processing Aspects of Fuzzy Sets.
1 Objective To provide background material in support of topics in Digital Image Processing that are based on matrices and/or vectors. Review Matrices.
Dr. Thomas Kigabo RUSUHUZWA
LINEAR CLASSIFIERS The Problem: Consider a two class task with ω1, ω2.
A Genetic Algorithm Approach to K-Means Clustering
VAR models and cointegration
Discrimination and Classification
Section 7.4 Matrix Algebra.
Clustering Evaluation The EM Algorithm
Segmentation Graph-Theoretic Clustering.
Linear Equations in Linear Algebra
Representation of Functions by Power Series (9.9)
Unsupervised Learning II: Soft Clustering with Gaussian Mixture Models
Fuzzy Clustering Algorithms
Tutorial 10 SEG7550.
Presentation transcript:

Clustering of financial time series This paper addresses the topic of classifying financial time series in a fuzzy framework proposing two fuzzy clustering models both based on GARCH models. Two distance measures Two cluster models based on GARCH models

GARCH models and their autoregressive (AR) representation

AR-based distance measure for comparing time series For each pair of time series, z tk and z tk′, let and be the vectors of the estimated parameters of their finite AR representation, e.g. AR(R k ) and AR(R k′ ) respectively. squared Euclidean distance measure:

In this way, the AR coefficients replace the time series in the comparative assessment. When the AR coefficient vectors representing each time series have unequal length we can adopt the zero-padding solution, by adding zeros defining a new AR vector with the same length as the longer one.

GARCH-based distance measure for comparing time series This distance metric, which takes into account the volatility, is based on both the estimated GARCH parameters and their estimated covariances. Define the vectors of the estimated parameters: and the matrices : Vx = of estimated covariances of the GARCH(p, q) representation of each pair of time series.

distance measure: consider a pair Xt and Yt GARCH(1, 1) processes and their estimated covariance matrix:

带入公式 得:

GARCH-based fuzzy C-medoids clustering model (GARCH-FCMdC model) vector of the corresponding autoregressive coefficients.

where uic represents the fuzzy membership of the i-th AR process in the c-th cluster. is the squared Euclidean distance between the i-th AR process and the c-th medoid AR process. m > 1 is a weighting exponent that controls the fuzziness of the partition. As m increases, the membership degrees are fuzzier.

by means of the Lagrangian multiplier method, we get the local optimal solutions: Two further issues concern the detection of both the optimal number of clusters C and the fuzziness parameter m. In particular, in our application we consider the Fuzzy Silhouette index. Set the value of m in the interval (1, 1.5].

A Prototypical Case of AR-based Distance and Clustering Model Let Xt ∼ GARCH(1, 1) and Yt ∼ GARCH(1, 1) be two stochastic processes. The AR-coefficients corresponding to the processes are: The AR-distance becomes:

three convergent geometric series: Therefore, we obtain: The GARCH-FCMdC model can be rewritten as:

GARCH-based fuzzy C-medoids clustering model with Caiado distance (GARCH-FCMdCC model) let Z = {zt1, zt2,..., zti,..., ztI } (t = 1,..., T ) be a set of I univariate financial time series, L = {L1, L2,..., Li,..., LI } be the corresponding vectors of estimated parameters of their GARCH(p, q) representations and V = {V1, V,..., Vi,..., VI } be the set of the estimated covariances matrices, with Vi =

consider a subset of Z: with estimated parameters and covariances matrices of the GARCH(p, q) representations Then the GARCH-based fuzzy C-medoids clustering model with Caiado distance (GARCH-FCMdCC) can be written as: is the Caiado distance between the i-th financial time series and the c-th medoid financial time series.

The local optimal solution is:

Application to dailies’ returns of Euro exchange rates Present and discuss the results of an empirical application of the proposed GARCH-based Fuzzy C- Medoids clustering models to the volatility of dailies Euro exchange rates against 29 international currencies. The aim of the analysis is to identify the exchange rates that show similar fluctuation in the volatility of dailies’ returns and thus to classify Euro exchange rates vs major international currencies according to their stability.