DDNMs: Coupled Systems of univariate DLMs

Slides:



Advertisements
Similar presentations
Sequential learning in dynamic graphical model Hao Wang, Craig Reeson Department of Statistical Science, Duke University Carlos Carvalho Booth School of.
Advertisements

Computer vision: models, learning and inference Chapter 8 Regression.
ROBERT ENGLE DIRECTOR VOLATILITY INSTITUTE AT NYU STERN THE ECONOMICS AND ECONOMETRICS OF COMMODITY PRICES AUGUST 2012 IN RIO.
Introduction Data and simula- tion methodology VaR models and estimation results Estimation perfor- mance analysis Conclusions Appendix Doctoral School.
The loss function, the normal equation,
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
RISK MANAGEMENT GOALS AND TOOLS. ROLE OF RISK MANAGER n MONITOR RISK OF A FIRM, OR OTHER ENTITY –IDENTIFY RISKS –MEASURE RISKS –REPORT RISKS –MANAGE -or.
1 MULTIVARIATE GARCH Rob Engle UCSD & NYU. 2 MULTIVARIATE GARCH MULTIVARIATE GARCH MODELS ALTERNATIVE MODELS CHECKING MODEL ADEQUACY FORECASTING CORRELATIONS.
Optimal Filtering of Jump Diffusions: Extracting Latent States from Asset Prices Jonathan Stroud, Wharton, U. Pennsylvania Stern-Wharton Conference on.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Learning In Bayesian Networks. Learning Problem Set of random variables X = {W, X, Y, Z, …} Training set D = { x 1, x 2, …, x N }  Each observation specifies.
CSC2535: 2013 Advanced Machine Learning Lecture 3a: The Origin of Variational Bayes Geoffrey Hinton.
Predictive versus Explanatory Models in Asset Management Campbell R. Harvey Global Asset Allocation and Stock Selection.
Cao et al. ICML 2010 Presented by Danushka Bollegala.
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
Alternative Measures of Risk. The Optimal Risk Measure Desirable Properties for Risk Measure A risk measure maps the whole distribution of one dollar.
Computer vision: models, learning and inference Chapter 19 Temporal models.
Empirical Financial Economics Asset pricing and Mean Variance Efficiency.
SIS Sequential Importance Sampling Advanced Methods In Simulation Winter 2009 Presented by: Chen Bukay, Ella Pemov, Amit Dvash.
Elements of Financial Risk Management Second Edition © 2012 by Peter Christoffersen 1 Distributions and Copulas for Integrated Risk Management Elements.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Elements of Financial Risk Management Second Edition © 2012 by Peter Christoffersen 1 Simulating the Term Structure of Risk Elements of Financial Risk.
Gaussian Processes Li An Li An
Learning In Bayesian Networks. General Learning Problem Set of random variables X = {X 1, X 2, X 3, X 4, …} Training set D = { X (1), X (2), …, X (N)
CIA Annual Meeting LOOKING BACK…focused on the future.
Additional Topics in Prediction Methodology. Introduction Predictive distribution for random variable Y 0 is meant to capture all the information about.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Gaussian Processes For Regression, Classification, and Prediction.
REGIME CHANGES AND FINANCIAL MARKETS Prepared for Topics in Quantitative Finance | Abhishek Rane - Andrew Ang and Allan Timmermann.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
1 VaR Models VaR Models for Energy Commodities Parametric VaR Historical Simulation VaR Monte Carlo VaR VaR based on Volatility Adjusted.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Hierarchical models. Hierarchical with respect to Response being modeled – Outliers – Zeros Parameters in the model – Trends (Us) – Interactions (Bs)
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
Types of risk Market risk
Bayesian Semi-Parametric Multiple Shrinkage
Value at Risk and Expected Shortfall
Market-Risk Measurement
Deep Feedforward Networks
Multi-task learning approaches to modeling context-specific networks
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
Portfolio Risk Management : A Primer
Alan Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani
PSG College of Technology
Types of risk Market risk
CSCI 5822 Probabilistic Models of Human and Machine Learning
Simultaneous Graphical Dynamic Models
Market Risk VaR: Model-Building Approach
Information Based Criteria for Design of Experiments
Q4 : How does Netflix recommend movies?
Estimation Error and Portfolio Optimization
Estimation Error and Portfolio Optimization
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
CSCI 5822 Probabilistic Models of Human and Machine Learning
Bryan Kelly Asaf Manela Alan Moreira
Pattern Recognition and Machine Learning
Estimation Error and Portfolio Optimization
Part III Exchange Rate Risk Management
The loss function, the normal equation,
Mathematical Foundations of BME Reza Shadmehr
Estimation Error and Portfolio Optimization
Measuring Exposure to Exchange Rate Fluctuations
Probabilistic Surrogate Models
Adaptive Variable Selection
Presentation transcript:

DDNMs: Coupled Systems of univariate DLMs Dimension … Structure … Volatility …

Key First Example: VAR(1) Written other ways: = (row j of )’ Lag 1 responses for each univariate series Cross-talk at lag 1: “networks” Feed-forward effects: “impulse” responses Contemporaneous links/association: V

Dynamics in Inter- & Cross-connections: Time-Varying VAR(1) Evolve in time Written another way: Time-varying lagged connections Time-varying volatilities and “co-volatilities” in contemporaneous connections Easy to add more predictors series-by-series

Parameter dimension – BIG with moderate q,d: scalability? VAR and TV-VAR(d) Models Univariate elements: Parameter dimension – BIG with moderate q,d: scalability? Parsimony: sparse lagged and cross-sectional dependencies?

Dynamic Dependency Network Models A Structured Subset of TV-VAR(d) Models Dynamic Dependency Network Models

Coupled DLMs: DDNMs Interest in: e.g. 500 stocks S&P, daily Scale up: large “Decoupled” univariate series/models Series-specific independent predictors Series-specific states, evolutions, discount factors, volatilities Sequential analysis: Fast? Parallelisation? Cross-series / multivariate structure : Sensitive co-volatility modelling Critical: coherent joint forecast distributions Computationally fast

Multi-Regression and DDNMs Extensions & generalizations of “multiregression DLMs” C.M. Queen & J.Q. Smith (RSS B, 1993; and later) Z. Zhao, M. Xie & MW (2015) Lopes, McCulloch & Tsay (2010, unpublished) J. Nakajima & MW (2013,14,15) Cholesky-style multivariate stochastic volatility (MSV)

Decoupled, Parallel Sets of Univariate DLMs Stock prices: Citibank, AT&T, Ebay, ….. Bank of America, GE Cross-series: directed & sparse graphical structure – lagged Idiosyncratic: volatility C T EBAY BOA GE Customised predictors external/independent factors lagged y – sparse set e.g. Citibank “info”

? Contemporaneous Coupling of Univariate DLMs Cross-series: directed graphical structure e.g. ? C T EBAY BOA GE A few parental predictors

Coupled Sets of Univariate DLMs Multiple univariate models “decoupled” in parallel Known predictors “Parents” Parental sets: *Current* values of (some) related series as predictors *Independent* across j: residuals & univariate volatilities:

Implied Coherent Multivariate Model dynamic regressions & precision/volatility Sparse Non-zeros in . Multivariate volatility (precision) matrix : - Cholesky-style: upper triangular form Notation: extend to row j and pad with 0 entries General conclusions from this example

Dynamic Graphical Model Structure Induced precision/volatility Non-zeros in . zero precision: conditional independence Non-zeros in . General conclusions from this example Links to & from parents Links to & from parents-of-parents

Parental-Induced Cross-Talk in Dynamic Regressions Non-zeros in . Non-zeros in . General conclusions from this example parents grandparents ……. great-grandparents

“Triangular” Set of Coupled Univariate DLMs Multiple univariate models “decoupled” in parallel Parallel states - independent evolutions 2 discount factors, series specific

Normal pdf in conditional model j Dynamic Dependency Network Models Parents of j Joint pdf: Normal pdf in conditional model j Compositional form of joint density - directed graphical model - General conclusions from this example Known & variances: Multiregression DLM TV-VAR with lagged in , & unknown volatility processes: Dynamic dependency networks

Summary Concepts: DDNMs - Multiple univariate models: decoupled, in parallel - Conducive to on-line sequential learning: Analytic, fast, parallel - Simultaneous parental sets define *sparse* multivariate stochastic volatility matrix - New dynamic graphical models for MSV: evolutions of - * Dependent on choice of order of named series j=1:m *

DDNMs: Model Forms, Filtering and Forecasting Parallel, conditionally independent analyses: Example: – independent, steady state evolutions – discounting for state and volatility Forecasting: 1-step: individual T distributions - marginal likelihoods for parameter assessment - series j specific (discount factors, TVAR model order, etc … ) 1-step: multivariate/joint forecast mean and variance matrix - analytic: recursively computed More steps ahead? General conclusions from this example

DDNMs: Recoupling Decoupled Models Forecasting: TVAR models: Simulate for more steps ahead DDNM: Simulate! For steps ahead k=1:K … General conclusions from this example

Example: FX, Commodities & Stock Indices Daily closing prices in $US Model log(price) General conclusions from this example

Model Structure and Hyper-parameters TVAR order - all possible parental sets - discrete ranges of TVAR order and discounts Discrete model space: parental set state discount factor volatility discount factor Sequential learning over time: 1-step forecast pdf: Student t Score and normalize: - Bayesian model uncertainty and averaging for forecasting

Parallel Model Uncertainty Analysis Sequential learning over time: Parallel! - across all models for each series - independently/parallel across series - computationally accessible - model space dimension reduction:

But First: More Time-adaptive Structure Learning Model probabilities converge/degenerate Sometimes fast Always to “wrong” model Lack of adaptability over time “Reality” may be outside the “span” of model set? - all are “poor models”

Power Discounting of Model Uncertainties Discount (“forgetting”) factor - Exponentially discount past information across models - “Synthetic” dynamics on model space - Intervention: Change model probabilities each time step Extend model uncertainty analysis: grid of + product of model probabilities over j [ Raftery et al, 2010 Koop & Korobilis 2012 ] [ P.J. Harrison, 1960s; West & Harrison, 1987 1st edn ]

Finance Example: Models, Analyses, Decisions 50:50 training/test data Train (to mid 4/2006): evaluate all Test (from mid 4/2006): use reduced subsets of models Forecasting and decisions: 5-day look-ahead portfolio Commodity Trading Advisors’ (CTA) benchmark (“market index”) Model m+1 series with CTA included Optimization of portfolio to: - minimize predicted portfolio risk - subject to specified %target exceedance over expected/forecast CTA benchmark - and decorrelated- in expectation- with CTA benchmark

Finance Example: Scoring Power Discounting Forecast RMSE & MAD Improves: 1,5-step forecasts - adaptability - Portfolio returns?

Finance Example: One Series

Finance Example:TVAR Model Order by Series

Finance Example: Model Discount Factors by Series

Finance Example: Parental Inclusion Probabilities

Finance Example: Parental Inclusion Probabilities (cont)

Finance Example: Parental Inclusion Probabilities (cont)

Finance Example: 1-Day Ahead Portfolios

Finance Example: 5-Day Ahead Portfolios

Finance Example: 5-Day Ahead Portfolios

DDNMs - Flexible, customisable univariate models: decoupled, in parallel - Parental structure: sparse multivariate stochastic volatility matrix - On-line sequential learning: Analytic, fast, parallel - Recoupling for coherent posterior and predictive inferences dependent on choice of order of named series j=1:m Scale-up in dimension? Choice/specification of order? 400 S&P stock price series?

Portfolios & Dynamic Multivariate Graphical Models Time series + conditional independence structures/constraints Multivariate dynamic linear models & multivariate stochastic volatility + Graphical models of cross-series structure High-dimensional asset allocation - portfolio problems Structure, parameter reduction: stability, parsimony, accuracy

Financial Series: Structure of Dependencies Multiple equities: managed funds, index funds, exchange rates, etc - period t returns Volatility … Structure … Precision matrix: defines conditional (in)dependencies

Utilities & Decisions in Dynamic Portfolios Models/choices of response variable - % returns - actual FX or commodity, stock prices - opportunities for more informed and improved forecasts using prices? Whatever model choice, derived 1-step forecasts for %returns: Rebalance portfolio decision: - want high % return - want to understand, control, reduce risk Portfolio allocation vector Choose (re)allocation vector each time step : Update/optimize sequentially via updated forecasts : Observed returns – cumulate k-step ahead variants

Standard Portfolio Examples Decisions: choice of utilities Risk = s.d. of portfolio Good things: high target return low risk stable portfolio weights a. Minimize risk, minimum variance: portfolio risk2 b. Target return: analytic – terms in c. Target constrained: b with non-negative weights. Requires non-linear optimization You & Me investors (401K, no shorting) Role of precision matrix

Gaussian Graphical Models – precision matrix zeros/structure: constrain, induce parsimony Zeros represent conditional independencies

Drawing Graphs Graph exhibits conditional independencies Zeros in precision matrix ~ missing edges International exchange rates example, p=11 No edge: [ Carvalho, Massam & W, Biometrika 2007 ]

Mean-Target Portfolio: Precisions & Graphical Models Precision matrix Optimal mean-target portfolio: Predicted return after “explaining” by other assets Residual variance (“nondiversifiable risk”) Sparsity, parsimony: Implications for portfolio decisions & revision over time

People & Links www.stat.duke.edu/~mw Zoey Zhao PhD (Duke 2015) Citadel, Chicago Amy Xie (Duke BA 2013) PhD student, Duke Dynamic dependence networks: Financial time series forecasting & portfolio decisions, ASMBI, 2016 www.stat.duke.edu/~mw