Conceptual Differences Between Cube Analyst and Cube Analyst Drive Austen C. Duffy, Ph.D. Computational Mathematician, Citilabs.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Lect.3 Modeling in The Time Domain Basil Hamed
Fast Algorithms For Hierarchical Range Histogram Constructions
Tutorial 10: Performing What-If Analyses
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
R OBERTO B ATTITI, M AURO B RUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Feb 2014.
Model calibration using. Pag. 5/3/20152 PEST program.
Experimental Design, Response Surface Analysis, and Optimization
Supervised Learning Recap
A Mathematica ® based regression analysis program Analisys … A Curve Fitting Application.
GEOG 111 & 211A Transportation Planning Traffic Assignment.
Announcements  Homework 4 is due on this Thursday (02/27/2004)  Project proposal is due on 03/02.
Spatial Interpolation
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
CSCI 4550/8556 Computer Networks Comer, Chapter 19: Binding Protocol Addresses (ARP)
Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University.
Class 3: Estimating Scoring Rules for Sequence Alignment.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems
Basic Mathematics for Portfolio Management. Statistics Variables x, y, z Constants a, b Observations {x n, y n |n=1,…N} Mean.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
An Experimental Procedure for Mid Block-Based Traffic Assignment on Sub-area with Detailed Road Network Tao Ye M.A.Sc Candidate University of Toronto MCRI.
Principles of the Global Positioning System Lecture 11 Prof. Thomas Herring Room A;
TPB Models Development Status Report Presentation to the Travel Forecasting Subcommittee Ron Milone National Capital Region Transportation Planning Board.
Linear Recurrence Relations in Music By Chris Hall.
Particle Filtering in Network Tomography
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
Enhancing TDF Model Results Using Intersection Control Specific Delays and Turning Movement Level Matrix Estimation for a Downtown Circulation Study Presented.
DSS Modeling Current trends – Multidimensional analysis (modeling) A modeling method that involves data analysis in several dimensions – Influence diagram.
Modeling.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Minimum Phoneme Error Based Heteroscedastic Linear Discriminant Analysis for Speech Recognition Bing Zhang and Spyros Matsoukas BBN Technologies Present.
Optimization in Engineering Design Georgia Institute of Technology Systems Realization Laboratory 101 Quasi-Newton Methods.
Integration of Transportation System Analyses in Cube Wade L. White, AICP Citilabs Inc.
IOS110 Introduction to Operating Systems using Windows Session 5 1.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
DBSQL 12-1 Copyright © Genetic Computer School 2009 Chapter 12 Recent Concepts and Application of Databases.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
Getting to Know Cube.
Dynamics of Traffic Flows in Combined Day-to-day and With- in Day Context Chandra Balijepalli ITS, Leeds September 2004.
SIMULTANEOUS TRAVEL MODEL ESTIMATION FROM SURVEY DATA AND TRAFFIC COUNTS May 20, 2015 Vince Bernardin, PhD, RSG Steven Trevino, RSG John-Paul Hopman, MACOG.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
MATRIX ADJUSTMENT MACRO (DEMADJ.MAC AND DEMADJT.MAC) APPLICATIONS: SEATTLE EXPERIENCE Murli K. Adury Youssef Dehghani Sujay Davuluri Parsons Brinckerhoff.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
Exploring Cube Base and Cube Voyager. Exploring Cube Base and Cube Voyager Use Cube Base and Cube Voyager to develop data, run scenarios, and examine.
A n = c 1 a n-1 + c2an-2 + … + c d a n-d d= degree and t= the number of training data (notes) The assumption is that the notes in the piece are generated.
“On Sizing and Shifting The BFGS Update Within The Sized-Broyden Family of Secant Updates” Richard Tapia (Joint work with H. Yabe and H.J. Martinez) Rice.
Multi-modal Demand Adjustment in EMME2 Yuri Teleshevsky URS Corporation, New York.
FLSWM 2040 Traffic Projection Hongbo Chi. Introduction FLSWM provides travel forecasting over the entire state reflecting long range demographic and socioeconomic.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
A Tour-Based Urban Freight Transportation Model Based on Entropy Maximization Qian Wang, Assistant Professor Department of Civil, Structural and Environmental.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Chapter 7 What Can Computers Do For Me?. How important is the material in this chapter to understanding how a computer works? 4.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
TRB Planning Applications Conference May 2009, Houston,TX A Caveat on O-D Matrix Estimation/Adjustment: Deviations from a seed matrix and Simultaneous.
1 Forecasting Traffic for a Start-Up Toll Road 12 th TRB National Transportation Planning Application Conference May 18, 2009 David Schellinger, P.E. Vice.
Ensemble forecasting/data assimilation and model error estimation algorithm Prepared by Dusanka Zupanski and Milija Zupanski CIRA/CSU Denning group meeting.
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems Prof. Hao Zhu Dept. of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
Computacion Inteligente Least-Square Methods for System Identification.
Discriminative n-gram language modeling Brian Roark, Murat Saraclar, Michael Collins Presented by Patty Liu.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Cost9b 1 Living with Function Points Bernstein and Lubashevsky Text pp
Author: Vikas Sindhwani and Amol Ghoting Presenter: Jinze Li
LECTURE 11: Advanced Discriminant Analysis
Network Assignment and Equilibrium for Disaggregate Models
Calibration.
Roberto Battiti, Mauro Brunato
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Presentation transcript:

Conceptual Differences Between Cube Analyst and Cube Analyst Drive Austen C. Duffy, Ph.D. Computational Mathematician, Citilabs

The Cube Analyst Suite The Cube Analyst Suite consists of two OD matrix estimation programs, each possessing some unique features and capabilities. Cube Analyst - The original matrix estimation program capable of a wide variety of static type estimation problems. Cube Analyst Drive – The new matrix estimation program designed for dynamic estimation, but possessing some nice static estimation features as well.

Matrix Estimation w/ the Analyst Suite

Cube Analyst vs. Cube Analyst Drive: The Basics Methodology: Statistics Based Estimation using Maximum Likelihood Method Problem Types: Small Static Matrices, Public Transit Supporting Programs: Highway, PT, TRIPS- MVESTE, MVESTL, etc. Methodology: Estimation Based on Data Assimilation Techniques Problem Types: Dynamic Matrices and Large Static Matrices Supporting Programs: Highway, Avenue, any dynamic assignment program

Maximum Likelihood Method (Analyst) The maximum likelihood method applies a statistical model utilizing maximum likelihood estimators (MLE’s) to a given data set (e.g. observed counts) to provide estimates for model parameters (e.g. OD matrix) that maximize a likelihood function. “How likely is it that our OD matrix matches the observed count data”

Data Assimilation (Analyst Drive) In Data Assimilation, we seek to minimize the squared error between a forecasted analysis (e.g. simulated volume) and a set of data observations (e.g. traffic counts) weighted by the reliability of the observations (e.g. confidence values), while simultaneously seeking to preserve a defined structure within our background solution (e.g. Input OD matrix).

Cube Analyst vs. Cube Analyst Drive: Mathematical Models Analyst Applies the maximum likelihood method. Solution estimates obtained via likelihood functions represented by PDF’s Essentially minimizes a sum of logarithmic function terms representing screenline counts, trip origins, trip destinations and prior matrix along with a term for the derived cost matrix Derivation and details available in the Cube Analyst Reference Guide Analyst Drive’s primary objective aims to minimize the distance between the simulated volume AX and the count data b… i.e we wish to find a solution where AX-b=0 Secondary objective: Preserve the background matrix structure

Cube Analyst Drive: Static Estimation Single Class Estimation Problem (Multiple Class in Parallel)

Cube Analyst vs. Cube Analyst Drive: Multiple Static User Classes Multiclass static and dynamic estimation capabilities Static capabilities allow users to estimate multiple classes from fewer count sets New file format allows for multiple user class data to be stored in a single ICP file, requiring only a single Highway run. Analyst does not allow the user to estimate Multiple User Classes Must perform a separate Analyst run for each user class May require multiple Highway runs to produce individual files (e.g. ICP) for each class

Estimating Multiple User Classes with Fewer Count Sets in Analyst Drive Screenline Preprocessing Aggregate Estimation Given multiple matrices to estimate with fewer count sets, Analyst Drive will preprocess the input screenline files to produce an individual count set for each, and run the single class estimation problem in parallel Numerically Superior (more data points) Requires The assumption that the proportionality of class counts does not change with the evolving matrices Volume contributions from multiple class sets are summed and matched against a single count set Numerically Inferior (fewer data points) More theoretically sound as it does not require any assumptions on end behavior

Cube Analyst Drive: Static Estimation Aggregate Multiple Class Estimation Problem

Cube Analyst Drive: Dynamic Estimation Single Class Estimation Problem (Multiple Class in Parallel)

Differences In The Use Of Confidence Values Analyst Analyst Drive Uses confidence values to statistically provide value ranges for an estimated entry. → Less Reliable Data Provides for a larger variance and hence a wider range of possible values. Uses Confidences values to determine the weight a particular value contributes to the problem. → Less Reliable data contributes less to the cost function, and hence is not as important in the estimation.

Cube Analyst vs. Cube Analyst Drive: Numerical Methods Objective Function is minimized using the quasi-Newton BFGS method which requires an approximation to the Hessian matrix (expensive to compute, large storage penalty) Sequential Computation Objective Function is minimized via a conjugate gradient method which does not require a Hessian matrix Parallel Computation at various levels

Cube Analyst vs. Cube Analyst Drive: Capabilities Static Matrix Estimation with Highway X Static Matrix Estimation with PT Static Matrix Estimation for Large Networks (> 5000 Zones) Dynamic Matrix Estimation with Avenue Dynamic Matrix Estimation for any dynamic traffic assignment program Estimation with static screenline counts Estimation with static turning counts Estimation with dynamic screenline counts

Cube Analyst vs. Cube Analyst Drive: Capabilities Cont. Estimation with dynamic turning counts X Estimation with part trip matrices Simultaneous Estimation of Multiple User Classes Aggregate Multi-Class Static Estimation Static screenline count processing for estimating multiple user classes simultaneously from fewer sets of count data Available 64 bit program with no inherent memory constraints Multi-threaded for increased performance Advanced parallel computing capabilities for high performance clusters using the message passing interface (MPI)

Analyst vs. Analyst Drive: Files Analyst Drive does not accept all files that the original Analyst does New Format ICP files developed for Analyst Drive allow for multiple user classes and provide up to 80% space savings Matrix files contain multiple tabs for multiple user classes (static) and multiple time periods (dynamic)

Cube Analyst vs. Cube Analyst Drive: Block Diagrams

Cube File Compatibility Cube/Voyager Files Analyst Analyst Drive Cube Matrix X Highway Old Format ICP Highway New Format ICP Highway RCP Highway Screenline PT ICP PT Lines PT Screenline Voyager Path Network Network File Avenue Packet Log Static Turning counts Static Trip Ends

Cube File Compatibility Cont.: Analyst Specific Files Analyst Drive Model Params X Grad Search Dstr Defn Binr Local Mat Cntr Coordinates

Cube File Compatibility Cont.: Analyst Drive Specific Files Dynamic Screenline X Dynamic Turning counts GEN DYN FILES

So How Do The Solutions Compare? Both Analyst and Analyst Drive should produce similar solutions, but will not be identical. Because Analyst is statistics based, it is able to provide a solution within a range of values (similar to a confidence interval) w/ sensitivities. Analyst Drive is better able to preserve the structure of the background matrix.

Analyst vs. Analyst Drive Use Summary Good for small static problems with a wide range of input options Public Transit Networks. Maximum Likelihood Method: Better at using poor input to generate reasonable matrices ‘from scratch’. Dynamic Matrices Large Static Matrices Multiple Class Estimation FAST! Data Assimilation Method: Better at producing reliable matrices which preserve the defining structure of the background matrix.

Cube Analyst Drive Reference Guide - Version 1.01.12 References Cube Analyst Drive Reference Guide - Version 1.01.12 Cube Analyst Reference Guide – Version 6.1.0

Where to Find Reference Guides

Questions?