13 th TRB Transportation Planning Applications Conference, Reno, NV Computational Challenges and Advances in Transportation Computing Andres Rabinowicz.

Slides:



Advertisements
Similar presentations
Feedback Loops Guy Rousseau Atlanta Regional Commission.
Advertisements

Adventures in Transit PathFinding Jim Lam Jian Zhang Howard Slavin Srini Sundaram Andres Rabinowicz Caliper Corporation GIS in Public Transportation September,
Jeannie Wu, Planner Sep  Background  Model Review  Model Function  Model Structure  Transportation System  Model Interface  Model Output.
11 th TRB Transportation Planning Applications Conference, Daytona Beach, FL Achieving Planning Model Convergence Howard Slavin Jonathan Brandon Andres.
New Findings from the Application of Accelerated UE Traffic Assignments Howard Slavin Jonathan Brandon Andres Rabinowicz Paul Ricotta Srini Sundaram Caliper.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
Introduction CSCI 444/544 Operating Systems Fall 2008.
Presented by: Pascal Volet, ing. October 11,2007 Application of Dynameq in Montréal: bridging the gap between regional models and microsimulation Application.
What is the Model??? A Primer on Transportation Demand Forecasting Models Shawn Turner Theo Petritsch Keith Lovan Lisa Aultman-Hall.
16/13/2015 3:30 AM6/13/2015 3:30 AM6/13/2015 3:30 AMIntroduction to Software Development What is a computer? A computer system contains: Central Processing.
TRIP ASSIGNMENT.
Evaluation of Potential ITS Strategies under Non-recurrent Congestion Using Microscopic Simulation Lianyu Chu, University of California, Irvine Henry Liu,
Automatic loading of inputs for Real Time Evacuation Scenario Simulations: evaluation using mesoscopic models Josep M. Aymamí 15th TRB National Transportation.
Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.
Computer performance.
An Empirical Comparison of Microscopic and Mesoscopic Traffic Simulation Paradigms Ramachandran Balakrishna Daniel Morgan Qi Yang Caliper Corporation 14.
Source: NHI course on Travel Demand Forecasting (152054A) Session 10 Traffic (Trip) Assignment Trip Generation Trip Distribution Transit Estimation & Mode.
©2009 Proprietary and Confidential DTA in practice: Modeling dynamic networks in the real world Michael Mahut, Ph.D. INRO Montreal, Canada.
Traffic Assignment Convergence and its Effects on Selecting Network Improvements By Chris Blaschuk, City of Calgary and JD Hunt, University of Calgary.
TPB Models Development Status Report Presentation to the Travel Forecasting Subcommittee Ron Milone National Capital Region Transportation Planning Board.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Calculating Transportation System User Benefits: Interface Challenges between EMME/2 and Summit Principle Author: Jennifer John Senior Transportation Planner.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Computer Performance Computer Engineering Department.
Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.
Increasing Precision in Highway Volume through Adjustment of Stopping Criteria in Traffic Assignment and Number of Feedbacks Behruz Paschai, Kathy Yu,
Development of the Graphical User Interface and Improvement and Streamlining of NYMTC's Best Practice Model Jim Lam, Andres Rabinowicz, Srini Sundaram,
Multi-core architectures. Single-core computer Single-core CPU chip.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
TRANSIMS Version 5 Application Concepts January 20, 2011 David Roden – AECOM.
Multi-Core Architectures
Utilizing Multi-threading, Parallel Processing, and Memory Management Techniques to Improve Transportation Model Performance Jim Lam Andres Rabinowicz.
NTERFACING THE MORPC REGIONAL MODEL WITH DYNAMIC TRAFFIC SIMULATION INTERFACING THE MORPC REGIONAL MODEL WITH DYNAMIC TRAFFIC SIMULATION David Roden (AECOM)
Comparing Dynamic Traffic Assignment Approaches for Planning
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
David B. Roden, Senior Consulting Manager Analysis of Transportation Projects in Northern Virginia TRB Transportation Planning Applications Conference.
1 Introduction to Transportation Systems. 2 PARTIII: TRAVELER TRANSPORTATION.
Incorporating Traffic Operations into Demand Forecasting Model Daniel Ghile, Stephen Gardner 22 nd international EMME Users’ Conference, Portland September.
Hyper Threading Technology. Introduction Hyper-threading is a technology developed by Intel Corporation for it’s Xeon processors with a 533 MHz system.
Planning Applications Conference, Reno, NV, May Impact of Crowding on Rail Ridership: Sydney Metro Experience and Forecasting Approach William Davidson,
Interactive Supercomputing Update IDC HPC User’s Forum, September 2008.
A Dynamic Traffic Simulation Model on Planning Networks Qi Yang Caliper Corporation TRB Planning Application Conference Houston, May 20, 2009.
DVRPC TMIP Peer Review Dynamic Traffic Assignment (DTA) Oct. 29 th, 2014.
Nanco: a large HPC cluster for RBNI (Russell Berrie Nanotechnology Institute) Anne Weill – Zrahia Technion,Computer Center October 2008.
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
TRB Planning Applications May 2009, Houston,TX Changing assignment algorithms: the price of better convergence Michael Florian and Shuguang He INRO.
Methodological Considerations for Integrating Dynamic Traffic Assignment with Activity-Based Models Ramachandran Balakrishna Daniel Morgan Srinivasan Sundaram.
11th TRB National Transportation Planning Applications Conference CORRADINO May 9, Validation of Speeds and Volumes in a Large Regional Model Southeast.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
Application of Accelerated User Equilibrium Traffic Assignments Howard Slavin Jonathan Brandon Andres Rabinowicz Srinivasan Sundaram Caliper Corporation.
Advanced Computer Architecture pg 1 Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8) Henk Corporaal
Travel Demand Forecasting: Traffic Assignment CE331 Transportation Engineering.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
Hybrid Parallel Implementation of The DG Method Advanced Computing Department/ CAAM 03/03/2016 N. Chaabane, B. Riviere, H. Calandra, M. Sekachev, S. Hamlaoui.
Hardware Architecture
Introduction to Mobile-Cloud Computing. What is Mobile Cloud Computing? an infrastructure where both the data storage and processing happen outside of.
Peter Vovsha, Robert Donnelly, Surabhi Gupta pb
Computer Science 2 What’s this course all about?
CS427 Multicore Architecture and Parallel Computing
Enabling machine learning in embedded systems
Chrissy Bernardo, Peter Vovsha, Gaurav Vyas (WSP),
After Evans: Working on an Approximation of a Combined Equilibrium Model Based on Precision Assignment May, 2011 TRB Planning Applications Conference,
Types of Computers Mainframe/Server
Multi-modal Bi-criterion Highway Assignment for Toll Roads Jian Zhang Andres Rabinowicz Jonathan Brandon Caliper Corporation /9/2018.
Jim Lam, Caliper Corporation Guoxiong Huang, SCAG Mark Bradley, BB&C
Presentation transcript:

13 th TRB Transportation Planning Applications Conference, Reno, NV Computational Challenges and Advances in Transportation Computing Andres Rabinowicz Howard Slavin Jonathan Brandon Srini Sundaram Caliper Corporation May 2011

13 th TRB Transportation Planning Applications Conference, Reno, NV Current Computational Burdens are High and are Getting Higher Many more zones for conventional models with 5, ,000 and up being used. Dynamic Traffic Assignments can multiply the computing time needed for static assignments by more than an order of magnitude Numerous complex choice model evaluations Agent-based models with millions of agents are at the core of activity models and traffic microsimulation

13 th TRB Transportation Planning Applications Conference, Reno, NV Computing Demands Keep Increasing Feedback Loops Calibration Runs Large numbers of scenarios Equilibrium Convergence Issues More complex and interdependent choices in disaggregate models of all types But run times need to be acceptable or compromises are made…

13 th TRB Transportation Planning Applications Conference, Reno, NV Progression of Intel Architecture ProcessorYearGFLOPMHZ Pentium Pentium Pro Pentium II Pentium III Pentium 4/Xeon Pentium M Core Duo QuadCore I I Source:

13 th TRB Transportation Planning Applications Conference, Reno, NV

History of the Multi-Core Processor Dual core chips introduced in 2005 Cores are like multiple CPU with some shared components Currently chips with 6-10 physical cores are available PCs can have multiple chips with multiple cores Expectations are that the number of cores per chip will keep growing Speculation is that by 2017, computers will have 100s of cores

13 th TRB Transportation Planning Applications Conference, Reno, NV Multi-core Desktop Computer

13 th TRB Transportation Planning Applications Conference, Reno, NV Hardware Improvements Keep Coming Multi-core chips now in low-end machines Hyper-threading—2 threads per physical core is standard Turbo-boost increases the clock speed when some cores are unused 64-bit hardware commonplace Memory constraints of less importance There are more cores per chip each year, but multiple cores are not used unless the software is designed to use them.

13 th TRB Transportation Planning Applications Conference, Reno, NV Effect of CPU Cores on Traffic Assignment

13 th TRB Transportation Planning Applications Conference, Reno, NV The 10 GFlop Personal Computer 10 4 more power for the money vs xShuttle $4, Cray Y-MP C916 $40,000, Sun HPC10000 $1,000,000

13 th TRB Transportation Planning Applications Conference, Reno, NV GFLOPS ManufacturerProcessorModelModel Number GFLOPs FP64GFLOPs FP32 AMDGPGPUFireStream AMDGPURadeon HD FujitsuCPUSPARC64VII128- IBMCPUPOWER IntelCPUCore 2 DuoE IntelCPUCore 2 ExtremeQX IntelCPUCore 2 QuadQ IntelCPUCore i IntelCPUCore i IntelCPUCore i IntelCPUCore i7980 XE IntelCPUItanium IntelCPUXeonW IntelCPUXeonX nVidiaGPGPUTeslaC nVidiaGPGPUTeslaC nVidiaGPUGeForce

STEPBPM PROCEDURE1996 BASE 2002 BASE 2005 BASE (64 bit) 1CREATE NEW SCENARIO10 min. 1 min. 2RUN HIGHWAY NETWORK BUILDER15 min. 5 min. 2 min. 3NETPREP20 min. 15 min. 11 min. 4HIGHWAY PRESKIMS12 hrs. 4 hrs 52 min. 1 hr 16 min. 5TRANSIT NETWORK DATABASE & SKIMS48 hrs. 18 hrs 35 min. 2 hrs 5 min. 6ACCESSIBILITY INDICIES2 hrs. 20 min. 15 min. 7HOUSEHOLD AUTO JOURNEY (HAJ)1 hrs. 15 min. 7 min. 8 MODE DESTINATION STOPS CHOICE (MDSC)18 hrs. 5 hrs 20 min. 2 hrs 33 min 9TRUCKS/COMMERCIAL VEHICLES MODEL2 hrs. 24 min. 10EXTERNAL MODEL5 min. 1 min. 11 PRE-ASSIGNMENT PROCESSING/TIME OF DAY (PAP)1 hrs. 13 min. 12HIGHWAY ASSIGNMENT16 hrs. 56 min. 13TRANSIT ASSIGNMENT72 hrs. 52 hrs 55 min. 3 hrs 30 min. TOTAL173 hrs. 87 hrs. 11 hrs 30 min. 12

13 th TRB Transportation Planning Applications Conference, Reno, NV Cloud Computing Good for parallel applications and distributed processing Low cost and peak capacity if needed for short periods of time Limited advantages for very large models with heavy data transfer requirements Somewhat slower than optimized hardware environments Potentially more expensive for heavy, continuous computing loads Private clouds can be optimized for demand models

13 th TRB Transportation Planning Applications Conference, Reno, NV Cloud Cost vs. Desktop Cost

13 th TRB Transportation Planning Applications Conference, Reno, NV Cloud Computing(Software as a Service – SaaS) AdvantagesDisadvantages Save Hardware CostRequires Stable Internet Connection Easier MaintenanceData Center crashes … all virtual machines will be affected Easy to DeployData Security Save EnergyCost of Data Transfer and Storage SpaceCost of instances Backup No control over hardware

13 th TRB Transportation Planning Applications Conference, Reno, NV Cloud Computing: Data 1.5 Mbps ModelInput GB Output GB PG County1.2 (1:42) 8.6 (13:17) SANDAG5.5 (7:50) 18 (1 day 3:49) NYMTC11 (16:59) 34 (2 day 4:32) Rolling Stones Get Off Of My Cloud Lyrics (m. jagger/k. richards) I live in an apartment on the ninety-ninth floor of my block And I sit at home looking out the window Imagining the world has stopped Then in flies a guy who's all dressed up like a union jack And says, Iive won five pounds if I have his kind of detergent pack I said, hey! you! get off of my cloud Hey! you! get off of my cloud Hey! you! get off of my cloud Don't hang around cause twos a crowd On my cloud, baby The telephone is ringing I say, hi, it's me. who is it there on the line? A voice says, hi, hello, how are you Well, I guess Im doin fine He says, it's three a.m., there's too much noise Don't you people ever wanna go to bed? Just cause you feel so good, do you have To drive me out of my head? I said, hey! you! get off of my cloud Hey! you! get off of my cloud Hey! you! get off of my cloud Don't hang around cause twos a crowd On my cloud baby I was sick and tired, fed up with this And decided to take a drive downtown It was so very quiet and peaceful There was nobody, not a soul around I laid myself out, I was so tired and I started to dream In the morning the parking tickets were just like A flag stuck on my window screen I said, hey! you! get off of my cloud Hey! you! get off of my cloud Hey! you! get off of my cloud Don't hang around cause twos a crowd On my cloud Hey! you! get off of my cloud Hey! you! get off of my cloud Hey! you! get off of my cloud Don't hang around, baby, twos a crowd

13 th TRB Transportation Planning Applications Conference, Reno, NV Software Strategies for Improved Performance Distributed Processing-locally or in the cloud Multi-threaded shared-memory model computing GPU/APU assisted general computing All of the above in combination Of course, better algorithms and better implementation are always good.

13 th TRB Transportation Planning Applications Conference, Reno, NV Distributed Processing on Multiple Computers Very simple to implement for independent processes such as different time periods Heavy overhead from data transfers Load balancing essential for good performance Requires user management

13 th TRB Transportation Planning Applications Conference, Reno, NV Multi-threading Benefits from shared memory Enables fine-grain parallelism unavailable from distributed processing Requires significant re-engineering of software components Yields enormous upside in performance Has a dark side if not done properly

13 th TRB Transportation Planning Applications Conference, Reno, NV Non-Threaded Program

13 th TRB Transportation Planning Applications Conference, Reno, NV Threaded Program

13 th TRB Transportation Planning Applications Conference, Reno, NV Software Issues in Multi-threading Correctness of computations Consistency of floating point calculations Consistency of results irrespective of the hardware used and the number of cores Data Races Memory Requirements

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading Principal Challenges Data racing Deadlocks Starvation Livelock Order Dependency Computational Overhead Memory Overhead

13 th TRB Transportation Planning Applications Conference, Reno, NV Favorable Candidates for Multi-Threading Network Skims Traffic Assignments Matrix computations Nested Logit and more complex models Certain other activity model components Traffic micro-simulation

13 th TRB Transportation Planning Applications Conference, Reno, NV Running times for selected procedures over time

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Matrix Operations

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Transit Skims Seattle Model: 1091 TAZ; links; 6000 nodes; 850 Routes; Stops

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Nested Logit SCAG Nested logit choice model (HBW) 4109 Zones 12 Modes

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Nested Logit Model

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Traffic Assignment

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Large Scale Simulation 15 min period; 25,000 Trips; 80 square miles (downtown Phoenix and parts of Tempe; 634 miles of Freeway and Arterial links; 447 Signalized Intersections

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading: Large Scale Simulation

13 th TRB Transportation Planning Applications Conference, Reno, NV Threading with CUDA Advanced GPU architectures provide hundreds of light weight threads

13 th TRB Transportation Planning Applications Conference, Reno, NV Multithreading with CUDA Pros More cores than on CPU Designed for Math Includes a Linear Algebra library Cons All data has to be copied Memory is limited All code resides on device

13 th TRB Transportation Planning Applications Conference, Reno, NV CUDA: Spherical Euclidian Distance Matrix Size: 6585 x 6585 on a 6 Core Computer

13 th TRB Transportation Planning Applications Conference, Reno, NV New Frontier Transportation Modeling Problems will absorb any new computing power that we are given Regional High Fidelity Microsimulation faster than real-time for prediction Dynamic O-D Estimation from counts Activity and Tour Re-optimization Full equilibration of activity models Network Traffic Signal Optimization(needed for forecasting the future with traffic simulation)

13 th TRB Transportation Planning Applications Conference, Reno, NV Conclusions Improvement to run times cannot exclusively rely on hardware Software needs to adapt to multi-core chips Multithreading algorithms is more complex and prone to problems than single-threaded ones Cloud computing is not Utopia New GPU computing can further improve run times