EVLA Data Processing PDR Scale of processing needs Tim Cornwell, NRAO.

Slides:



Advertisements
Similar presentations
Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Advertisements

Mr Greenhalgh S4 Computing Int 1 Things you could do with knowing before the Exam…
National Radio Astronomy Observatory June 13/14, 2005 EVLA Phase II Proposal Review EVLA Phase II Computing Development Bryan Butler (EVLA System Engineer.
IBM 1350 Cluster Expansion Doug Johnson Senior Systems Developer.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Disk and Tape Storage Cost Models Richard Moore & David Minor San Diego Supercomputer.
Chapter 1 Introduction 1.1A Brief Overview - Parallel Databases and Grid Databases 1.2Parallel Query Processing: Motivations 1.3Parallel Query Processing:
IBM RS/6000 SP POWER3 SMP Jari Jokinen Pekka Laurila.
Parallel Algorithms - Introduction Advanced Algorithms & Data Structures Lecture Theme 11 Prof. Dr. Th. Ottmann Summer Semester 2006.
Distributed Processing of Future Radio Astronomical Observations Ger van Diepen ASTRON, Dwingeloo ATNF, Sydney.
Computer Systems CS208. Major Components of a Computer System Processor (CPU) Runs program instructions Main Memory Storage for running programs and current.
Chapter 1 Sections 1.1 – 1.3 Dr. Iyad F. Jafar Introduction.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
* Definition of -RAM (random access memory) :- -RAM is the place in a computer where the operating system, application programs & data in current use.
Prepared by Careene McCallum-Rodney Hardware specification of a computer system.
CS246 Search Engine Scale. Junghoo "John" Cho (UCLA Computer Science) 2 High-Level Architecture  Major modules for a search engine? 1. Crawler  Page.
COMPUTER CONCEPTS.
How Computers Work. A computer is a machine f or the storage and processing of information. Computers consist of hardware (what you can touch) and software.
Department of Computer and Information Science, School of Science, IUPUI Dale Roberts, Lecturer Computer Science, IUPUI CSCI.
1 CHAPTER 2 COMPUTER HARDWARE. 2 The Significance of Hardware  Pace of hardware development is extremely fast. Keeping up requires a basic understanding.
ScotGrid: a Prototype Tier-2 Centre – Steve Thorn, Edinburgh University SCOTGRID: A PROTOTYPE TIER-2 CENTRE Steve Thorn Authors: A. Earl, P. Clark, S.
Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.
Planning and Designing Server Virtualisation.
Hardware Trends. Contents Memory Hard Disks Processors Network Accessories Future.
Tim CornwellEVLA System PDR Dec 5-6, EVLA: Data Management EVLA sub-contracts data management to NRAO Data Management group –End-to-end processing.
S.T. MyersEVLA Advisory Committee Meeting September 6-7, 2007 EVLA Algorithm Research & Development Steven T. Myers (NRAO) CASA Project Scientist with.
ALMA Software B.E. Glendenning (NRAO). 2 ALMA “High Frequency VLA” in Chile Presently a European/North American Project –Japan is almost certainly joining.
Click to add text Introduction to the new mainframe: Large-Scale Commercial Computing © Copyright IBM Corp., All rights reserved. Chapter 2: Capacity.
1 Buses and types of computer. Paul Strickland Liverpool John Moores University.
Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH OS X Home server AFS using openafs 3 DB servers Kerberos 4 we will move.
ARGONNE NATIONAL LABORATORY Climate Modeling on the Jazz Linux Cluster at ANL John Taylor Mathematics and Computer Science & Environmental Research Divisions.
ALMA Archive Operations Impact on the ARC Facilities.
EVLA Transition to Science Operations: An Overview EVLA Advisory Committee Meeting, March 19-20, 2009 Bob Dickman AD, NM Operations.
EVLA Software Bryan Butler. 2007May22EVLA SAGE Meeting2 Requirements and Goals of EVLA Software Maximize scientific throughput of the instrument At a.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Facilities and How They Are Used ORNL/Probe Randy Burris Dan Million – facility administrator.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Computer Organization & Assembly Language © by DR. M. Amer.
Sanjay BhatnagarEVLA Advisory Committee Meeting May 8-9, EVLA Algorithm Research & Development Progress & Plans Sanjay Bhatnagar CASA/EVLA.
Easy Deployment of the WRF Model on Heterogeneous PC Systems Braden Ward and Shing Yoh Union, New Jersey.
System Scalability. 1. General Observations The choice of platform for an application should consider the ability to grow the application with more users.
Tim CornwellEVLA Advisory May 10-11, EVLA: Data Management EVLA has sub-contracted data management to NRAO Data Management group End-to-end processing.
Recent progress in EVLA-specific algorithms EVLA Advisory Committee Meeting, March 19-20, 2009 S. Bhatnagar and U. Rau.
Status of the Bologna Computing Farm and GRID related activities Vincenzo M. Vagnoni Thursday, 7 March 2002.
EVLA Data Processing PDR E2E Data Archive System John Benson, NRAO July 18, 2002.
COMPASS Computerized Analysis and Storage Server Iain Last.
Unit C-Hardware & Software1 GNVQ Foundation Unit C Bits & Bytes.
Computer Hardware – Part 2 Basic Components V.T. Raja, Ph.D., Information Management Oregon State University.
Storage Systems CSE 598d, Spring 2007 Lecture ?: Rules of thumb in data engineering Paper by Jim Gray and Prashant Shenoy Feb 15, 2007.
 System Requirements are the prerequisites needed in order for a software or any other resources to execute efficiently.  Most software defines two.
Parallel IO for Cluster Computing Tran, Van Hoai.
Pathway to Petaflops A vendor contribution Philippe Trautmann Business Development Manager HPC & Grid Global Education, Government & Healthcare.
EVLA Data Processing PDR Pipeline design Tim Cornwell, NRAO.
12/19/01MODIS Science Team Meeting1 MODAPS Status and Plans Edward Masuoka, Code 922 MODIS Science Data Support Team NASA’s Goddard Space Flight Center.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Information Technology (IT). Information Technology – technology used to create, store, exchange, and use information in its various forms (business data,
Introduction to Computers - Hardware
Hardware specifications
Berkeley Cluster: Zoom Project
Hardware September 19, 2017.
Architecture & Organization 1
Imaging and Calibration Challenges
How to Install Microsoft Office 2013?
My 5 Minutes of Fame David Maier OGI School of Science & Engineering
Architecture & Organization 1
Some Illustrative Use Cases
Types of Computers Mainframe/Server
CS246 Search Engine Scale.
TeraScale Supernova Initiative
EVLA Algorithm Research & Development
CS246: Search-Engine Scale
Implementation of a small-scale desktop grid computing infrastructure in a commercial domain    
Presentation transcript:

EVLA Data Processing PDR Scale of processing needs Tim Cornwell, NRAO

July , 2002EVLA Data Processing PDRTim Cornwell 2 Background EVLA correlator data rate will ~ 1000 times current correlator data rate Can 2009-era hardware handle the processing load? Can the software handle the processing load?

July , 2002EVLA Data Processing PDRTim Cornwell 3 Scale of EVLA data processing Peak data rate out of correlator backend ~ 25 MB/s Total data volume for Peak 8-hr observation ~ 700GB Floating point operations per float ~ Peak compute rate ~ 5Tflop Average/Peak computing load ~ 0.1 Average compute rate ~ 0.5Tflop Turnaround for 8-hr peak observation ~ 40 minutes Average/Peak data volume ~ 0.1 Data for Average 8-hr observation ~ 70GB Data for Average 1-yr ~ 80TB

July , 2002EVLA Data Processing PDRTim Cornwell 4 Scaling laws in computing “Rules of Thumb” by Gray and Shenoy – Examples: –1 Moore’s law: Things get 4x better every 3 years –2 You need an extra bit of addressing every 18 months –3 Storage capacities increase 100x per decade –4 Storage device throughput increases 10x per decade –7 NearlineTape:OnlineDisk:RAM storage cost ratios are approximately 1:3:300 –8 In ten years RAM will cost what disk costs today –9 A person can administer $1M of disk storage –14 Gilder’s law: Deployed bandwidth triples every year –15 Link bandwidth increases 4x every 3 years

July , 2002EVLA Data Processing PDRTim Cornwell 5 Ops per second per $

July , 2002EVLA Data Processing PDRTim Cornwell 6 Detailed analysis Analyze processing in terms of FFT and Gridding costs Find scaling laws for various types of processing Express in terms of 450MHz Pentium III with Ultra- SCSI disk Use Moore’s Law to scale to e.g –Performance/cost doubles every 18 months Many more details in EVLA Memo 24

July , 2002EVLA Data Processing PDRTim Cornwell 7 Scale of EVLA data processing Typical cost equation where units are in millions of visibilities or pixels

July , 2002EVLA Data Processing PDRTim Cornwell 8 Detailed analysis Analyze processing in terms of FFT and Gridding costs Find scaling laws for various types of processing Express in terms of 450MHz Pentium III with Ultra-SCSI disk Use Moore’s Law to scale to e.g –Performance/cost doubles every 18 months Many more details in EVLA Memo 24

July , 2002EVLA Data Processing PDRTim Cornwell 9 For NRAO…. Assume Moore’s Law holds to 2009 –Moore himself believes this…….. Cost of computing for EVLA –~ 10 – 20 processor parallel machine ~ $100K - $200K (2009) –Archive ~ 50TB per year ~ $5K - $10K (2009) Comparable to computing cost for ALMA Software costs –AIPS++ as-is can do much of the processing –Some development needed for high-end, pipelined processing –Some scientific/algorithmic work e.g. achieving full sensitivity, high dynamic range

July , 2002EVLA Data Processing PDRTim Cornwell 10 For the observer…. Moore’s Law gives ~ 64 fold increase for a desktop –I.e. $nK where n ~ 1-3 Many projects do-able on (2009-era) desktop –e.g km/s velocity range of HI for galaxy –e.g. Mosaic of SGRA West in all H recombination lines between 28 and 41 GHz Larger projects may require parallel machine or many days on a desktop –e.g. Full sensitivity continuum image of full resolution 20cm field –NRAO would provide access over the net

July , 2002EVLA Data Processing PDRTim Cornwell 11 Current activities Installed SAN at the AOC –Increasing disk storage to 4TB Eight processor IBM Netfinity 370 server –Running RedHat Linux –8GB memory –1 TB disk

July , 2002EVLA Data Processing PDRTim Cornwell 12 SAN Detail AOC Storage Area Network Diagram Eight processor, 8GB, 1TB IBM NetFinity running RedHat Linux

July , 2002EVLA Data Processing PDRTim Cornwell 13 Caveats Order of magnitude estimate –Development of computing uncertain Mix of science uncertain Processing requirements could escalate –Historically true –May be especially true for pipeline processing Moore’s Law will throttle EVLA –As it did for the VLA