PDSF Computing model Thomas Davis ASG/NERSC, LBNL LCCWS.

Slides:



Advertisements
Similar presentations
Buffers & Spoolers J L Martin Think about it… All I/O is relatively slow. For most of us, input by typing is painfully slow. From the CPUs point.
Advertisements

User Documentation.  You cannot build a system for a client and leave them without adequate documentation  Computer systems are complex, require configuration.
Device Tradeoffs Greg Stitt ECE Department University of Florida.
DB-13: Database Health Checks How to tell if you’re heading for The Wall Richard Shulman Principal Support Engineer.
Beowulf Supercomputer System Lee, Jung won CS843.
Computer Basics  A computer is an electronic machine that takes information, processes it,and stores it.  Computers are made up of hardware ( monitor,
OPNET Technologies, Inc. Performance versus Cost in a Cloud Computing Environment Yiping Ding OPNET Technologies, Inc. © 2009 OPNET Technologies, Inc.
September 4, 2014 Using National Cyberinfrastructure Tom Doak Carrie Ganote National Center for Genome Analysis Support.
Overview of Midrange Computing Resources at LBNL Gary Jung March 26, 2002.
Cloud Computing PRESENTED BY- Rajat Dixit (rd2392)
Scheduling under LCG at RAL UK HEP Sysman, Manchester 11th November 2004 Steve Traylen
SUMS Storage Requirement 250 TB fixed disk cache 130 TB annual increment for permanently on- line data 100 TB work area (not controlled by SUMS) 2 PB near-line.
MIS 175 Spring Learning Objectives When you finish this chapter, you will: –Recognize major components of an electronic computer. –Understand how.
Research Computing with Newton Gerald Ragghianti Nov. 12, 2010.
Presenter MaxAcademy Lecture Series – V1.0, September 2011 Introduction and Motivation.
Upgrade Strategy. Audit YYou can’t always start from scratch with a new system. This in not cost effective or wise. Therefore you should do an audit.
PC Construction and Maintenance Week 9 Review of PC concepts Key Points.
1.Training and education 2.Consulting 3.Travel 4.Hardware 5.Software Which of the following is not included in a firm’s IT infrastructure investments?
1 The Virtual Reality Virtualization both inside and outside of the cloud Mike Furgal Director – Managed Database Services BravePoint.
Installment Plans and Stocks Causes of the Great Depression.
Day 10 Hardware Fault Tolerance RAID. High availability All servers should be on UPSs –2 Types Smart UPS –Serial cable connects from UPS to computer.
Operating systems CHAPTER 7.
High-Performance Computing 12.1: Concurrent Processing.
Windows 2000 Advanced Server and Clustering Prepared by: Tetsu Nagayama Russ Smith Dale Pena.
CONSUMERS AND DEMAND. A. The Law of Demand 1. Demand = the amount of a good or service that consumers are willing and able to buy at different prices.
NSTXpool Computer Upgrade WP #1685 Bill Davis December 9, 2010.
Previously Fetch execute cycle Pipelining and others forms of parallelism Basic architecture This week we going to consider further some of the principles.
Planning and Designing Server Virtualisation.
VirtualBox What you need to know to build a Virtual Machine.
CASPUR Site Report Andrei Maslennikov Lead - Systems Karlsruhe, May 2005.
1 Computer and Network Bottlenecks Author: Rodger Burgess 27th October 2008 © Copyright reserved.
© CCI Learning Solutions Inc. 1 Lesson 6: Buying a Computer Hardware considerations Software considerations Price considerations Support or service considerations.
Prepared By : Bhavin Tank(S.Y.B.Sc.(IT)) College of Computer Science & IT, Junagadh Cloud Computing.
SLAC Site Report Chuck Boeheim Assistant Director, SLAC Computing Services.
JLab Scientific Computing: Theory HPC & Experimental Physics Thomas Jefferson National Accelerator Facility Newport News, VA Sandy Philpott.
9 February 2000CHEP2000 Paper 3681 CDF Data Handling: Resource Management and Tests E.Buckley-Geer, S.Lammel, F.Ratnikov, T.Watts Hardware and Resources.
Chapter 2 Turning Data into Something You Can Use © The McGraw-Hill Companies, Inc., 2000 Processing Hardware.
Chapter 4 Information Technology in Business: Hardware.
JLAB Computing Facilities Development Ian Bird Jefferson Lab 2 November 2001.
Cosc 4750 Maintenance & Analysis. Maintenance Contracts Annual cost of 10%-12% of component’s list price. On-site maintenance –usually within hours.
STAR Off-line Computing Capabilities at LBNL/NERSC Doug Olson, LBNL STAR Collaboration Meeting 2 August 1999, BNL.
Installing, running, and maintaining large Linux Clusters at CERN Thorsten Kleinwort CERN-IT/FIO CHEP
GISt lunch meeting OTB Research Institute for Housing, Urban and Mobility Studies Writing a DBMS buyers guide Wim de Haas Wilko Quak Based.
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Randy MelenApril 14, Stanford Linear Accelerator Center Site Report April 1999 Randy Melen SLAC Computing Services/Systems HPC Team Leader.
Chapter VI What should I know about the sizes and speeds of computers?
Computer System Support   The costs of installing, operating and maintaining computer systems are called total costs o ownership or TCO  TCO includes.
PDSF and the Alvarez Clusters Presented by Shane Canon, NERSC/PDSF
Abstract Increases in CPU and memory will be wasted if not matched by similar performance in I/O SLED vs. RAID 5 levels of RAID and respective cost/performance.
SYSTEMSDESIGNANALYSIS 1 Chapter 13 The Systems Proposal Jerry Post Copyright © 1997.
CIP HPC CIP - HPC HPC = High Performance Computer It’s not a regular computer, it’s bigger, faster, more powerful, and more.
Tackling I/O Issues 1 David Race 16 March 2010.
Western Tier 2 Site at SLAC Wei Yang US ATLAS Tier 2 Workshop Harvard University August 17-18, 2006.
LBNL/NERSC/PDSF Site Report for HEPiX Catania, Italy April 17, 2002 by Cary Whitney
Thomas Baus Senior Sales Consultant Oracle/SAP Global Technology Center Mail: Phone:
The ‘stuff you can touch’. Evolution of Computer Hardware Abacus counting machines Babbage’s difference engine Hollerith’s tabulating machine ABC computer.
System Models Advanced Operating Systems Nael Abu-halaweh.
Operating Systems Shannon Gibson. What is an Operating System?  An operating system is the most important software that runs on a computer.
1 OF 17 INFORMATION TECHNOLOGY CAPITAL PLANNING FOR YOUR ENTERPRISE Steven Carpenter 14 October 2006.
Integration Lower sums Upper sums
Installation 1. Installation Sources
Green cloud computing 2 Cs 595 Lecture 15.
3.2 Virtualisation.
Computer Architecture
By Brandon, Ben, and Lee Parallel Computing.
Computer Services Business challenge
Designing a PC Farm to Simultaneously Process Separate Computations Through Different Network Topologies Patrick Dreher MIT.
Computer System.
Presentation transcript:

PDSF Computing model Thomas Davis ASG/NERSC, LBNL LCCWS

Contents " Why PDSF at a supercomputing center? " Short history of PDSF " What does PDSF do? " How do I get access to PDSF? " Hardware " Software

Why PDSF at a Supercomputing Center? " Sharing of resources – 24x7x365 glass house. – Can leverage some vendors expertise. – Access to expertise in data management, networking, and other important services. – Large cluster could be the next HPC system; so PDSF provides production experience with clusters.

Short history of PDSF " Came from SSC, crated and moved to Livermore. " No new hardware, software for close to 8 years. " Life support applied when NERSC moved to Berkeley from Livermore in 1997 " Has picked up experimental support ever since, with what appears to be a 1000% growth rate.

How do I get PDSF access? " Buy in service model. Clients pay for a fixed slices of CPU time/disk space. – Over time, as cluster grows, the amount of disk space and cpu time drops. Moore's law is in effect. " Client has no direct ownership of cluster. PDSF admins have the right to move resources around when needed. " Client can get more CPU power than what they buy – if no one else is using CPU's in cluster, then resources are available to other users. – LSF Fairshare is used to arbitrate between clients. " Equipment is retired based on Moore's law, and warranties.

PDSF Hardware model " CPU power is always moving. – We expect to retire any Intel based system after 3 years. " Disk size is climbing even faster; new drives replace old disks, increasing volume size and performance. " Memory constraints can also force retirement of systems.

Hardware, Continued. " New hardware is always bought as late as possible. " CPU speeds are based on the knee of the curve; 2x650mhz machines are better than 1x1Ghz machine. " Memory is also bought based on size; always buy largest dimm's when possible.

Software model. " Platform LSF is used for batch queuing. – Fought for better pricing.. but Platform still wants lots of money. " Redhat 6.1 is software base. – Only updated when needed; many experiments can't change. " Control of software is vigorous; because PDSF has many experiments, any software changes are controlled. " Opensource is preferred, but properiety software is acceptable.