Gruppo Server CCR michele.michelotto at pd.infn.it

Slides:

Advertisements

Similar presentations

Concurrent programming: From theory to practice Concurrent Algorithms 2014 Vasileios Trigonakis Georgios Chatzopoulos.

Advertisements

Hepmark project Evaluation of HEP worker nodes Michele Michelotto at pd.infn.it.

OPTERON (Advanced Micro Devices). History of the Opteron AMD's server & workstation processor line 2003: Original Opteron released o 32 & 64 bit processing.

INTEL COREI3 INTEL COREI5 INTEL COREI7 Maryam Zeb Roll#52 GFCW Peshawar.

A comparison of HEP code with SPEC benchmark on multicore worker nodes HEPiX Benchmarking Group Michele Michelotto at pd.infn.it.

HS06 on the last generation of CPU for HEP server farm Michele Michelotto 1.

Moving out of SI2K How INFN is moving out of SI2K as a benchmark for Worker Nodes performance evaluation Michele Michelotto at pd.infn.it.

111 *Other names and brands may be claimed as the property of others Q Sell Up Guide Intel ® Core™ i7 (Bloomfield) vs. Lynnfield Positioning Intel.

COMPUTER ARCHITECTURE

Transition to a new CPU benchmark on behalf of the “GDB benchmarking WG”: HEPIX: Manfred Alef, Helge Meinhard, Michelle Michelotto Experiments: Peter Hristov,

Microprocessors SUBTITLE Team 3: David Meadows David Foster Sichao Ni Khareem Gordon.

Basic Computer Structure and Knowledge Project Work.

Different CPUs CLICK THE SPINNING COMPUTER TO MOVE ON.

Computer Performance Computer Engineering Department.

Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 27 – A Brief History of the Microprocessor.

High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.

History of Microprocessor MPIntroductionData BusAddress Bus

4 Dec 2006 Testing the machine (X7DBE-X) with 6 D-RORCs 1 Evaluation of the LDC Computing Platform for Point 2 SuperMicro X7DBE-X Andrey Shevel CERN PH-AID.

HS06 on new CPU, KVM virtual machines and commercial cloud Michele Michelotto 1.

Fast Benchmark Michele Michelotto – INFN Padova Manfred Alef – GridKa Karlsruhe 1.

Computer Architecture By Chris Van Horn. CPU Basics “Brains of the Computer” Fetch Execute Cycle Instruction Branching.

Hyper Threading Technology. Introduction Hyper-threading is a technology developed by Intel Corporation for it’s Xeon processors with a 533 MHz system.

Alpha Supplement CS 740 Oct. 14, 1998

HS06 on last generation of HEP worker nodes Berkeley, Hepix Fall ‘09 INFN - Padova michele.michelotto at pd.infn.it.

Processors with Hyper-Threading and AliRoot performance Jiří Chudoba FZÚ, Prague.

Chap 4: Processors Mainly manufactured by Intel and AMD Important features of Processors: Processor Speed (900MHz, 3.2 GHz) Multiprocessing Capabilities.

HS06 performance per watt and transition to SL6 Michele Michelotto – INFN Padova 1.

HEPMARK2 Consiglio di Sezione 9 Luglio 2012 Michele Michelotto - Padova.

PROCESSOR Ambika | shravani | namrata | saurabh | soumen.

I7’s Core. Intel’s Core i7 Content Overview Socket SSE 4.2 Instruction Set Cores –Intel Quickpath Interconnect –Nehalem - new micro-architecture –EP,

From Westmere to Magny-cours: Hep-Spec06 Cornell U. - Hepix Fall‘10 INFN - Padova michele.michelotto at pd.infn.it.

Lab Activities 1, 2. Some of the Lab Server Specifications CPU: 2 Quad(4) Core Intel Xeon 5400 processors CPU Speed: 2.5 GHz Cache : Each 2 cores share.

Programming Multi-Core Processors based Embedded Systems A Hands-On Experience on Cavium Octeon based Platforms Lab Exercises: Lab 1 (Performance measurement)

Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating.

New CPU, new arch, KVM and commercial cloud Michele Michelotto 1.

The last generation of CPU processor for server farm. New challenges Michele Michelotto 1.

© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice ProLiant G5 to G6 Processor Positioning.

PASTA 2010 CPU, Disk in 2010 and beyond m. michelotto.

Hardware Architecture

HPC/HTC vs. Cloud Benchmarking An empirical evaluation of the performance and cost implications Kashif Iqbal - PhD ICHEC, NUI Galway,

Moving out of SI2K How INFN is moving out of SI2K as a benchmark for Worker Nodes performance evaluation Michele Michelotto at pd.infn.it.

CERN IT Department CH-1211 Genève 23 Switzerland t IHEPCCC/HEPiX benchmarking WG Helge Meinhard / CERN-IT Grid Deployment Board 09 January.

Sobolev(+Node 6, 7) Showcase +K20m GPU Accelerator.

G058 - Lecture XX Example Upgrade Mr C Johnston ICT Teacher

1 ECE 734 Final Project Presentation Fall 2000 By Manoj Geo Varghese MMX Technology: An Optimization Outlook.

SI2K and beyond Michele Michelotto – INFN Padova CCR – Frascati 2007, May 30th.

Parallel Computers Today LANL / IBM Roadrunner > 1 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating point.

Benchmarking of CPU models for HEP application

Measuring Performance II and Logic Design

Intel and AMD processors

CPU Central Processing Unit

Brief introduction about “Grid at LNS”

Microprocessor Microarchitecture Introduction

Evaluation of HEP worker nodes Michele Michelotto at pd.infn.it

CCR Autunno 2008 Gruppo Server

Worker Node per HEP – Technology update

How to benchmark an HEP worker node

Low Power processors in HEP

Gruppo Server CCR michele.michelotto at pd.infn.it

How INFN is moving out of SI2K has a benchmark for Worker Nodes

Geant4 MT Performance Soon Yung Jun (Fermilab)

Hot Processors Of Today

Transition to a new CPU benchmark

The Parallel Revolution Has Started: Are You Part of the Solution or Part of the Problem? Dave Patterson Parallel Computing Laboratory (Par Lab) & Reliable.

CERN Benchmarking Cluster

Introduction to Microprocessors

INFN - Padova michele.michelotto at pd.infn.it

Unit 2 Computer Systems HND in Computing and Systems Development

Parallel Computers Today

Types of Computers Mainframe/Server

Presentation transcript:

Gruppo Server CCR michele.michelotto at pd.infn.it Status Report

michele michelotto - INFN Padova All_cpp SPECint2006 (12 applications) Well established, published values available HEP applications are mostly integer calculations Correlations with experiment applications shown to be fine SPECfp2006 (17 applications) SPECall_cpp2006 (7 applications) Exactly as easy to run as is SPECint2006 or SPECfp2006 No published values (not necessarily a drawback) Takes about 6 h (SPECint2006 or SPECfp2006 are about 24 h) Best modelling of FP contribution to HEP applications Important memory footprint Proposal to WLCG to adopt SPECall_cpp 2006, in parallel and call it HEP SPEC06 CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova SPEC CPP 471.omnetpp 473.astar 483.xalancbmk 444.amd 447.dealII 450.soplex 453.povray Integer tests Floating Point tests CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova ccp_all on FP CCR 03/09 michele michelotto - INFN Padova

Relative performances CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Hep-Spec06 Machine SPEC2000 SPEC2006 int 32 SPEC2006 fp 32 SPEC2006 CPP 32 lxbench01 1501 11.06 9.5 10.24 lxbench02 1495 10.09 7.7 9.63 lxbench03 4133 28.76 25.23 28.03 lxbench04 5675 36.77 27.85 35.28 lxbench05 6181 39.39 29.72 38.21 lxbench06 4569 31.44 27.82 31.67 lxbench07 9462 60.89 43.47 57.52 lxbench08 10556 64.78 46.48 60.76 CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Conversion factor Choose an approssimate conversion factor (~5%) Giving more weight to modern processor We choose “4” to stress that we don’t care precision but easiness of portability To validate we measured the whole GridKa and found it ok CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova More cpu CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova CPU Outlook Intel Xeon Harpertown 54xx AMD Opteron Shanghai Intel Xeon Nehalem 55xx CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Harpertwon Cache L1 32+32KB Cache L2 3MB/core Xeon 5420: 2.0 Ghz  Xeon 5460: 3.3 GHZ FSB 1333 MHz Xeon 5472: 3.0 Ghz  Xeon 5492: 3.4 GHZ FSB: 1666 MHz CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Shanghai Opterpn 2376 2.3 GHz  2384SE 2.7 GHz 45 nm process: 75W SE 105W 2.8 Ghz HE 2344 1.7GHz  HE2376 2.3 GHz Cache L1 64+64 KB Cache L2 0.5MB/core Cache L3 6MB shared (cfr. 2MB Barcelona) CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Nehalem “gainestown” 45 nm Cache L1 32+32 KB Cache L2 256KB/core Cache L3 3MB shared 80W: E5502 1.86 GHz  E5540 2.53 GHz 95W: X5550 2.66 GHz  X5570 2.93 GHz 29 Marzo 2009? Dual Thread, Turbo Mode CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Nehalem: 3 DDR3 channels per socket Opteron 2 DDR2 channel Harpertown Through Front Side Bus CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Nehalem memory The standard Worker node 2 socket, 4 core per socket, 2GB per core  We specify 16 GB total memory per WN Nehalem 6 or 12/18 memory slot (at least 6 slots filled?) 2, 4 or 8 GB per core Total memory: 12, 24 or 48 GB Do we need to double memory to run two thread per core? CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova Core i7 Single processor, 4 core, 8 logical cpu with Thread enabled 8 MB L3 cache 920 2.66 GHz  975 3.33 GHz Back to single socket WN? CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova 1 Corei7 vs 2x2384 CCR 03/09 michele michelotto - INFN Padova

michele michelotto - INFN Padova CCR 03/09 michele michelotto - INFN Padova