Evaluating current processors performance and machines stability R. Esposito 2, P. Mastroserio 2, F. Taurino 2,1, G. Tortone 2 1 INFM, Sez. di Napoli,


Evaluating current processors performance and machines stability

R. Esposito², P. Mastroserio², F. Taurino²,¹, G. Tortone²
¹ INFM, Sez. di Napoli, Italy
² INFN, Sez. di Napoli, Italy

Benchmarks and Stress Tests

Accurately estimating the performance of currently available processors is becoming a key activity, particularly in the HENP environment, where high computing power is crucial. This document describes the methods and programs, open source or freeware, used to benchmark processors, memory and disk subsystems, and network connection architectures. These tools are also useful for stress testing new machines before their acquisition or before their introduction into a production environment, where high uptime is required. The "benchmarking suite" shown in this poster consists of free applications used to evaluate and stress test each machine subsystem: CPU, memory hierarchy, disks and network.

CPU and Memory

GLIBENCH: This tool runs Dhrystone (MIPS), Whetstone (MFLOPS), matrix operation, number crunching, floating-point and memory throughput tests.

NBENCH: Based on beta release 2 of BYTE Magazine's BYTEmark benchmark program (previously known as BYTE's Native Mode Benchmarks), it runs 10 tests to compare the running machine with an AMD reference machine. It returns three indexes: Memory, Integer and Floating-point.

BYTEBENCH: Used to test a *nix machine in different ways. It runs arithmetic tests and system tests such as process spawning and context switching.

LMBENCH: A series of micro-benchmarks intended to measure basic operating system and hardware metrics. The benchmarks fall into three general classes: bandwidth, latency, and "other".

UBENCH: Ubench runs rather senseless integer and floating-point calculations for 3 minutes, concurrently using several processes; the result is the Ubench CPU benchmark.
It then runs rather senseless memory allocation and memory-to-memory copy operations for another 3 minutes, concurrently using several processes; the result is the Ubench MEM benchmark.

MEMPERF: It measures memory bandwidth in two dimensions. First, it varies the block size, which shows the throughput at different levels of the memory hierarchy (different cache levels). Second, it varies the access pattern from contiguous blocks to different strided accesses.

STREAM: The STREAM benchmark is a simple synthetic benchmark program that measures sustainable memory bandwidth (in MB/s) and the corresponding computation rate for simple vector kernels.

LLCBENCH: It groups three benchmarks: BlasBench, to test BLAS routines; CacheBench, to test cache memory; and MPBench, to test MPI implementations.

APFLOAT: A high-performance arbitrary-precision package that can be used to perform calculations involving millions of digits.

POVRAY: This well-known program renders three-dimensional graphics, using standard, Athlon-optimized or Pentium-optimized binaries.

Disks and Network

BONNIE++: A modified version of Bonnie, which creates, reads, writes and deletes very large files.

IOZONE: This benchmark generates and measures a variety of file operations: read, write, re-read, re-write, read backwards, read strided, fread, fwrite, random read/write, pread/pwrite variants, aio_read, aio_write and mmap.

NETPERF: It provides tests for both unidirectional throughput and end-to-end latency over TCP and UDP sockets.

NETPIPE: This tool can benchmark network communications over non-standard hardware, such as the high-speed interconnects used in cluster environments.

PALLAS: A complex benchmark suite used to evaluate MPI performance. It provides a concise set of benchmarks targeted at measuring the most important MPI functions.

Example of GLIBENCH results: Athlon vs Xeon 2.2 GHz
Example of NETPIPE and PALLAS results on an Intel epro100 Fast Ethernet NIC
Example of results obtained by NBENCH, STREAM and MEMPERF
Example of results obtained by POVRAY and APFLOAT on a dual Athlon and a dual Xeon 2.2 GHz

Conclusions

This set of benchmarks allows us to accurately characterize the raw performance of available machines. Although many commercial and free benchmark tools are currently available, we chose the ones shown in this poster because, in our experience, they give a satisfactory performance analysis of each hardware component. Furthermore, this suite of benchmarks has proven to be a valid tool for stress testing machines before starting production activities.

CHEP03 - March 24-28, La Jolla, California
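
The STREAM entry above describes measuring sustainable memory bandwidth with simple vector kernels. As an illustration only, the following sketch times one such kernel, the triad a[i] = b[i] + q*c[i], and converts the best time into MB/s by counting two reads and one write of an 8-byte value per element. It is not the STREAM source: the function names, array size and trial count are hypothetical choices, and because the loop runs in the Python interpreter the figure it reports mostly reflects interpreter overhead, not the memory system.

```python
import time
from array import array


def triad(a, b, c, q):
    """STREAM-style triad kernel: a[i] = b[i] + q * c[i]."""
    for i in range(len(a)):
        a[i] = b[i] + q * c[i]


def triad_bandwidth_mb_s(n=200_000, trials=5, q=3.0):
    """Return (best rate in MB/s, result array) for the triad kernel.

    n, trials and q are illustrative defaults, not values from the poster.
    """
    a = array("d", [0.0]) * n
    b = array("d", [1.0]) * n
    c = array("d", [2.0]) * n

    # Per element: read b[i], read c[i], write a[i].
    bytes_moved = 3 * n * a.itemsize

    best = float("inf")
    for _ in range(trials):
        start = time.perf_counter()
        triad(a, b, c, q)
        best = min(best, time.perf_counter() - start)

    # Rate in MB/s, with 1 MB taken as 10**6 bytes.
    return bytes_moved / best / 1e6, a


if __name__ == "__main__":
    rate, _ = triad_bandwidth_mb_s()
    print(f"triad: {rate:.1f} MB/s (interpreter-bound, illustration only)")
```

Taking the minimum time over several trials reduces the influence of one-off disturbances such as scheduling noise; a compiled kernel on arrays much larger than the last-level cache would be needed for numbers comparable to those on the poster.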