FroNtier Stress Tests at Tier-0 Status report Luis Ramos LCG3D Workshop – September 13, 2006.

Slides:



Advertisements
Similar presentations
Copyright © SoftTree Technologies, Inc. DB Tuning Expert.
Advertisements

Concurrent programming: From theory to practice Concurrent Algorithms 2014 Vasileios Trigonakis Georgios Chatzopoulos.
IWR Ideen werden Realität Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Institut für Wissenschaftliches Rechnen Status of Database Services.
GoldenGate Monitoring and Troubleshooting
SKELETON BASED PERFORMANCE PREDICTION ON SHARED NETWORKS Sukhdeep Sodhi Microsoft Corp Jaspal Subhlok University of Houston.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
Capacity Planning and Predicting Growth for Vista Amy Edwards, Ezra Freeloe and George Hernandez University System of Georgia 2007.
Managing Change with Real Application Testing and Snapshot Standby Barry Hodges Senior Solution Architect, Sales Consulting, Oracle NZ.
System Monitoring & Inventory Presented By:- Vipul Patel. Linesh Patel. Bhumika Vyas. Group No:-25.
Module 2: Planning to Install SQL Server. Overview Hardware Installation Considerations SQL Server 2000 Editions Software Installation Considerations.
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
Capacity Planning in SharePoint Capacity Planning Process of evaluating a technology … Deciding … Hardware … Variety of Ways Different Services.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Database monitoring and service validation Dirk Duellmann CERN IT/PSS and 3D
MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design Study Guide (70-443) Chapter 1: Designing the Hardware and Software.
TPB Models Development Status Report Presentation to the Travel Forecasting Subcommittee Ron Milone National Capital Region Transportation Planning Board.
Introduction to HP LoadRunner Getting Familiar with LoadRunner >>>>>>>>>>>>>>>>>>>>>>
LCG 3D StatusDirk Duellmann1 LCG 3D Throughput Tests Scheduled for May - extended until end of June –Use the production database clusters at tier 1 and.
ASGC 1 ASGC Site Status 3D CERN. ASGC 2 Outlines Current activity Hardware and software specifications Configuration issues and experience.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
Slide 1 DESIGN, IMPLEMENTATION, AND PERFORMANCE ANALYSIS OF THE ISCSI PROTOCOL FOR SCSI OVER TCP/IP By Anshul Chadda (Trebia Networks)-Speaker Ashish Palekar.
Boosting Event Building Performance Using Infiniband FDR for CMS Upgrade Andrew Forrest – CERN (PH/CMD) Technology and Instrumentation in Particle Physics.
AUTHORS: STIJN POLFLIET ET. AL. BY: ALI NIKRAVESH Studying Hardware and Software Trade-Offs for a Real-Life Web 2.0 Workload.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
1 Selecting LAN server (Week 3, Monday 9/8/2003) © Abdou Illia, Fall 2003.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
Sensitivity of Cluster File System Access to I/O Server Selection A. Apon, P. Wolinski, and G. Amerson University of Arkansas.
1 Wenguang WangRichard B. Bunt Department of Computer Science University of Saskatchewan November 14, 2000 Simulating DB2 Buffer Pool Management.
1 Specification and Implementation of Dynamic Web Site Benchmarks Sameh Elnikety Department of Computer Science Rice University.
ATLAS Scalability Tests of Tier-1 Database Replicas WLCG Collaboration Workshop (Tier0/Tier1/Tier2) Victoria, British Columbia, Canada September 1-2, 2007.
May PEM status report. O.Bärring 1 PEM status report Large-Scale Cluster Computing Workshop FNAL, May Olof Bärring, CERN.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
Srihari Makineni & Ravi Iyer Communications Technology Lab
Testing… Testing… 1, 2, 3.x... Performance Testing of Pi on NT George Krc Mead Paper.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
1 Database mini workshop: reconstressing athena RECONSTRESSing: stress testing COOL reading of athena reconstruction clients Database mini workshop, CERN.
1 Oracle Enterprise Manager Slides from Dominic Gélinas CIS
The DCS lab. Computer infrastructure Peter Chochula.
Using Heterogeneous Paths for Inter-process Communication in a Distributed System Vimi Puthen Veetil Instructor: Pekka Heikkinen M.Sc.(Tech.) Nokia Siemens.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
LHCb File-Metadata: Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 04 July 2006.
3D Testing and Monitoring Lee Lueking LCG 3D Meeting Sept. 15, 2005.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
PROOF Benchmark on Different Hardware Configurations 1 11/29/2007 Neng Xu, University of Wisconsin-Madison Mengmeng Chen, Annabelle Leung, Bruce Mellado,
Scaling up from local DB to distributed DB Cristiano Bozza European Emulsion Group Nagoya, Jan 2004 Presented by Giuseppe Grella.
PIC port d’informació científica DateText1 November 2009 (Elena Planas) PIC Site review.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
ATLAS FroNTier cache consistency stress testing David Front Weizmann Institute 1September 2009 ATLASFroNTier chache consistency stress testing.
AMGA-Bookkeeping Carmine Cioffi Department of Physics, Oxford University UK Metadata Workshop Oxford, 05 July 2006.
Database Project Milestones (+ few status slides) Dirk Duellmann, CERN IT-PSS (
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Remigius K Mommsen Fermilab CMS Run 2 Event Building.
Database Requirements Updates from LHC Experiments WLCG Grid Deployment Board Meeting CERN, Geneva, Switzerland February 7, 2007 Alexandre Vaniachine (Argonne)
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Abhinav Kamra, Vishal Misra CS Department Columbia University
Database Replication and Monitoring
Lee Lueking WLCG Workshop DB BoF 22 Jan. 2007
Diskpool and cloud storage benchmarks used in IT-DSS
BDII Performance Tests
David Front Weizmann institute May 2007
SAM at CCIN2P3 configuration issues
Conditions Data access using FroNTier Squid cache Server
Moodle Scalability What is Scalability?
Software System Performance
Performance And Scalability In Oracle9i And SQL Server 2000
Presentation transcript:

FroNtier Stress Tests at Tier-0 Status report Luis Ramos LCG3D Workshop – September 13, 2006

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 2/18 Outline 1.Test Plan 2.Test Setup 3.Main Results 4.Conclusions

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 3/18 Objectives Develop a benchmark for Frontier servers –DB schema independent Build a tool that identifies performance bottlenecks of a given setup Performance analysis of the software stack –CORAL / Frontier plugin –Frontier Client –Squid –Frontier Servlet

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 4/18 Test Plan How fast are the individual components? –Database –Application Server –Cache Server –Network Explore performance impact of: –Different data (strucutre, content, size, storage type, compression) –Different caching policies –Different access methods How do DB throughput, network bandwidth, payload size, # of clients or server CPU correlate?

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 5/18 Metrics and Parameters Metrics: –Individual and total throughput –Server errors –CPU consumption/load (clients, frontier server, squid server) –Memory usage and disk space needs –Network bandwidth usage Parameters: –# of client nodes –# of test clients –Payload sizes –Database structure and content –Caching policy

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 6/18 FroNtier Test Setup

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 7/18 Server Test Setup Hardware setup: –1 server running Frontier & Squid: Dual Intel Xeon CPU 2.80GHz 2Gb RAM HD 150Gb Fast Ethernet (100Mbps) –1 Backend Oracle Database 10gR2 (cooldev) Software Setup: –FroNtier v3.3 –Frontier Squid v1.0rc4

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 8/18 Client Test Setup Hardware: –Dedicated lxplus nodes Dual Pentium III 1GHz 500Mb RAM HD 6Gb Fast Ethernet (100Mbps) Software: –CORAL_1_5_3 –FrontierClient v2.5.1

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 9/18 The Test Client C++ CORAL/FrontierPlugin test –Queries the server constantly –Gathers results –Outputs measures Until shutdown message received Python controller script –Starts a number of clients Manages test client ramp up –Gathers measures –Generates structured data for plotting

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 10/18 Test Cases Network Analysis Throughput analysis –CORAL Oracle Plugin –CORAL Frontier Plugin Directly to FroNtier Through SQUID

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 11/18 Network Analysis Test tool that checks network performance between multiple given client nodes and a single server –Generates TCP/IP traffic between clients and server –Each client shows an individual throughput Done using the netcat utility

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 12/18 Throughput analysis Frontier Server Up to 150 clients running against a single server (direct FroNtier server access, no Squid involved) Old version of FroNtier -> no compression! New Frontier version with compression

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 13/18 Throughput analysis Oracle, FroNtier and Squid Oracle vs Frontier Server vs Squid Cache Hits

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 14/18 Throughput analysis Notes on previous plots Direct Frontier access –NOZIP version: 3MBps (bottleneck is the database) –ZIP version: 0,3MBps (bottleneck is the server CPU) –ZIP version can get 10 times slower than NOZIP version Production setup with 3 FroNtier nodes will perform better! Squid access –NOZIP version: 8MBps –ZIP version: 14MBps user preceived throughput can be bigger than the network throughput (due to compression) –ZIP version can get 2 times faster than NOZIP version Oracle access - 1,34MBps –First guess, should be faster then FroNtier direct access in any case! –Second thought, each client is repeatedelly creating DB connections which is quite heavy for OraclePlugin and not so much for Frontier because frontier servlet reuses connections

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 15/18 Throughput analysis Some predictions CMS predicted real world access patern –10% direct FroNtier access –90% Squid access Factors (from previous slide) –SQUID_ZIP_time = SQUID_NOZIP_time / 2 –FRT_ZIP_time = FRT_NOZIP_time * 10 Some calculations: –Real NOZIP query time = 181% ZIP query time Prediciton test: –Tests run from FNAL nodes will produce new data in different network conditions

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 16/18 Future work Run multi client throughput tests: –Using experiment DB content –Using COOL generated queries –Changing the ratio of cached queries –From FNAL to measure the impact of a poorer network connection –Analyzing the Frontier server error rate with the new Frontier v3.3

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 17/18 Conclusions FroNtier is ready for production Some performance indicators were obtained –More real performance indicators should be now obtained from the production setup Test scripts developed –Next step: turn scripts easily reusable by others (Richard Hansen is running the test suite for ATLAS) Tests will continue!

Luis Ramos – September 13 th, 2006 FroNtier Stress Tests - 18/18 Questions? Ideas?