Presentation on developments for the period May - Sep 2006 on Fabric Management C. S. R.C. Murthy, Rohitashva Sharma, Salim A. Pathan & Dinesh Sarode.

Slides:



Advertisements
Similar presentations
26/05/2004HEPIX, Edinburgh, May Lemon Web Monitoring Miroslav Šiket CERN IT/FIO
Advertisements

CCTracker Presented by Dinesh Sarode Leaf : Bill Tomlin IT/FIO URL
Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Performance Testing - Kanwalpreet Singh.
Database System Concepts and Architecture
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
1.1 Installing Windows Server 2008 Windows Server 2008 Editions Windows Server 2008 Installation Requirements X64 Installation Considerations Preparing.
SSH : The Secure Shell By Rachana Maheswari CS265 Spring 2003.
Interpret Application Specifications
The World Wide Web and the Internet Dr Jim Briggs 1WUCM1.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 19 Domain Name System (DNS)
Apache Jakarta Tomcat Suh, Junho. Road Map Tomcat Overview Tomcat Overview History History What is Tomcat? What is Tomcat? Servlet Container.
Platform as a Service (PaaS)
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
CVSQL 2 The Design. System Overview System Components CVSQL Server –Three network interfaces –Modular data source provider framework –Decoupled SQL parsing.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
Course 6421A Module 7: Installing, Configuring, and Troubleshooting the Network Policy Server Role Service Presentation: 60 minutes Lab: 60 minutes Module.
.NET, and Service Gateways Group members: Andre Tran, Priyanka Gangishetty, Irena Mao, Wileen Chiu.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
Beyond DHTML So far we have seen and used: CGI programs (using Perl ) and SSI on server side Java Script, VB Script, CSS and DOM on client side. For some.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
1 Web Server Administration Chapter 1 The Basics of Server and Web Server Administration.
LSC Segment Database Duncan Brown Caltech LIGO-G Z.
7/2/2003Supervision & Monitoring section1 Supervision & Monitoring Organization and work plan Olof Bärring.
Module 7: Fundamentals of Administering Windows Server 2008.
Winrunner Usage - Best Practices S.A.Christopher.
20411B 8: Installing, Configuring, and Troubleshooting the Network Policy Server Role Presentation: 60 minutes Lab: 60 minutes After completing this module,
Needs for Accessing Database To make your web site more dynamic and maintainable, you can display information on your web pages that are retrieved from.
Statistics Monitor of SPMSII Warrior Team Pu Su Heng Tan Kening Zhang.
Ramiro Voicu December Design Considerations  Act as a true dynamic service and provide the necessary functionally to be used by any other services.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
CVSQL 2 The Revenge of the SQL. The present Read-only access to CVS repository logs Language is a subset of SQL XML interface for returning results Built-in.
Oracle 10g Database Administrator: Implementation and Administration Chapter 2 Tools and Architecture.
_______________________________________________________________________________________________________________ E-Commerce: Fundamentals and Applications1.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Five Windows Server 2008 Remote Desktop Services,
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
Lesson Overview 3.1 Components of the DBMS 3.1 Components of the DBMS 3.2 Components of The Database Application 3.2 Components of The Database Application.
Installing, running, and maintaining large Linux Clusters at CERN Thorsten Kleinwort CERN-IT/FIO CHEP
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 19 Domain Name System (DNS)
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
ASP-2-1 SERVER AND CLIENT SIDE SCRITPING Colorado Technical University IT420 Tim Peterson.
Install, configure and test ICT Networks
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CF Monitoring: Lemon, LAS, SLS I.Fedorko(IT/CF) IT-Monitoring.
Lemon security. Previous security enhancements user lemon: lemon-db-admin-OraMon will create user lemon (Miro). - OraMon switches to user lemon at its.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Presentation on developments for the period Oct Feb 2007 C.S.R.C.Murthy, Salim A. Pathan, Rohitashva Sharma & Dinesh Sarode.
4.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 12: Implementing Security.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CC Monitoring I.Fedorko on behalf of CF/ASI 18/02/2011 Overview.
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
Retele de senzori Curs 1 - 1st edition UNIVERSITATEA „ TRANSILVANIA ” DIN BRAŞOV FACULTATEA DE INGINERIE ELECTRICĂ ŞI ŞTIINŢA CALCULATOARELOR.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Platform as a Service (PaaS)
WP4 meeting Heidelberg - Sept 26, 2003 Jan van Eldik - CERN IT/FIO
Platform as a Service (PaaS)
Consulting Services JobScheduler Architecture Decision Template
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Analysis models and design models
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Presentation transcript:

Presentation on developments for the period May - Sep 2006 on Fabric Management C. S. R.C. Murthy, Rohitashva Sharma, Salim A. Pathan & Dinesh Sarode

Lemon Oramon monitoring currently receives ~250 samples/sec from nearly 3000 machines Lemon Oramon monitoring currently receives ~250 samples/sec from nearly 3000 machines Future estimate: machines (including on-behalf entities) and ~1000 samples/sec. Future estimate: machines (including on-behalf entities) and ~1000 samples/sec. Security services using public/private keys to be integrated Security services using public/private keys to be integrated CPU & memory Stability determination of Oramon and Oracle require stress tests CPU & memory Stability determination of Oramon and Oracle require stress tests Targets: 2000 sample/sec, hosts ~10000 and 1024 bit digital signatures Targets: 2000 sample/sec, hosts ~10000 and 1024 bit digital signatures A few phases of the tests completed during Mar – Apr 2006 and work continued A few phases of the tests completed during Mar – Apr 2006 and work continued Oramon performance Enhancements

Full scale tests launched from ~2000 lxbatch hosts with virtual hosts, RSA1024/sha1 authentication and aggregate rate of 2000 samples/sec. Full scale tests launched from ~2000 lxbatch hosts with virtual hosts, RSA1024/sha1 authentication and aggregate rate of 2000 samples/sec. Large number of virtual hosts & higher speeds made oracle response very slow. Large number of virtual hosts & higher speeds made oracle response very slow. High demand of CPU & memory for OraMon as well as Oracle High demand of CPU & memory for OraMon as well as Oracle Ever-increasing memory consumption by oramon and crashing of the oramon due to malloc failure at these speeds Ever-increasing memory consumption by oramon and crashing of the oramon due to malloc failure at these speeds Conclusion: The whole system is not stable at higher rates and large number of virtual hosts Conclusion: The whole system is not stable at higher rates and large number of virtual hosts Contd…

Investigation findings: Investigation findings: Bulk update failures cause reading of entire latest table from database and comparing each record in queue with all the retrieved records Bulk update failures cause reading of entire latest table from database and comparing each record in queue with all the retrieved records Bug in the code resulting in non-convergence of latest tables data Bug in the code resulting in non-convergence of latest tables data Large interval partitioning at target speeds puts heavy load on primary key comparison during history table inserts Large interval partitioning at target speeds puts heavy load on primary key comparison during history table inserts Solutions implemented: Solutions implemented: Efficient handling of latest table updates using sql “MERGE” facility. Efficient handling of latest table updates using sql “MERGE” facility. 3 hour interval partitioning scheme at target speed 2000 samples/sec 3 hour interval partitioning scheme at target speed 2000 samples/sec Oramon performance Enhancements Contd…

Results: Results: Achieved stable operation of Oramon & Oracle at ~2000 samples/sec, ~10000 virtual hosts and public key based authentication. Achieved stable operation of Oramon & Oracle at ~2000 samples/sec, ~10000 virtual hosts and public key based authentication. Recommendations are made w.r.t current and future operating setup of Oramon and monitoring. Recommendations are made w.r.t current and future operating setup of Oramon and monitoring. TCP 2700 samples/sec, rsa-sha1, 9500 virtual hosts, 6hours CPU utilization - Oramon server VM - Oramon server

TCP 1600 samples/sec, rsa-sha1, 9500 virtual hosts, 48 hours CPU utilization - Oramon server VM – Oramon Server CPU utilization - Db server Disk IO write – DB server

Handling of duplicate samples while inserting into historical tables Handling of duplicate samples while inserting into historical tables Duplicates are inevitable due to redundant monitoring of on-behalf entities Duplicates are inevitable due to redundant monitoring of on-behalf entities History table insertion and latest table update fails if there are duplicate samples in the queue History table insertion and latest table update fails if there are duplicate samples in the queue Implemented duplicates removal and sub sequent insertion & update of samples to prevent loss of data Implemented duplicates removal and sub sequent insertion & update of samples to prevent loss of data OraMon failure recovery, duplicates handling and other bug fixes

Oramon recovery from temporary db connect failures Oramon recovery from temporary db connect failures Temporary connect failures of database should be handled properly Temporary connect failures of database should be handled properly Implemented connection recovery to overcome temporary db failures Implemented connection recovery to overcome temporary db failures Graceful shutdown of OraMon on receiving signals Graceful shutdown of OraMon on receiving signals Proper DB disconnection on oramon shut down desirable Proper DB disconnection on oramon shut down desirable Implemented graceful shutdown of oramon on signals Implemented graceful shutdown of oramon on signals Oramon crashes on long metric field lengths Oramon crashes on long metric field lengths A few new metrics have field lengths > 1000 bytes A few new metrics have field lengths > 1000 bytes Identified the cause for the bug and removed it Identified the cause for the bug and removed it Contd…

Encryption of samples is a RFE of lemon security Encryption of samples is a RFE of lemon security Implementation decided to be based on public/private key of the server Implementation decided to be based on public/private key of the server Asymmetric key encryption can’t be done on data size exceeding modulus of key Asymmetric key encryption can’t be done on data size exceeding modulus of key Typical modulus is 308 for 1024 bit key length Typical modulus is 308 for 1024 bit key length Recursive encryption/decryption is necessary to achieve full sample encryption Recursive encryption/decryption is necessary to achieve full sample encryption Public/Private key based lemon samples encryption

Module implemented to do sample stream encryption/decryption recursively Module implemented to do sample stream encryption/decryption recursively TCP/UDP transports require changes to maintain and transmit normal and encrypted samples TCP/UDP transports require changes to maintain and transmit normal and encrypted samples Identified where and what changes to make in the source and implementation soon to follow Identified where and what changes to make in the source and implementation soon to follow Public/Private key based lemon samples encryption Contd…

Wassh2 Re-engineering Parallel SSH execution tool currently being used in CERN-CC Parallel SSH execution tool currently being used in CERN-CC Executes shell commands on remote hosts in parallel Executes shell commands on remote hosts in parallel Communicates with CDB Communicates with CDB Written in different languages like Perl, Python, C and Haskell Written in different languages like Perl, Python, C and Haskell

Wassh2 - Task Re-implementation of existing Wassh Re-implementation of existing Wassh It involves - It involves - Comparison of existing Wassh with other open-source solutions Comparison of existing Wassh with other open-source solutions Preparation of a design document depending upon the comparison and existing RFEs Preparation of a design document depending upon the comparison and existing RFEs Isolation of Wassh and CDB communication Isolation of Wassh and CDB communication Implementation of Wassh2 Implementation of Wassh2

Wassh2 - Design Wassh2 Front-End SSM (Optional Site-specific module) Parallel Engine SSH SSH Servers Wassh- Decorate User Options Target Selection Options List of Hosts [Parallelism Options Shell Command Output Formatted Output

Wassh2 Front-End Responsible for handling options Responsible for handling options Enumerates basic target selection options like: Enumerates basic target selection options like: Targets with numeric wildcards e.g. lxb[01-10] Targets with numeric wildcards e.g. lxb[01-10] Read target list from file/STDIN Read target list from file/STDIN Uses optional Site Specific Module (SSM) to expand site specific nomenclature Uses optional Site Specific Module (SSM) to expand site specific nomenclature Communication with other modules Communication with other modules

Wassh2 – CERN-SSM Interface between CERN CDB and Wassh front-end Interface between CERN CDB and Wassh front-end Uses HTTP/XML based CDBSQL API to connect to CDB and gather XML information Uses HTTP/XML based CDBSQL API to connect to CDB and gather XML information Returns list of hosts depending upon target selection options Returns list of hosts depending upon target selection options If options > 1, options are ANDed to generate host list If options > 1, options are ANDed to generate host list

Wassh2 – Work done Design document is prepared Design document is prepared First prototype version of Wassh2 is developed First prototype version of Wassh2 is developed Modifications suggested by CERN team are being done Modifications suggested by CERN team are being done

Sensor-Exception Responsible for generating exception/alarms based on local metrics Responsible for generating exception/alarms based on local metrics Generated exceptions are used to display alarms in LAS (Lemon Alarm System) Generated exceptions are used to display alarms in LAS (Lemon Alarm System) Supports logical correlation of multiple metrics Supports logical correlation of multiple metrics

Sensor-Exception Work done in sensor-exception Work done in sensor-exception Support for on-behalf metric correlation Support for on-behalf metric correlation Generates exception for all on-behalf metrics Generates exception for all on-behalf metrics Support for alarm state management Support for alarm state management Exception/alarm can be turned off dynamically without stopping monitoring Exception/alarm can be turned off dynamically without stopping monitoring Support for minimum occurrences of exception Support for minimum occurrences of exception Suppresses transient alarms Suppresses transient alarms

Lemon XML-Gateway XML-Gateway is an interface to Lemon monitoring-repository (MR). XML-Gateway is developed to overcome shortcomings of existing SOAP interface. SOAP-Server crashes when number of samples exceeds ( For large queries).

Lemon XML-Gateway Development work includes:- Development work includes:- 1. Developing new methods for fetching data from monitoring-repository (oramon as well as flatmon). 2. Development of XML wrapper classes. 3. Development of gateway program to receive client requests. 4. XML-Schema to describe the XML data. Contd…

Raw XML data can be requested from XML- Gateway using HTTP protocol. Raw XML data can be requested from XML- Gateway using HTTP protocol. Performance of XML-Gateway is evaluated and found satisfactory. Performance of XML-Gateway is evaluated and found satisfactory. Next part is to develop Lemon XML-API in each of the following languages: PERL, PHP, C++, Python, and Java. Next part is to develop Lemon XML-API in each of the following languages: PERL, PHP, C++, Python, and Java. Lemon XML-Gateway Contd…

Lemon XML-Gateway Performance Graphs ( OraMon) Contd…

Lemon XML-Gateway Performance Graphs ( OraMon) Contd…

Lemon XML-Gateway Performance Graphs ( FlatMon) Contd…

Lemon XML-Gateway Performance Graphs ( FlatMon) Contd…

Lemon XML-API Lemon XML-API is decided to be developed in C++. Lemon XML-API is decided to be developed in C++. Later SWIG will be used to generate interface in other languages. Later SWIG will be used to generate interface in other languages. Lemon XML-API includes methods to fetch XML data from Lemon XML-Gateway. Lemon XML-API includes methods to fetch XML data from Lemon XML-Gateway. Also methods will be provided to query local data. Also methods will be provided to query local data.

Lemon XML-API Development work is going on for C++ API. Development work is going on for C++ API. libxml++ will be used for parsing XML on client side. libxml++ will be used for parsing XML on client side. Some C++ classes are developed for Lemon XML-API taking into account SWIG limitations. Some C++ classes are developed for Lemon XML-API taking into account SWIG limitations. Contd…

CCTracker Currently provides display only (read) interface Currently provides display only (read) interface Now designed to initiate updates thro’ CCTracker Client Now designed to initiate updates thro’ CCTracker Client Design makes clear separation between generic & site-specific components and is highly configurable Design makes clear separation between generic & site-specific components and is highly configurable

CCTracker Design CCTracker Client CCTracker Client Servlet DB CCService Site Specific Logic Linux Windows Direct XML SOAP Web Service Updates View / Read

Developments XML Schema defined XML Schema defined Java Object binding for the XML implemented with Castor API Java Object binding for the XML implemented with Castor API Database mapping of XML/Java objects with Castor JDO Database mapping of XML/Java objects with Castor JDO useful for development and testing of new features that initiates updates useful for development and testing of new features that initiates updates

Developments … CCService (a web service) CCService (a web service) Handles the database updates Handles the database updates Authenticate & authorize users for managing Computer Center Authenticate & authorize users for managing Computer Center Implements use cases of managing infrastructure objects Implements use cases of managing infrastructure objects functionality is also implemented as client & server exchanging xml messages thro’ http post functionality is also implemented as client & server exchanging xml messages thro’ http post

CCTracker Client – new features Default view shows empty & filled racks with different colors Default view shows empty & filled racks with different colors Infrastructure related to the logical model i.e. domain, cluster & sub cluster is now shown Infrastructure related to the logical model i.e. domain, cluster & sub cluster is now shown View properties (right mouse click) features implemented View properties (right mouse click) features implemented infrastructure objects – display location and attributes, other relevant information in tabbed view infrastructure objects – display location and attributes, other relevant information in tabbed view logical objects – linked to infrastructure objects logical objects – linked to infrastructure objects

Snapshots Castor Cluster in FIO Domain

Snapshots View Properties

Thank You…