Quality Service Assurance: Monitoring, Testing, and Metrics
Eugene G. Yu, Lingjun Kang

Presentation transcript:

Quality Service Assurance: Monitoring, Testing, and Metrics
Eugene G. Yu, Lingjun Kang, Archie Warnock

Outline
- Monitoring
  - Service availability
- Testing
  - Integration
  - Status testing
  - Routine testing
- Metrics
  - Integrated
- Troubleshooting
  - Internal view and query for details (to track down problems)

Monitoring – current status
- Purposes
  - Service availability
- Current status
  - UptimeRobot
    - What: connector endpoints, CWIC service endpoints
    - Method: UptimeRobot
    - Frequency: hourly
  - GetCapabilities
    - What: GetCapabilities, OSDD, XML validity (content-level)
    - Method: Python script (Archie)
    - Frequency: every 4 hours
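
The Python monitoring script itself is not part of the slides. As a minimal sketch of what a content-level check might look like, the following assumes the endpoint answers a CSW GetCapabilities request with an XML document whose root element is named `Capabilities`. The function names `fetch` and `check_capabilities` and the root-element test are illustrative assumptions, not the team's actual script.

```python
import urllib.request
import xml.etree.ElementTree as ET


def fetch(url, timeout=30):
    """Return the response body of an HTTP GET as bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def check_capabilities(body):
    """Content-level check: the body must be well-formed XML whose root
    element is named 'Capabilities' (namespace ignored).
    Returns a (passed, detail) tuple."""
    try:
        root = ET.fromstring(body)
    except ET.ParseError as exc:
        return False, f"not well-formed XML: {exc}"
    local_name = root.tag.rsplit("}", 1)[-1]  # strip the '{namespace}' prefix
    if local_name != "Capabilities":
        return False, f"unexpected root element: {local_name}"
    return True, "ok"
```

A scheduled job could call `check_capabilities(fetch(url))` every 4 hours with the operator-supplied endpoint URL and send a notification whenever the first element of the result is `False`.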

Monitoring – requirements & future
- Requirements
  - Visualize, manage, and subscribe to the service availability status online
- Proposals
  - Online service availability portal/page
  - Functions
    - Monitor and archive the service availability status and statistics
    - Allow browsing and reviewing current status and archives online
    - Allow subscription for notification through email/SMS
  - Method
    - Store the status in a database for every check
    - Determine the frequency of status checks
    - Web services for notification subscription
    - Visual integration as a separate page in the Metrics portal
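
One possible way to store the status in a database for every check is sketched here with SQLite. The table name `status_check`, its columns, and the helper functions are hypothetical; the slides do not specify a schema or a database engine.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema: one row per availability check.
SCHEMA = """
CREATE TABLE IF NOT EXISTS status_check (
    id INTEGER PRIMARY KEY,
    endpoint TEXT NOT NULL,
    checked_at TEXT NOT NULL,   -- ISO 8601 timestamp, UTC
    is_up INTEGER NOT NULL,     -- 1 = available, 0 = unavailable
    detail TEXT
)
"""


def open_archive(path=":memory:"):
    """Open (or create) the status archive."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def record_check(conn, endpoint, is_up, detail="", checked_at=None):
    """Archive the result of a single status check."""
    checked_at = checked_at or datetime.now(timezone.utc).isoformat()
    conn.execute(
        "INSERT INTO status_check (endpoint, checked_at, is_up, detail) "
        "VALUES (?, ?, ?, ?)",
        (endpoint, checked_at, int(is_up), detail),
    )
    conn.commit()


def availability(conn, endpoint):
    """Fraction of archived checks for `endpoint` that were up,
    or None if no checks have been recorded."""
    row = conn.execute(
        "SELECT AVG(is_up) FROM status_check WHERE endpoint = ?", (endpoint,)
    ).fetchone()
    return row[0]
```

Keeping one row per check lets a status page show both the current state (the latest row) and archived statistics (aggregates such as `availability`) from the same table.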

Testing – current status
- Purposes
  - Assure integrity of software at different levels (unit, integration, system, regression, content)
- Current status
  - Unit testing
  - Integration testing
  - System testing
  - Regression testing
  - Routine functional testing

Testing – unit testing
- What
  - Testing the functionality of modules
- How
  - JUnit
    - GetRecordById (35)
    - GetRecords (80)
    - Dataset (number of datasets)
    - Wrapper (3)
- Who
  - Connector developer
- When
  - During development

Testing – integration testing
- What
  - Testing the functionality of integration into the mediator
- How
  - JUnit
  - Scripting
- Who
  - Mediator integrator/developer
- When
  - Integration of a connector into the mediator

Testing – system testing
- What
  - Testing the functionality of the system as a whole
- How
  - Script
  - Manual access (test pages, test clients)
- Who
  - Developer team
  - Testers?
    - Producers, users, clients (CWICSmart test instance, CWICStart test instance)
- When
  - After the system is deployed to the DEV and/or TEST servers

Testing – regression testing
- What
  - Testing the proper functionality of the system after a change is applied
- How
  - Script
  - Manual access (test pages, test clients)
  - Implicit monitoring: check validity of XML (in monitoring)
- Who
  - Developers
- When
  - After a change is applied and the program is deployed to the TEST server, before it is incorporated into the operational service

Testing – routine functional monitoring
- What
  - Testing the proper functionality of the system in operation (part of the routine monitoring)
- How
  - Script
- Who
  - Operational support
- When
  - Repeated testing at a predetermined frequency

Testing – requirements & future
- Requirements
  - Clear specifications for each type of testing
  - Automate the testing
- Proposals
  - Specifications
    - Provide guidelines for each type of testing to clarify what, how, who, and when to run it
    - Record and archive the testing records to keep track of all the information: what, how, who, when
  - Automation
    - Automated tools/cases/scripts
    - Frequency to run the automated tests
    - Record the testing (what, how, who, and when) in a database for diagnosis
    - Password-protected Web-based archive of tests and status to help trace problems
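
Recording what, how, who, and when for each automated test could look roughly like the sketch below. It assumes a test is any callable that raises on failure. The `test_run` table and the `run_and_record` helper are hypothetical names introduced only for illustration.

```python
import sqlite3
from datetime import datetime, timezone


def open_test_log(path=":memory:"):
    """Open (or create) the archive of test runs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS test_run ("
        "what TEXT, how TEXT, who TEXT, run_at TEXT, "
        "passed INTEGER, message TEXT)"
    )
    return conn


def run_and_record(conn, what, how, who, test_func):
    """Run test_func() and archive what/how/who/when plus the outcome.
    A test passes if it returns without raising an exception."""
    run_at = datetime.now(timezone.utc).isoformat()
    try:
        test_func()
        passed, message = True, ""
    except Exception as exc:  # a failure is recorded, not propagated
        passed, message = False, f"{type(exc).__name__}: {exc}"
    conn.execute(
        "INSERT INTO test_run VALUES (?, ?, ?, ?, ?, ?)",
        (what, how, who, run_at, int(passed), message),
    )
    conn.commit()
    return passed
```

Because failures are stored with their message rather than raised, a scheduled run can continue through the remaining tests and leave a complete record for later diagnosis.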

Metrics – current status
- Purposes
  - Summary of usage
- Current status
  - Updates
    - Integrated metrics
    - Remote search time
  - Outlier screening
    - Longer than usual: >500 seconds
    - Most likely caused by a power outage or anything else beyond the control of the computing service
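
The screening rule above amounts to a fixed threshold on remote search time. A minimal sketch, assuming search times are available as a list of seconds (the function names are illustrative and only the 500-second cutoff comes from the slide):

```python
OUTLIER_THRESHOLD_SECONDS = 500.0  # cutoff quoted on the slide


def screen_outliers(search_times, threshold=OUTLIER_THRESHOLD_SECONDS):
    """Split remote search times (seconds) into (kept, outliers).
    Values strictly greater than the threshold count as outliers."""
    kept = [t for t in search_times if t <= threshold]
    outliers = [t for t in search_times if t > threshold]
    return kept, outliers


def mean_search_time(search_times, threshold=OUTLIER_THRESHOLD_SECONDS):
    """Mean of the non-outlier search times, or None if none remain."""
    kept, _ = screen_outliers(search_times, threshold)
    return sum(kept) / len(kept) if kept else None
```

Returning the outliers alongside the kept values, rather than discarding them, keeps them available for later review.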

Metrics – requirements & future
- Requirements
  - Detecting and filtering outliers caused by factors beyond the control of the computing service
  - Visualizing and browsing archives of status
- Proposals
  - Service status page
    - Browsing archives of status
    - Displaying current status
    - Subscribing to alerts (anomaly detection and alerting)
  - Outlier detection and filtering
    - Outlier detection
    - Criteria to filter out

Troubleshooting – current status
- Purposes
  - Tracking down details
- Current status
  - Debugging
    - Searching the logging database manually
    - Examining logging files individually
    - Ad-hoc approach
  - Document for troubleshooting guide (?)
  - Notifications from UptimeRobot and Python monitoring scripts

Troubleshooting – requirements & future
- Requirements
  - Browsing the logs in detail
- Proposals
  - Secured service usage detail query
    - Password-protected page for querying the logging database
    - Visualize and report IP-specific usage
    - Visualize and report date-specific usage
    - Generic SQL-based query and reporting
    - Integrate it as one protected page in the metrics portal
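
The IP-specific and date-specific reports could be backed by simple aggregate queries. The sketch below assumes a hypothetical `request_log` table with `ip` and `requested_at` (ISO 8601 text) columns; the real logging database schema is not described in the slides, and the demo rows are made-up values using a documentation-only IP range.

```python
import sqlite3


def make_demo_log():
    """Build an in-memory log with made-up rows, for illustration only."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE request_log (ip TEXT, requested_at TEXT)")
    conn.executemany(
        "INSERT INTO request_log VALUES (?, ?)",
        [
            ("192.0.2.1", "2020-01-01T10:00:00"),
            ("192.0.2.2", "2020-01-01T11:00:00"),
            ("192.0.2.1", "2020-01-02T09:00:00"),
        ],
    )
    return conn


def usage_by_ip(conn):
    """Request counts per client IP, busiest first."""
    return conn.execute(
        "SELECT ip, COUNT(*) FROM request_log "
        "GROUP BY ip ORDER BY COUNT(*) DESC, ip"
    ).fetchall()


def usage_by_date(conn):
    """Request counts per calendar date, taken from the timestamp prefix."""
    return conn.execute(
        "SELECT substr(requested_at, 1, 10) AS day, COUNT(*) "
        "FROM request_log GROUP BY day ORDER BY day"
    ).fetchall()
```

Fixed report queries like these cover the two named views without accepting any SQL text from the page itself.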