© 2013 IBM Corporation Defect Analytics Marist/IBM Joint Studies Michael Gildein 09 June 2014.

Slides:



Advertisements
Similar presentations
Summary XBRL Challenge Objective: Tools that rely on XBRL data, e.g., tool that extracts data for multi-company comparison via desktop application; or.
Advertisements

Pentaho Open Source BI Goldwin. Pentaho Overview Pentaho is the commercial open source software for Business Pentaho is the commercial open source software.
C6 Databases.
Data Manager Business Intelligence Solutions. Data Mart and Data Warehouse Data Warehouse Architecture Dimensional Data Structure Extract, transform and.
® IBM Software Group © 2010 IBM Corporation What’s New in Profiling & Code Coverage RAD V8 April 21, 2011 Kathy Chan
University of Florida Incident Tracking and Reporting Kathy Bergsma
CX Analytics: Best Practices in Measuring For Success
Visual Studio Online. What it Provides Visual Studio Online, based on the capabilities of Team Foundation Server with additional cloud services, is the.
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
OLAP Cubes and Pivot Tables Leveraging the Power of a Microsoft EPM Solution EPM Customization Series Part 1 February 21 st, 2007 Brendan Giles, PMP, MCP.
Chapter 3 Database Management
Components and Architecture CS 543 – Data Warehousing.
Data Warehouse success depends on metadata
Microsoft SharePoint 2013 SharePoint 2013 as a Developer Platform
Altosoft Copyright ® 2012 altosoft.com8/3/2012 Sandy Follin, Sr. Account Executive Steve Schrader, Sr. Sales Engineer.
Understanding Analysis Services Architecture. Microsoft Data Warehousing Overview OLTP Source DTS DW Storage Analysis Services Clients OLE DB for OLAP,
ETL By Dr. Gabriel.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
Accelerating Product and Service Innovation © 2013 IBM Corporation IBM Integrated Solution for System z Development (ISDz) Henk van der Wijk 23 Januari.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
L/O/G/O Metadata Business Intelligence Erwin Moeyaert.
SharePoint 2010 Business Intelligence Module 2: Business Intelligence.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
What’s New for Business and IT Cognos 8 BI Version 8.4.
1 The following presentation is from the Oracle Webcast “What’s New in P6 EPPM Release 8.1.” As a partner, you may not use the Oracle Power Point template,
Activity Running Time DurationIntro0 2 min Setup scenario 2 2 min SQL BI components & concepts 4 5 min Data input (Let’s go shopping) 9 7 min Whiteboard.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
OLAP Cubes and Pivot Tables Leveraging the Power of a Microsoft EPM Solution EPM Customization Series Part 1 February 21 st, 2007 Brendan Giles, PMP, MCP.
Software Test Metrics When you can measure what you are speaking about and express it in numbers, you know something about it; but when you cannot measure,
Using SAS® Information Map Studio
© 2008 IBM Corporation ® IBM Cognos Business Viewpoint Miguel Garcia - Solutions Architect.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Corporate Performance Management “ Delivers a complete line of sight with best of breed solutions at each point ”
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
Introduction to Software Project Estimation I (Condensed) Barry Schrag Software Engineering Consultant MCSD, MCAD, MCDBA Bellevue.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
Project Portfolio Management Business Priorities Presentation.
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
Hints and Tips on Managing IBM Cognos Planning Models Effectively Pat Bann / Paresh Kerai Cognos Planning Consultants.
7 Strategies for Extracting, Transforming, and Loading.
Rajesh Bhat Director, PLM Analytics Applications
TRAC Software for creation of supplier contract risk profile
Performance Testing Test Complete. Performance testing and its sub categories Performance testing is performed, to determine how fast some aspect of a.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Summary Cognos 8 BI. Objectives  In this module we will examine:  major innovations in Cognos 8  review of new functionality in Cognos 8  customer.
© 2013 IBM Corporation Accelerating Product and Service Innovation Service Virtualization Testing in Managed Environments Michael Elder, IBM Senior Technical.
Cognos 8: Software for Management Information System.
Managing Data Resources File Organization and databases for business information systems.
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
Data Mining & OLAP What is Data Mining? Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data.
Open Governance Platform
Build Fundamentals and Continuous Integration
Hadoop-based Distributed Web Crawler
Databases and Information Management
IBM Content and Predictive Analytics for Healthcare How it works
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Business Intelligence for Project Server/Online
MANAGING DATA RESOURCES
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
DAT381 Team Development with SQL Server 2005
Technical Capabilities
Analytics, BI & Data Integration
Office 365 Development July 2014.
Mark Quirk Head of Technology Developer & Platform Group
What's New in eCognition 9
Presentation transcript:

© 2013 IBM Corporation Defect Analytics Marist/IBM Joint Studies Michael Gildein 09 June 2014

© 2013 IBM Corporation2 Agenda  Team  Background  Conventional Defect Analysis & Tracking  Data  Original Project  Infrastructure  Defect Data Warehouse  Current Research  Future Defect Analytics - Marist/IBM Joint Studies

© 2013 IBM Corporation3 Team Defect Analytics - Marist/IBM Joint Studies Michael Gildein (~2 d/w):  Team Lead, Design, Data Research, Cognos Reports, Server Administrator Emily Metruck (Spare Time):  Cognos Reports Greg Cremmins (~10 hrs/wk):  Marist/IBM Joint Studies Intern  Product Research, Research Experiments  ECRL Sandbox Environment

© 2013 IBM Corporation4 Background Defect Analytics - Marist/IBM Joint Studies IBM z/OS Operating System  Mature software product – 50 Years!  Multiple releases (V1R11 ~ Current) Extensive Testing  Unit Test  Function Test  System Test  Performance Test  Integration Test SVT - Every potential defect recorded  Defect record management system –> Adequate # of records

© 2013 IBM Corporation5 Conventional Defect Analysis & Tracking  Root Cause Analysis (RCA)  Defect Density –Defects/KLoc (New & Changed)  Defect Tracking Trends –Normal Distribution (Bell Curve) –Test Execution Comparison  Defect Concentrations  Defect Pooling (Disjoint Pool A, B) –DefectsTotal = ( DefectsA * DefectsB ) / DefectsA&B  Defect Seeding –IndigenousDefectsTotal = ( SeededDefectsPlanted / SeededDefectsFound ) * IndigenousDefectsFound  Defect Source Identification and Localization  Orthogonal Defect Classification (ODC) Defect Analytics - Marist/IBM Joint Studies

© 2013 IBM Corporation6 Background Defect Analytics - Marist/IBM Joint Studies Marist/IBM JOINT STUDIES:  GA Released Scrubbed Data  ECRL Research Environment  Analytics Giveback  Research Intern Automatically predict defect validity upon record creation in real time to leverage historic empirical test data to drive a smarter test process. ORIGINAL MISSION EXPANDED MISSION Provide defect analytics, prediction capabilities and research data correlations in order to leverage historic empirical development data to drive a smarter development process.

© 2013 IBM Corporation7 Data Defect Analytics – Marist/IBM Joint Studies Defect Management System  Consistent # of Records per Release  zOS V1R11, V1R12, V1R13, V2R1

© 2013 IBM Corporation88 Validity Prediction Defect Analytics - Marist/IBM Joint Studies ACCURACY:  >70% Overall on average  ~92% confidence on average for a true positive  Academic field research averages ~70% accuracy in defect predictions RESULTS:  Proves strong correlation in defects  Establish analytics infrastructures & ETL process  Process retro-active reporting  Real time monitoring  Defect fields (55) - Component, severity, tester,…  360 Different algorithm variations executed  Aggregate voting scheme  Hourly data pulls and predictions PROCESS: GOAL: Defect prediction of valid or invalid

© 2013 IBM Corporation9 Process Flow Defect Analytics - Marist/IBM Joint Studies

© 2013 IBM Corporation10 Physical Infrastructure Clients Windows Server IBM Data Studio IBM DB2 IBM SPSS Modeler Server IBM Cognos Server Defect Record Management Servers Additional Input Data Servers System z Host IBM SPSS Modeler Client IBM Cognos Framework Manager IBM Cognos Metric Designer zLinux Scripts & Data Crawlers Defect Analytics – Marist/IBM Joint Studies End User Web Interface

© 2013 IBM Corporation11 Products Active  IBM DB2  IBM HTTP Server  IBM WebSphere  IBM SPSS Modeler  IBM Cognos Business Intelligence Server  IBM Cognos Insight  Eclipse  Homegrown Data Crawlers & Apps  Make relationships Research  IBM SPSS Modeler Collaboration and Deployment Services (CnDs)  IBM SPSS Text Analytics  IBM SPSS Catalyst  IBM Cognos TM1  IBM InfoSphere BigInsights  IBM Classification Module (ICM)  IBM Content Analytics Defect Analytics – Marist/IBM Joint Studies

© 2013 IBM Corporation12 Predictive Retro-Active Real Time Cross-phase analysis Development Testing Field Contribution breakdown Hot modules Defect arrival Team discovery Root Cause (ODC) Service time + More Services Predictive Validity prediction In release Cross release Retro-Active Real Time Monitor defect queues High severity & hot defects Defect backlogs Inactivity thresholds Advanced duplicate searching Defect trends => Test plan adjustments Defect Analytics – Marist/IBM Joint Studies

© 2013 IBM Corporation13 Defect Warehouse Defect Analytics - Marist/IBM Joint Studies  Relational Model  Extract Transform and Load (ETL)  Handle data issues  Map information across defect systems  Extract history, notes  Correlate records  Run calculations – Service time

© 2013 IBM Corporation14 Defect OLAP Cube Defect Analytics - Marist/IBM Joint Studies  11 Dimensions  Open Date  Closed Date  Release  Component  Test Phase  SubPhase  Answer  Root Cause  Trigger  Severity  State  Answer  2 Metrics  ID, Age  IBM TM1 - In Memory  Systematic procedure for frequent defect summary analysis  Speed - 5+ min to < 5 sec  Resource consumption decrease  Recalculating aggregates every report versus on updates Queries Scheduler, V1, 06/09/2000 => (D12345, 36 Days) All Components, All Releases, 06/09/2000 => 246 Days

© 2013 IBM Corporation15 Defect Unification Mapping Defect Analytics - Marist/IBM Joint Studies  Test phases use different defect systems  Multiple product us different defect systems  Mapping between systems  Cross product trends

© 2013 IBM Corporation16 Current Research Defect Analytics - Marist/IBM Joint Studies GOALS  Increase Validity Prediction Accuracy  When do we have a stable model? (Time || #)  Model update frequency  Tester note text analysis  Time under test OTHER TASKS  Defect OLAP cube expansion  BigInsights & MapReduce exploration  Automated trend analysis

© 2013 IBM Corporation17 Dump Scoring Workload Analytics Future Defect Density Prediction Defect Analytics - Marist/IBM Joint Studies

© 2013 IBM Corporation18 Workload Analytics Activity thresholds  Defects Which  Defects Combinations  Defects Coverage Need updates? Frequency Build updates recommend workload Dump Scoring Real time dump analysis Smart matching Search on dump information pieces Trends in dumps Score dump on likelihood of problem Defect Density Prediction Expected defect counts per release/component More accurate test planning Identify risks Cross test phase analysis SVT Analytics Future Research Defect Analytics - Marist/IBM Joint Studies

© 2013 IBM Corporation Questions? Michael Gildein 9 June 2014